PSYCHOTHERAPISTS’ APPROACH-AVOIDANCE
REACTIONS TO PATIENTS’ EXPRESSIONS 
OF HOSTILITY 


ALBERT BANDURA, DAVID H. LIPSHER, AND PAULA E. MILLER
Stanford University


In a previous study (Bandura, 1956), an 
inverse relationship was found between thera- 
pists’ anxiety level and ratings of their psy- 
chotherapeutic competence. The purpose of 
the present investigation was to determine, 
through a response by response analysis of 
actual patient-therapist interactions, the spe- 
cific ways in which therapists’ anxieties may 
affect their psychotherapeutic work. Since 
hostility conflicts tend to be present to some 
degree in all patients, it was decided to single 
out for study the patient-therapist interac- 
tion around the expression of hostility. 

Although it may be possible to specify 
the conditions essential for effecting certain 
changes in a patient’s behavior, whether or 
not a particular therapist can produce these 
conditions may be partly determined by his 
own personality characteristics. A minimal 
condition for the resolution of a patient’s 
conflicts would seem to be that the patient’s 
conflictive feelings are permitted to occur 
within the therapy situation. If, however, 
certain classes of expression are anxiety pro- 
voking for the therapist, he is less likely to 
permit or encourage the patient to express 
himself in these ways. Moreover, when the 
patient does express tendencies that are 
threatening to the therapist, the anxieties he 
elicits often motivate the therapist to avoid 
a continuation of the anxiety producing in- 
teraction (Cutler, 1958; Little, 1951; Lubor- 
sky, 1952; Reich, 1951; Rigler, 1956). In 
line with this reasoning, it was predicted that 
therapists who display high hostility anxiety 
would be more likely to respond to patients’
hostility with avoidance reactions and less
likely to respond with approach reactions than
would therapists who show low hostility
anxiety.


Hypotheses were also advanced concerning 
the effect of the therapists’ approach and 
avoidance reactions on the patients’ behavior. 
It was assumed that the therapists’ approach 
reactions would constitute positive reinforce- 
ments for the patients and that such reac- 
tions would therefore serve to encourage the 
patients to express feelings of hostility. It was 
further assumed that the therapists’ avoid-
ance reactions would serve either as nonre-
wards or as negative reinforcements and
would have the effect of decreasing or in- 
hibiting the expression of hostile feelings. It 
seemed possible in some instances, particu- 
larly where the patients are strongly insti- 
gated to hostility, that the therapists’ nega- 
tive responses would lead the patients to 
displace the hostility to some other object 
rather than to discontinue all expressions of 
hostility. It was therefore predicted that if 
the patients were to continue in the hostility 
topic following the therapists’ avoidance re- 
sponses they would change the object of their 
hostility more often than they would follow- 
ing therapists’ approach responses. 


METHOD 


Subjects. The tape recorded interviews of 17 par- 
ents who were undergoing psychotherapy at a parent- 
child clinic provided the data for this study. Twelve 
different therapists, 2 females and 10 males, were 
represented. All were advanced clinical psychology 
students, and all except 2 were undergoing individual 
psychotherapy at the time of the study. 

Therapists’ personality characteristics. The thera- 
pists’ personality characteristics were assessed through 
ratings provided by four clinical psychology staff 
members who had extensive contact, both socially 
and professionally, with the student therapists. Each 
therapist was rated independently by three staff 
members on eight 5-point rating scales. 

Hostility anxiety, which was central to this study,
was assessed in three ways. In the first place, it was
assumed that with high anxiety there would be a 
greater amount of indirect than of direct hostility so 
that, if hostility were expressed at all, its forms 
would more likely be indirect. Ratings were conse- 
quently made of the amount of direct and of the 
amount of indirect hostility that the therapists typi- 
cally displayed in their interactions with others. The 
mere absence of hostility, whether of a direct or in- 
direct form, would not in itself necessarily indicate 
the presence of anxiety. For instance, one might ex- 
pect a person to show little hostility if he has ex- 
perienced little frustration or instigation to hostility. 
On the other hand, if a person strongly inhibits any 
expression of hostility in the face of strong instiga- 
tion, the presence of anxiety would seem to be in- 
dicated. Therefore, ratings were also made of the 
degree to which the therapists inhibited any expres- 
sion of hostility. The hostility inhibition scale is 
given below. 

This scale deals with the readiness with which the 
S gets irritated, annoyed, angered when frustrated, 
provoked or thwarted, e.g., when delayed, incon-
venienced, criticized, opposed, etc.


1. Very easily irritated and angered: expresses
   irritation, annoyance, anger at the slightest
   frustration

2. Easily irritated and angered: expresses irritation,
   annoyance, anger even when mildly frustrated,
   provoked, or thwarted

3. Expresses irritation, annoyance, anger when
   moderately frustrated, provoked, or thwarted

4. Difficult to arouse to anger or to irritate:
   expresses irritation, annoyance, anger only when
   strongly frustrated, provoked, or thwarted

5. Very difficult to arouse to anger or to irritate:
   practically never expresses irritation, annoyance,
   or anger when strongly frustrated, provoked, or
   thwarted.


The presence of hostility anxiety was thus inferred
from the therapist’s tendency to inhibit hostile reac-
tions in the face of high instigation and from his
tendency to respond with predominantly indirect
forms of hostility.

In addition to the three measures of hostility, the 
therapists were also rated on several scales measur-
ing dependency behavior: amount of help seeking,
amount of approval seeking, amount of dependency
inhibition; they were also rated on a scale measuring 
the extent of sex inhibition; and on a scale measur- 
ing warmth. These latter measures were not used as 
a basis for any predictions. 

Procedure for rating patient-therapist interaction. 
From the total number of interviews available for 
the 17 patients, 110 interviews (approximately two 
thirds of the total) were randomly selected for analy- 
sis. Coding of these interviews was done by two 
judges who worked independently on different inter- 
views. However, in order to provide an estimate of 
the reliability of the judges’ ratings, a sample of 20 
interviews, randomly selected from the pool of 110, 
was independently coded by both judges. In the case 


of these 20 interviews, the mean coding was used in 
the further analyses. 

In order to insure independence of the coding, the 
judges had no knowledge of the ratings of the thera-
pists’ personality characteristics.

Scoring unit. The unit scored was an interaction 
sequence beginning with a patient statement, the 
therapist's response, and the immediately following 
patient response. An example is given below: 


P.: I had more work to do than I felt I should 
do, that I had to do, work which I felt was unim-
portant, wasn't necessary to be done. I was told
to do it. I didn’t think it was right or necessary 
at all. 

T.: How do you feel when you have to do some- 
thing he wants you to do? 

P.: Well, I don’t like it but I do it. 


Patient response category. The patient category 
scored was hostility which was defined as any ex- 
pression of dislike, resentment, anger, antagonism, 
opposition, or of critical attitudes. In order to study
changes in the object toward whom the patient di-
rected hostility, this category was further divided
into six subcategories according to the referent: (a)
spouse, (b) children, (c) parents, (d) self, (e) thera-
pist, and (f) other persons or objects.

Therapist response categories. The therapists’ re-
sponses were divided into two general classes: ap-
proach and avoidance reactions.

Approach reactions included verbal responses that
were primarily designed to elicit from the patient
further expressions of hostile feelings, attitudes, and
behavior. The following classes of responses were in-
cluded in the approach category:


a. Approval. Therapist sanctions and expresses ex-
plicit agreement with the patient’s hostile feelings
or behavior.

P.: I don’t know but I got so mad when he
came home.
T.: Under the circumstances, how could you
have felt otherwise?

b. Exploration. Therapist asks for further clarifi-
cation, elaboration, and detailing of the patient’s
hostile feelings or behavior.

P.: For some reason I had a bad day, just
couldn’t fall asleep. I just felt aggravated.
T.: Aggravated? Can you tell me a little about
that?

c. Instigation. Therapist shifts the discussion from
a nonhostility topic to the hostility topic or rein-
troduces the hostility expressed by the patient ear-
lier in the interview or in previous interviews.

P.: I was pretty fatigued when I got home. I
went to bed right off the bat but I just tossed and
turned. When I woke up in the morning I had a
pain in the leg. I went to the doctor and he gave
me a shot.
T.: Yes. Let’s get back again to that evening and
that irritated feeling you had.

d. Reflection. Therapist repeats or restates the pa-
tient’s hostile feelings.

P.: And when the kids don’t listen to me it rubs
me the wrong way. I lose my temper.

T.: You get mad.
e. Labeling. Therapist names the patient's feelings, 
attitudes or behavior as hostile; points out pat- 
terns in the patient's feelings or behavior; suggests 
relationships between present hostile feelings and 
behavior and past experiences. 

P.: I've picked up the tabs on that girl all my 
life. Always paying, always costing money. I think 
I resented her a good deal. 

T.: Maybe some of your resentment for Joyce is 
partly displaced from your mother in that your 
mother shipped her to you to take care of. You 
were mad at your mother as well as Joyce. 


Avoidance reactions included those verbal responses 
designed to inhibit, discourage, or divert the pa- 
tients’ hostile expressions. The following classes of 
responses were included in the avoidance category: 


a. Disapproval. Therapist is critical of the patient's
having felt or behaved in a hostile, aggressive
manner.

P.: So I blew my top and hit her. 

T.: Just for that you hit her? 
b. Topical transition. Therapist changes the dis- 
cussion from the hostility topic to a nonhostility 
topic. 

P.: My mother annoys me. 

T.: How old is your mother? 
c. Silence. Therapist makes no verbal response for 
4 sec. or more after the patient has expressed hos-
tility.

P.: I just dislike it at home so much at times. 

T.: [Silence] 

P.: So I just don't know what to do. 
d. Ignoring. Therapist responds to the content of 
the patient’s response but ignores the hostile affect. 

P.: I lose my temper over his tardiness. 
T.: What are the results of his being tardy? 
e. Mislabeling. Therapist labels as nonhostile feel- 
ings that are clearly hostile. 

P.: When are you going to give me the results 
of those tests? I think I’m entitled to know. 

T.: You seem to be almost afraid to find out. 


Unclassified responses included unscorable utter- 
ances and responses that were irrelevant to the above 
subcategories. 
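
The category system just described can be written down as a small data structure. The following is only an illustrative sketch: the names `TherapistResponse`, `InteractionSequence`, and `response_class` are hypothetical and are not part of the original coding manual.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class TherapistResponse(Enum):
    # Approach subcategories
    APPROVAL = "approval"
    EXPLORATION = "exploration"
    INSTIGATION = "instigation"
    REFLECTION = "reflection"
    LABELING = "labeling"
    # Avoidance subcategories
    DISAPPROVAL = "disapproval"
    TOPICAL_TRANSITION = "topical transition"
    SILENCE = "silence"
    IGNORING = "ignoring"
    MISLABELING = "mislabeling"
    # Unscorable or irrelevant responses
    UNCLASSIFIED = "unclassified"


APPROACH = frozenset({
    TherapistResponse.APPROVAL, TherapistResponse.EXPLORATION,
    TherapistResponse.INSTIGATION, TherapistResponse.REFLECTION,
    TherapistResponse.LABELING,
})
AVOIDANCE = frozenset({
    TherapistResponse.DISAPPROVAL, TherapistResponse.TOPICAL_TRANSITION,
    TherapistResponse.SILENCE, TherapistResponse.IGNORING,
    TherapistResponse.MISLABELING,
})


def response_class(response: TherapistResponse) -> str:
    """Map a subcategory onto its general class."""
    if response in APPROACH:
        return "approach"
    if response in AVOIDANCE:
        return "avoidance"
    return "unclassified"


@dataclass(frozen=True)
class InteractionSequence:
    """One scoring unit: hostile patient statement, therapist response,
    and the immediately following patient response."""
    patient_object: str                    # e.g. "spouse", "children", "parents"
    therapist_response: TherapistResponse
    followup_hostile: bool                 # did hostility follow the response?
    followup_object: Optional[str] = None  # referent of the following hostility

    def object_changed(self) -> bool:
        # A change of object is only defined when hostility continues.
        return self.followup_hostile and self.followup_object != self.patient_object
```

Under this sketch, the exchange "My mother annoys me." / "How old is your mother?" would be stored with `TherapistResponse.TOPICAL_TRANSITION`, which `response_class` maps to "avoidance".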

In coding the interviews, the record was played
and each patient statement was judged as either hos-
tile or nonhostile. If a statement was rated as hos-
tile, both the therapist response and the immediately
succeeding patient response were scored. The object of
the patient’s hostility was also noted.

It would have been desirable to have used sepa-
rate sets of judges for rating the therapist and pa-
tient responses respectively. This could have been ac-
complished only through the use of typescripts from
which therapist or patient statements had been de-
leted (Dittman, 1952). On the other hand, direct re-
cordings had certain advantages. In many instances
a patient’s hostility is communicated more through
voice cues than through content cues and had the
ratings been made from typescripts some of the more
subtle hostile responses would have been lost. As a
partial control, however, the judge rated the thera-
pist response before rating the immediate response it
elicited in the patient.


TABLE 1

INTERJUDGE AGREEMENT IN SCORING RESPONSE UNITS

                                                 Number
Category                                         of Units
Perfect agreement                                  261
Single discrepancy
  Therapist’s response                              72
  Hostility following the therapist’s response      16
  Object of hostility                                2
Two discrepancies
  Therapist’s response and object                    5
  Therapist’s response and hostility                 2
  Object and hostility                               1
  Two discrepancies on object of hostility           2
Unit scored by one judge only                       81


The scores yielded by the content analysis were 
the number of interaction sequences in which thera-
pists responded with approach or avoidance reactions
to the patients’ hostility and the number of sequences 
in which hostility followed therapists’ approach and 
avoidance, respectively. The hourly totals were then 
summed for each therapy case. 
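
As a rough illustration of how such scores could be tallied, the sketch below counts approach and avoidance reactions for one case and the share of each that was followed by hostility. The function names and the toy input are assumptions made for the example; they are not data from the study.

```python
from collections import Counter


def tally_case(sequences):
    """Tally one therapy case.

    `sequences` is an iterable of (response_class, followup_hostile) pairs,
    where response_class is "approach" or "avoidance".
    """
    counts = Counter()
    for response_class, followup_hostile in sequences:
        counts[response_class] += 1
        if followup_hostile:
            counts[response_class + "_followed_by_hostility"] += 1
    return counts


def percent_followed(counts, response_class):
    """Percentage of responses of one class that were followed by hostility."""
    total = counts[response_class]
    if total == 0:
        return None  # no responses of this class in the case
    return 100.0 * counts[response_class + "_followed_by_hostility"] / total


# Toy input, purely illustrative.
toy_case = [("approach", True), ("approach", True),
            ("avoidance", False), ("avoidance", True)]
toy_counts = tally_case(toy_case)
```

Hourly tallies of this kind can then be added together, since `Counter` objects support `+`.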


RESULTS 
Interjudge Reliability 


Since the unit of analysis is an interaction 
sequence, the most relevant index of reliabil- 
ity is the degree of agreement between the 
judges in coding the response units. There are 
a number of ways in which discrepancies can 
occur. The judges may disagree as to whether 
or not the patient is expressing hostility and/ 
or disagree as to the object toward whom the 
hostility is directed. They may differ in scor- 
ing the therapist's response. Finally, they may 
disagree on whether or not hostility is ex- 
pressed in the immediately following patient 
response and/or in the object of the patient’s 
hostility. 




TABLE 2

FREQUENCY OF SPECIFIC INTERJUDGE DISCREPANCIES

                                       Number
Categories                             of Units
Therapists’ responses
  Reflection–Labeling                     13
  Reflection–Exploration                  11
  Reflection–Approval                      2
  Exploration–Instigation                  7
  Exploration–Labeling                     +
  Labeling–Approval                        2
  Ignoring–Topical transition              4
  Ignoring–Mislabeling                     s
  Mislabeling–Topical transition           1
  Ignoring–Exploration                    22
  Ignoring–Labeling                        5
  Ignoring–Reflection                      9
  Exploration–Topical transition           1
Object of hostility
  General–self                             4
  General–therapist                        3
  General–spouse                           4
  General–child                            1
  Therapist–self                           3
  Therapist–spouse                         3
  Therapist–child                          1
  Spouse–child                             3

The degree of interjudge agreement in cod- 
ing the response units was obtained from the 
20 interviews that the two judges rated in 
common. Comparison of the ratings revealed 
that in 261 of the units scored the judges 
were in perfect agreement. In 100 of the units 
they showed only minor discrepancies, most 
of which involved differences in their rating 
of the therapists’ responses (Table 1).
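
The counts in this paragraph can be turned into simple proportions of agreement. The sketch below assumes, as one possible convention, that the 81 units scored by only one judge are kept in the denominator; the function name is hypothetical.

```python
def agreement_summary(perfect, minor, one_judge_only):
    """Proportions of agreement over all units scored by either judge."""
    total = perfect + minor + one_judge_only
    return {
        "total_units": total,
        "perfect": perfect / total,
        "perfect_or_minor": (perfect + minor) / total,
    }


# Counts reported in the text and in Table 1.
summary = agreement_summary(perfect=261, minor=100, one_judge_only=81)
# summary["perfect"] is about 0.59 and summary["perfect_or_minor"] about 0.82
```

Dropping the single-judge units from the denominator would raise both figures, so the convention chosen should be stated alongside any index of this kind.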

The frequencies with which specific types of
scoring disagreements occurred are presented
in Table 2. Many of the discrepancies in cod-
ing the therapists’ responses involved cate-
gories which overlap to some degree as, for
example, reflection and labeling, which are
responses on an interpretative continuum.
More marked discrepancies occurred, how-
ever, in 37 units where one judge rated the
therapists’ responses as avoidance whereas
the other judge rated them as some form of
approach reaction.

The reliability of the ratings of the thera- 


pists’ personality characteristics was esti- 
mated by the use of an analysis of variance 
technique developed by Ebel (1951). The re- 
sults are given in Table 3. 
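
For readers unfamiliar with the technique, the sketch below computes an intraclass reliability from a one-way analysis of variance, which is one common form of the approach Ebel discusses. Which variant the present ratings were analyzed with is not stated, so the choice of the one-way form, the function name, and the toy ratings are all assumptions.

```python
def intraclass_reliability(ratings):
    """One-way intraclass reliability.

    `ratings` is a list with one entry per person rated, each entry being
    a list of k ratings. Returns (reliability of a single rating,
    reliability of the average of k ratings).
    """
    n = len(ratings)
    k = len(ratings[0])
    if any(len(row) != k for row in ratings):
        raise ValueError("every person needs the same number of ratings")

    grand_mean = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]

    # Mean square between persons and mean square within persons (error).
    ms_between = k * sum((m - grand_mean) ** 2 for m in row_means) / (n - 1)
    ms_within = sum(
        (x - m) ** 2 for row, m in zip(ratings, row_means) for x in row
    ) / (n * (k - 1))

    single = (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)
    average = (ms_between - ms_within) / ms_between
    return single, average


# Toy ratings for three persons, each rated by three judges (illustrative only).
toy = [[1, 2, 3], [2, 3, 4], [4, 5, 3]]
single_r, average_r = intraclass_reliability(toy)
```

The second value is the first stepped up by the Spearman-Brown formula for k raters, which is why the reliability of averaged ratings exceeds that of a single rating whenever the latter is positive.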


Therapists’ Characteristics and the Use of
Approach-Avoidance Responses 


Of the 4734 interaction sequences that were
scored, 1619 (approximately 34%) were ones
in which the patients expressed hostility. The
therapists, in turn, responded with 938 ap-
proach and 859 avoidance reactions. In or-
der to test whether differences in therapists’
personality characteristics were related to the
relative frequency of their approach or avoid-
ance interventions, the therapists were di-
chotomized into a high scoring and low scor-
ing group on each of the personality scales.
The measure of the therapists’ relative pref-
erence for approach or avoidance reactions
was based on the ratio of the number of ap-
proach to the number of avoidance reactions.
The significance of the differences between
the two groups was then tested by means of
the Mann-Whitney U test (Siegel, 1956).
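
The comparison described here can be sketched as follows: each therapist receives an approach-to-avoidance ratio, and the ratios of the high and low scoring groups are compared with the Mann-Whitney U statistic. With groups this small an exact probability can be obtained by enumerating every possible split of the pooled scores. The function names and the ratios below are hypothetical and serve only to show the mechanics.

```python
from itertools import combinations


def mann_whitney_u(high, low):
    """U for `high`: number of (h, l) pairs with h > l, ties counted as 1/2."""
    u = 0.0
    for h in high:
        for l in low:
            if h > l:
                u += 1.0
            elif h == l:
                u += 0.5
    return u


def exact_one_sided_p(high, low):
    """P(U >= observed U) when every split of the pooled scores is equally likely."""
    pooled = list(high) + list(low)
    observed = mann_whitney_u(high, low)
    indices = range(len(pooled))
    hits = total = 0
    for chosen in combinations(indices, len(high)):
        chosen_set = set(chosen)
        group_1 = [pooled[i] for i in chosen]
        group_2 = [pooled[i] for i in indices if i not in chosen_set]
        total += 1
        if mann_whitney_u(group_1, group_2) >= observed:
            hits += 1
    return hits / total


# Hypothetical approach/avoidance ratios for six high and six low scorers.
high_group = [1.9, 1.6, 1.4, 1.2, 1.1, 0.9]
low_group = [1.3, 1.0, 0.8, 0.7, 0.6, 0.5]
u_value = mann_whitney_u(high_group, low_group)
p_value = exact_one_sided_p(high_group, low_group)
```

Enumeration is practical here because twelve therapists split six and six give only 924 arrangements; for larger samples a table or a normal approximation would be used instead.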

Of the three hostility anxiety measures,
only one (direct hostility) yielded a statisti-
cally significant difference. Therapists who
expressed their hostility in direct forms were
more likely to respond with approach reac-
tions when the patients expressed hostility
toward extratherapeutic objects than were
therapists who were rated low on the direct
hostility scale. This difference was significant
at the .03 level. These two groups of thera-
pists did not differ, however, in their handling
of hostility when the therapist himself was
the object of the patient’s hostility.


TABLE 3

RELIABILITIES OF THE RATINGS OF THE
THERAPISTS’ CHARACTERISTICS

Scale                        r
86 
Direct hostility + 
Indirect hostility “30 
Hostility inhibition D 
Help seeking Up 
Approval seeking 1 
Dependency inhibition a 
Sex inhibition : 


Warmth 86 4 
TABLE 4

PERCENTAGE OF APPROACH REACTIONS TO
PATIENTS’ HOSTILITY

                 Object of Hostility
Therapist      Therapist      Others
A                 50%           85%
B                 –ᵃ            75
C                 50            65
D                 50            58
E                 43            56
F                 –ᵃ            53
G                 –ᵃ            52
H                 49            51
I                 30            50
J                 38            43
K                 20            H
L                 i             35

ᵃ The interviews that were coded in these cases yielded no
responses in the hostility toward therapist category.


Only one of the remaining personality meas-
ures was related to the therapists’ handling
of hostility. Therapists who displayed a high 
need for approval were more likely to avoid 
the patients’ hostility, whether directed to- 
ward the therapist (p < .06) or toward others 
(p < .002), than were therapists who were 
rated low on the approval seeking scale. 

In order to determine which of the specific 
approach and avoidance reactions may have 
contributed to the over-all differences that 
were obtained, separate statistical analyses 
were made for each of the therapist subcate- 
gories. The findings showed that therapists 
who readily expressed hostility in direct forms 
were less likely to ignore the patients’ hos-
tility than were therapists who were rated low
on the direct hostility scale (p < .03). In ad-
dition, therapists high in approval seeking
were less likely to make efforts to explore the
patients’ hostility (p < .08), were more likely
to ignore hostility (p < .004), and were more
inclined to change the discussion from hos-
tility to nonhostility (p < .02) than were
therapists who displayed low approval seeking. 

In general, therapists were less likely to ap-
proach the patients’ hostility when the thera-
pist was the object of hostility than when the
patient directed his hostility toward others
(Table 4). This difference, tested by means
of the sign test for the difference between two
correlated sets of scores (Siegel, 1956), was
significant at the .004 level.
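
The sign test used here reduces to a binomial calculation on the number of pairs whose difference runs in each direction. The sketch below is illustrative: the function names are hypothetical, and the seven pairs are a subset of Table 4 (therapists A, C, D, E, H, I, and J), so the probability it yields is not the value reported for the full comparison.

```python
from math import comb


def count_signs(pairs):
    """Count pairs where the first score is lower (minus) or higher (plus); ties are dropped."""
    n_plus = sum(1 for first, second in pairs if first > second)
    n_minus = sum(1 for first, second in pairs if first < second)
    return n_plus, n_minus


def sign_test_p(n_plus, n_minus, two_sided=False):
    """Binomial tail probability for the more frequent sign under p = 1/2."""
    n = n_plus + n_minus
    k = max(n_plus, n_minus)
    tail = sum(comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * tail) if two_sided else tail


# (approach % toward therapist, approach % toward others) for a subset of Table 4.
subset = [(50, 85), (50, 65), (50, 58), (43, 56), (49, 51), (30, 50), (38, 43)]
plus, minus = count_signs(subset)
p_subset = sign_test_p(plus, minus)
```

A probability near .004 would arise, for example, if eight pairs all differed in the same direction on a one-tailed test, since `sign_test_p(0, 8)` equals 1/256.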


Influence of Therapists’ Approach and Avoid-
ance Reactions on Patients’ Expression of
Hostility

The influence of the therapist’s behavior on 

the patient was measured in terms of the fre- 
quency with which the patient continued to 
express hostility following a particular thera- 
pist intervention. These results are presented 
in Table 5. Very few outright disapproval re-
sponses were scored; consequently, this cate-
gory was not included in the analysis.

As can be seen from Table 5, if a therapist 
approached the patient’s hostility, the patient 
was almost certain to continue expressing 
such feelings. On the other hand, if the thera- 
pist responded with an avoidance reaction, 
the patient was likely to drop the hostility 
topic. In testing for the significance of these 
differences, each specific approach category 
was compared with the total avoidance cate- 
gory. Similarly, each specific avoidance cate- 
gory was compared with total approach. The 
significance levels, yielded by the sign test, 
are given in Table 6. 

Although the different approach reactions 
appeared to be equally effective in eliciting 
hostile feelings, some variability was noted 
among the avoidance reactions, with silence 
and mislabeling drawing hostility significantly 
more often than the remainder of the avoid- 


TABLE 5

PERCENTAGE OF TIMES HOSTILITY FOLLOWED
SPECIFIC THERAPIST RESPONSES

                              Total
Therapist Response            Hostility
Total approach                  92%
  Reflection                    93
  Labeling                      92
  Exploration                   90
  Instigation                   93
  Approval                      84
Total avoidance                 43
  Ignoring                      41
  Topical transition            22
  Silence                       60
  Mislabeling                   70


TABLE 6

SIGNIFICANCE OF DIFFERENCES BETWEEN THE
APPROACH AND AVOIDANCE CATEGORIES

                                                  Total
                                                  Hostility
Categories Compared                                  p
Total approach vs. total avoidance                  .002
Reflection vs. total avoidance                      .002
Labeling vs. total avoidance                        .002
Exploration vs. total avoidance                     .003
Instigation vs. total avoidance                     .002
Approval vs. total avoidance                        .01

Ignoring vs. total approach                         .002
Topical transition vs. total approach               .001
Silence vs. total approach                          .02
Mislabeling vs. total approach                      .01

Topical transition vs. remainder of avoidance       .02
Silence vs. remainder of avoidance                  .05
Mislabeling vs. remainder of avoidance              .02


ance interventions. Inasmuch as silence is a 
rather ambiguous cue, the patient may at 
times interpret the therapist’s behavior as in- 
dicative of interest and attentiveness rather 
than a sign of avoidance, disinterest, or dis- 
approval. In such instances, the therapist’s 
silence might be responded to by the patient 
with further expressions of hostility. As for 
mislabeling, such responses oftentimes led the 
patient to express his feelings even more di- 
rectly so as to correct the apparent failure in 
communication. 


Table 7 presents the findings on changes in
the object of hostility following the therapists’
responses.


TABLE 7

PERCENTAGE OF TIMES PATIENTS CHANGED THE OBJECT
OF HOSTILITY FOLLOWING SPECIFIC
THERAPISTS’ RESPONSES

                              % of Object
Therapist Response            Changes
Total approach                  32%
  Reflection                    30
  Labeling                      42
  Exploration                   29
  Approval                      35

Total avoidance                 70
  Ignoring                      72
  Topical transition            81
  Silence                       59
  Mislabeling                   52


In contrast to approach reactions, the thera- 
pists’ avoidance reactions were far more likely 
to lead the patients to change the object to- 
ward whom they were expressing their hos- 
tility. These differences, tested by means of 
the sign test, were clearly significant for the 
over-all categories as well as for the subcate-
gories (Table 8).

Again, we find that silence and mislabeling 
had less of a disruptive effect on the patients’
expression of hostility than did the other 


TABLE 8

SIGNIFICANCE LEVELS OF DIFFERENCES BETWEEN
APPROACH AND AVOIDANCE CATEGORIES

Categories Compared p 
sil B ae: 7 1 
Total approach vs. total avoidance i 
Reflection vs. total avoidance “ol 
Labeling vs. total avoidance i 
Exploration vs. total avoidance isl 
Approval vs. total avoidance : 
ol 
Ignoring vs. total approach a 
Topical transition vs. total approach ‘02 
Silence vs. total approach ‘02 
Mislabeling vs. total approach
Bias 05 
Silence vs. remainder of avoidance ‘05 
Mislabeling vs. remainder of avoidance p 

avoidance responses. This finding is in ac-
cordance with that reported in the preceding
section (Table 5).


DISCUSSION 


The hypotheses concerning the influence of
therapists’ hostility anxieties on their han-
dling of hostility in the psychotherapeutic in-
teraction were only partially confirmed. While
therapists who expressed hostility in direct
forms tended to encourage their patients to
express hostility, no significant differences
were found between therapists who differed
in the amount of indirect hostility or in the
degree of hostility inhibition that they dis-
played. The failure to find significant differ-
ences for the latter measures may be due, in
part, to the relative homogeneity of the thera-
pists under study. Few, if any, of them mani-
fested severe hostility conflicts; and few, if
any, were completely free of hostility anxieties.

It was of interest to find that therapists 
who displayed a high need for approval 
tended to discourage the expression of hos- 
tility on the part of their patients. One might 
assume that a therapist who is strongly mo- 
tivated to elicit and maintain the approval of 
others would generalize to some degree the 
approval seeking to his patient as well. Thus, 
the therapist may refrain from encouraging 
the patient to express feelings that may be 
somewhat anxiety or guilt provoking, since 
to make the patient uncomfortable may lose 
him the patient’s approval. 

During the course of psychotherapy the pa- 
tient may generalize to the therapist some of 
his conflicts and emotional reactions. Once 
these responses are transferred, their modifi- 
cation will depend on the therapist’s response. 
Since the therapist has control over most of 
the conditions that may be essential for re- 
learning to occur, the patient’s behavior can 
be more readily modified in relationship to 
the therapist than in extratherapeutic inter- 
actions where interpersonal consequences are 
apt to be somewhat inconsistent or even con- 
tradictory. Our findings suggest, at least as 
far as the handling of hostility is concerned, 
that therapists are less inclined to accept and 
to analyze hostile feelings directed toward 
themselves than they are to accept and ana- 
lyze similar feelings expressed toward others. 
It is difficult to know how much this finding
may be a function of the relative inexperience 
of the therapists in the study, the majority 
of whom were still in training. 

Most previous studies that have investi-
gated the influence of interviewers’ behavior
on patients’ productions have focused on posi- 
tive reinforcing responses (Krasner, 1958; 
Rogers, 1958) and, except for Murray’s 
(1956) investigation, there is little data on 
the effect of therapists’ negative responses on 
patients’ verbalizations. From the results of 
this study it is clear that avoidance reactions 
occurred almost as frequently as did those of 
approach. The avoidance reactions, however, 
tended to take the form of nonrewards rather
than of frankly negative reinforcements. Dis-
approval was very rarely encountered, whereas
some form of nonresponse was much more
common.

The prediction that approach responses 
would be more effective than avoidance re- 
sponses in getting the patient to express his 
hostility feelings was clearly confirmed. Not 
only were the patients apt to drop the dis- 
cussion of hostility if the therapist responded 
with avoidance, but also, in instances where 
they did continue with the topic, they were 
more likely to change the object of their hos- 
tility. In evaluating these results, it is impor- 
tant to bear in mind that the criterion of the 
influence of the therapists’ behavior was the 
immediately observable effect it had on the 
patients’ verbal behavior. Had some criterion 
of delayed outcome been used, the results 
may very well have been different. Moreover, 
the study was limited to the patients’ verbal 
behavior and the accompanying verbal reac- 
tions of the therapists. The operation of re- 
ward and punishment, on the other hand, is 
apt to take more subtle and nonverbal forms, 
particularly in the case of disapproval where 
there is little sanction for such reactions in 
the psychotherapeutic relationship. It is highly 
probable that many nonverbal cues, which in 
themselves could have produced the changes 
in the patients’ behavior, accompanied the 
therapists’ verbal responses. 

A number of the personality measures did 
not appear to be related to the therapists’ 
handling of hostility. They may, however, be 
related to the therapists’ handling of other 
forms of behavior; e.g., sexual and depend- 
ency expressions. A study of such relation- 
ships is now being planned. 


SUMMARY 


The present investigation was designed to 
test the following hypotheses: (a) that thera- 
pists with high hostility anxiety are more 
inclined to avoid rather than to approach pa- 
tients’ hostility than are therapists who dis- 
play low anxiety; (b) that approach reac- 
tions encourage the patient to express further 
hostility, whereas avoidance reactions serve 
to decrease or inhibit such expression; and 
(c) that patients are more likely to change 
the object of their hostility following an avoid- 
ance response by the therapist than they are 
following an approach response. 

To test these hypotheses, tape recordings of
110 interviews obtained from 17 patients 
treated by 12 therapists were analyzed. The 
therapists’ personality characteristics were as- 
sessed through ratings provided by clinical 
psychology staff members. Patient-therapist 
interaction sequences were coded for the 
number of times therapists responded with 
approach or avoidance reactions to the pa- 
tients’ hostility and also for the frequency 
with which patients continued to express hos- 
tility immediately following therapists’ ap- 
proach or avoidance interventions. The ob- 
jects toward whom the patients directed their 
hostility were also rated. 

The results obtained were as follows: 

1. Therapists who typically expressed their 
own hostility in direct forms and who dis- 
played low need for approval were more likely 
to permit and encourage their patients’ hos- 
tility than were therapists who expressed 
little direct hostility and who showed high 
approval seeking behavior. No significant re- 
lationships were found for six other meas-
ures of the therapists’ characteristics.

2. In general, psychotherapists were more 
inclined to avoid hostility when it was di- 
rected toward themselves than when the pa- 
tients directed their hostility toward other 
objects.

3. The patients were more likely to drop
the hostility topic or to change the object of
their hostility following therapists’ avoidance
reactions than they were following the thera-
pists’ approach reactions.


REFERENCES 


BANDURA, A. Psychotherapists’ anxiety level, self-in-
sight, and psychotherapeutic competence. J. ab-
norm. soc. Psychol., 1956, 52, 333-337.

CUTLER, R. L. Countertransference effects in psycho-
therapy. J. consult. Psychol., 1958, 22, 349-356.

DITTMAN, A. T. The interpersonal process in psycho-
therapy: Development of a research method. J.
abnorm. soc. Psychol., 1952, 47, 236-244.

EBEL, R. L. Estimation of reliability of ratings. Psy-
chometrika, 1951, 16, 407-424.

KRASNER, L. Studies of the conditioning of verbal
behavior. Psychol. Bull., 1958, 55, 148-170.

LITTLE, M. Counter-transference and the patient’s
response to it. Int. J. Psycho-Anal., 1951, 32, 32-
40.

LUBORSKY, L. The personality of the psychothera-
pist. Menninger Quart., 1952, 6, 1-6.

MURRAY, E. J. The content-analysis method for
studying psychotherapy. Psychol. Monogr., 1956,
70 (13, Whole No. 420).

REICH, A. On counter-transference. Int. J. Psy-
cho-Anal., 1951, 32, 25-31.

RIGLER, D. Some determinants of therapist behav-
ior. Unpublished doctoral dissertation, Univer. of
Michigan, 1956.

ROGERS, J. M. Operant conditioning in a quasi-ther-
apy situation. Unpublished doctoral dissertation,
Stanford Univer., 1958.

SIEGEL, S. Nonparametric statistics. New York: Mc-
Graw-Hill, 1956.


(Received February 11, 1959) 




Journal of Consulting Psychology 
Vol. 24, No. 1, 1960


IMPLICATIONS OF HOMOSEXUALITY AMONG AIR 
FORCE TRAINEES 


WILLIAM T. DOIDGE
USAF Hospital, Lackland Air Force Base

AND WAYNE H. HOLTZMAN
University of Texas


During the past decade, research on sexual
behavior has rapidly gained wider acceptance
by the general public, and many commonly
held attitudes are seriously challenged by new
information. One major issue concerns the nature
of homosexuality. Is homosexuality symptomatic
of a more general personality disturbance
or may it be regarded, in some instances
at least, as confined to the sexual sector alone?
In a recent article Hooker (1957) suggests, 
albeit very cautiously: 
Even if one assumes that homosexuality represents a 
severe form of maladjustment to society in the sexual 
sector of behavior, this does not necessarily mean 
the homosexual must be severely maladjusted in 
other sectors of his behavior. Or if one assumes that 
homosexuality is a form of severe maladjustment 
internally, it may be that the disturbance is limited 
to the sexual sector alone (p. 30). 


A contrasting opinion is advanced by the 
Group for the Advancement of Psychiatry 
(1955, p. 2): “When such homosexual be- 
havior persists in an adult, it is then a symp- 
tom of severe emotional disorder.” It is with 
this conflicting opinion that the present paper
is concerned.

As part of a more extensive project, recent 
studies conducted at Lackland Air Force Base 
were analyzed to see which, if either, of the 
two views could be supported. From a mili- 
tary point of view the issue is one of practical 
importance in deciding upon proper disposi- 
tion of homosexual cases brought to light in 
the Air Force.

The GAP report quite properly distin- 
guishes between homosexuality and homo- 
sexual behavior. Homosexuality is considered 
to be a “persistent emotional and physical
attraction to members of the same sex,” and
according to this report “constitutes an ab- 
normal personality development.” Adolescent 
exploratory experiences, occasional contacts 
while intoxicated, or isolated acts are more 
appropriately called homosexual behavior as 
distinguished from the condition of homosexu- 
ality. This distinction is especially important 
in the military service, where homosexual ac- 
tivity may result in trial by courtmartial and 
separation from the Air Force. Occasions also 
arise when it is necessary to identify those 
individuals who pretend to be homosexuals 
for the purpose of obtaining a discharge. 

There are a number of unique advantages 
to a study of male homosexuals in the Air 
Force. Nearly 100,000 airmen a year pass 
through basic training at Lackland Air Force 
Base under controlled conditions, providing a 
large sample of young adult men representing 
all characteristics of the general population. 
Most previous research on homosexuality has 
been severely limited by strong biases in the 
population studied. The almost complete lack 
of privacy and the barracks-type of living are 
likely to stimulate sexual drives in male ho- 
mosexuals. Prevailing cultural attitudes and 
stringent military policy heighten the con- 
flict, precipitating contact with the psychi- 
atric clinic. 

In the Air Force all homosexual suspects 
are subjected to exhaustive interrogations by 
special investigators. Since heterosexual sub- 
jects (Ss) are not usually put through this 
procedure, it was deemed advisable in formu- 
lating an experimental design to include a 
heterosexual group which had been through 
such an investigation. In the final formulation
two homosexual and two heterosexual
groups, each containing 20 individuals, were
selected for thorough case study and psychological
testing.


Table 1

Categories of Experimental Groups According to Homosexuality-Heterosexuality
Dimension and Reason for Special Investigation

Group                            N    Psychosexual Orientation                                Reason for Investigation
H  Homosexual (predominant)      20   Predominantly homosexual                                Homosexual behavior
A  Homosexual (accessory)        20   Predominantly heterosexual with homosexual experience   Homosexual behavior
D  Heterosexual (disciplinary)   20   Exclusively heterosexual                                Nonsexual offenses
N  Heterosexual (normal)         20   Exclusively heterosexual                                Not investigated


Description of the Study

All homosexual suspects referred to the psychiatric
clinic at Lackland Air Force Base
during a period of several months were given
an experimental battery of psychological
tests, a psychiatric interview, and [...]
history. The psychiatrist 1 [...] interview
rated each individual [...] heterosexual-homosexual
scale [...] Kinsey’s classification (Kinsey,
Pomeroy, & Martin, 1948) [...] from exclusively
heterosexual [...] through various mixtures
of heterosexuality and homosexuality, to
exclusively [...] behavior, [...] orientation, [...]

Since individuals of both homosexual
groups had been subjected to special
investigation as suspected homosexuals,
a control group (D) was formed, consisting
of heterosexuals who had been through a
similar investigation for other kinds of alleged
offenses. A fourth group (N) of normal heterosexual
men constituted a second control
group for comparison with the two classes of
homosexuals.

1 All psychiatric examinations were accomplished
by Louis J. West, De[...] Neurology, Universi[...]

All 80 men were given a battery of 10 tests
in the following order: Sexual Identification
Survey, Homosexual Homonyms, Edwards
Personal Preference Schedule (Edwards,
1955), Heineman’s Forced-Choice Anxiety
Scale (Heineman, 1953), Worchel’s Self-Activities
Inventory (Worchel, 1957), the Food
Preference and Aversion Scale, the Rorschach
test (including a special testing of
limits for sex), the Blacky Pictures Technique
(Blum, 1950), the MMPI, and six subtests
of the Wechsler Adult Intelligence Scale
(Vocabulary, Information, Similarities, Picture
Completion, Block Design, and Digit
Symbol). The Sexual Identification Survey
was designed especially for this study and
consisted of 30 human figures taken from
drawings, paintings, statues, and other sources
which were photographically reproduced in
such a way as to make the sex ambiguous.
The S’s task was to identify the sex of each
figure. The Food Preference and Aversion
Scale was adapted from Wallen (1943) and
consisted of 29 foods to be rated by the S on
a five-point like-dislike scale.

The third test developed especially for this
study was the Homosexual Homonyms. Using
Secord’s (1953) method, a list of 10 homonyms
(blow, gay, fairy, French, fruit, queer,
rear, suck, pansy, Greek) was embedded in a
buffer list of 50 words and given to each S
as a word association test.
Each test was scored without awareness of
the sexual class into which the S had been 
placed in order to avoid any possible bias. 
Where score distributions were reasonably 
normal, comparisons among the four experi- 
mental groups were made by analysis of 
variance. Where distributions were severely 
skewed or truncated, chi square or the 
Kruskal-Wallis nonparametric test (Walker 
& Lev, 1953) was used. 


Results 


Striking differences were obtained between 
the H group and the other three groups on 
most of the MMPI scales. F ratios significant 
beyond the .01 level were obtained for F, Hy, 
Pd, Mf, Pa, Pt, and Sc; and beyond the .05 
level for Hs, D, and Si. Mean T scores for 
each scale and each group are presented in 
Fig. 1.

The sharp difference in elevation of most 
scores in the H group, as compared to the 
other three groups, strongly suggests that the 
symptom of homosexuality is but a part of 
a more general personality disturbance. The 
close similarity of the A group to the two 
control groups is convincing evidence of the
important distinction between the exclusively
homosexual individual and the person who
has only occasionally engaged in homosexual
activities. Of particular interest is the comparison
of the four groups on the masculinity-femininity
scale. Only two men in the H
group had T scores on the Mf scale less than
70 (and one of these obtained a score of 67)!
By contrast, the highest T score obtained in
the A group was 70, and there were only 4
such individuals out of 20. None of the normal
controls (N) and only one of the delinquent
controls (D) had an Mf score as high
as 70.

Fig. 1. Mean T scores of the MMPI for four groups of airmen (predominantly
homosexual, accessory homosexual, disciplinary, and normal). N = 80; each group = 20.

Similar results were obtained for Heineman’s
Forced-Choice Anxiety Scale. Adapted
from Taylor’s Manifest Anxiety Scale (Taylor,
1953) and the MMPI, Heineman’s scale
was designed to eliminate social desirability
as a response set in anxiety questionnaires.
All of the statistical significance in the F
ratio between groups can be attributed to the
unusually high mean score for the H group.
The H, A, D, and N groups obtained mean
scores of 46.6, 35.4, 30.5, and 31.5 respectively.
Again, it is apparent that the exclusively
homosexual male manifests a general
personality disturbance while the occasional
homosexual does not. 

Assuming that sexual associations to the 
critical words of the Homosexual Homonyms 
are anxiety arousing, delayed response times 
may reflect this anxiety. The mean trans- 
formed scores for response times yield identi- 
cal rankings among the groups with the scores 
obtained from Heineman’s Forced-Choice Anx- 
iety Scale, and the magnitudes of the differ- 
ences are almost identical, suggesting that the 
similarity is due to the effects of anxiety. 

Food aversions have been positively corre- 
lated with anxiety and neuroticism (Smith, 
Powell, & Ross, 1955; Wallen, 1945). Com- 
bining the scores of the two homosexual 
groups and the two heterosexual groups on 
the Food Preference and Aversion Scale pro- 
vided adequate frequencies for contingency 
tables. The total list and 5 of the 29 foods
yielded aversion scores for the combined homosexual
groups significantly different from
the combined heterosexual groups beyond the
.05 level.


Based upon the assumption that sexual
confusion may exist in a person of one sex
who has the interests, [...] of the other,
[...] popular [...] feminine role [...]

Perhaps the most important implication to 
emerge from this study concerns the severity 
of psychopathology which is likely to accom- 
pany the markedly homosexual individual. 
For this person, the findings support the contention
that he is likely to be suffering from
an emotional disorder which is relatively pervasive
and severe. By comparison, the accessory
homosexual does not show evidence of
extensive personality disturbance of the severity
found to characterize the markedly
homosexual individual.

Another implication is the need for devel- 
oping assessment instruments which will in- 
crease the accuracy and the objectivity of the 
evaluation of a homosexual condition with 
respect to both its presence and its severity. 
Such information would have diagnostic, 
prognostic, and treatment values. Some of 
the tests used in this study show develop- 
mental potentialities for such purposes. 

Further research similar in kind and scope 
to the present study should be undertaken for 
the female sex. The same kinds of data for 
homosexual women could be integrated with 
data from this study to provide more complete
information on homosexuality in general.
Complementary findings from both studies
might contribute to a better understanding
of psychosexual development,
personality functioning, and the relationship
of homosexuality to psychopathology.


Summary 


A battery of 10 psychological tests, including
2 original tests, was administered to 80
enlisted airmen divided equally into four
groups. Ss were assigned to groups according
to whether they were heterosexual, partly
homosexual, or markedly homosexual in their
psychosexual orientations. Only the markedly
homosexual group gave test records that were
strikingly different from the control groups,
suggesting that markedly homosexual individuals
are likely to be suffering from an
emotional disorder which is relatively pervasive,
severe, and disqualifying for military
service. The partly homosexual group (composed
of individuals who were predominantly
heterosexual but with varying degrees of homosexual
experience) gave test records that
closely approximated the results of the two
control groups. Implications for improving
existing methods of assessment and disposition
of homosexual cases are discussed, and
extension of the present study to include female
members of the Air Force is recommended.
THE EFFICACY OF A SHORT FORM OF THE MMPI
TO EVALUATE DEPRESSION AND MORALE LOSS


ARTHUR CANTER

Johns Hopkins Hospital and School of Medicine, Baltimore, Maryland


Several studies have consistently reported
high frequencies in the MMPI of D and Pt
(the so-called 27 or 72 codes) scores among
patients suffering severe emotional disturbances,
in particular depression (Welsh &
Dahlstrom, 1956). One study (Simon &
Hales, 1949) found a high incidence of this
pattern in hospitalized suicidal veterans. A
different study demonstrated a significant relationship
between [...] acceptance (Zuckerman & Monashkin, 1957).
It would seem that [...] would be of specific [...]
or other situations [...] contacts might preclude [...]
MMPI. Arguments against using single scales
of the MMPI as [...] diagnostic categories (Welsh &
[...] is the possi[...] context of being [...]
host of other [...] scale to another [...]
the kind of behavior one [...] frequently found in the
MMPI profiles of patients having symptoms [...]
loss of self-confidence [...] diagnoses. Thus if one
were interested in the identification of individuals
who are seriously depressed without regard to their
diagnostic status, these two keys used alone in a single
scale might be expected to work well.
The purpose of this study was to test the
discrimination power of a brief form of the
MMPI on population samples differing in degrees
of depression using only those items
necessary to score D and Pt.

The experimental scale, to be referred to as
ML (for Morale Loss), consisted of the [...]
items of the MMPI that are scored for D,
Pt, and K. The K items were included because
they enter into a corrected T score for Pt, and
also its role as a “suppressor variable” has
possible clinical as well as research value. The
ML scale was made up in a booklet form
allowing it to be used either with an answer
sheet or by having the subject (S) record his
answers directly in the booklet. The average
time for individual administration was between
15 and 20 minutes.

Three groups of Ss were tested by the ML
scale. Group A consisted of 18 patients (nine
female) who had been hospitalized because
of a neuropsychiatric disorder and who had
recently (within a period of 15 days before
testing) made an unsuccessful active suicide
attempt.1 In every instance some action was
taken which was judged to represent a bona
fide attempt at self-injury or destruction and
not mere verbal threats. The Group A patients
were all given a psychiatric diagnosis
during the hospitalization following the suicide
attempt: five patients were considered to
be schizophrenic; three, manic-depressive depressed;
and the remainder severe psychoneurotic
reactions with emphasis placed on
the depressive symptoms.

1 These included: gunshot wounds, slashing of
face and neck, wrist slashing, ingestion of known
poisons, overdose of sedatives, inhalation of gas, and
attempt at hanging self.


Table 1
Mean Scores a on the ML Form of the MMPI
of Patient and Normal Adult Groups

Group                         D       Pt             Pt + K         K
A patients (N = 18)   Mean    80.94   72.17          79.89          11.11
                      SD      14.6    15.44          15.9           3.4
B patients (N = 18)   Mean    68.33   56.56          65.67          15.94
                      SD      14.9    10.8           11.7           5.2
C normals (N = 18)    Mean    48.72   40.83          [illegible]    17.94
                      SD      7.9     [illegible]    5.2            6.9

a Entries for K are raw scores. All others are T scores.

Group B contained 18 patients (nine fe- 
male) who were either hospitalized for a psy- 
chiatric disorder or seen in the outpatient de- 
partment of Henry Phipps Psychiatric Clinic 
for diagnostic study and treatment. None had 
a history of overt suicide attempts nor were 
they considered suicidal at the time of study. 
The diagnoses included: four schizophrenic 
reactions, four depressive reactions (not psy- 
chotic), and the remainder psychoneurotic 
reactions with anxiety and some depressive 
features. Thus the A and B patients differed 
essentially in the fact that the former had at- 
tempted suicide and were considered more 
depressed. 
Group C consisted of 18 presumably normal
adults (nine female) who had no history
of mental illness requiring psychiatric treat- 
ment and were comparable to the A and B 
patients with respect to age and socioeco- 
nomic status. The C normals were obtained 
from groups of persons working in diverse oc- 
cupations in and around a hospital setting, 
such as machinists, plumbers, clerks, labora- 
tory technicians, etc. The mean age for Group 
A was 32.2; for Group B, 31.4; for Group C, 
33.3. All ML scales were administered in in- 
dividual sessions with the Ss. The directions 
for the test were essentially the same as those 
used in the standard MMPI booklet.

The D, Pt, and K raw scores were determined
from the appropriate MMPI keys, and
the T score equivalents for the two clinical
scales were determined from the MMPI
tables. Both the K-corrected and uncorrected
T scores for Pt were recorded. The T scores
of the clinical scales were evaluated in preference
to the raw scores since half the Ss were
female, and the T score norms for D and Pt
have different values for the sexes.


Results and Discussion 


The mean scores for the three groups on 
the D, Pt, and K scales may be seen in
Table 1. The significance of differences among 
the mean scores of the two patient groups and 
the normal adults was evaluated by simple 
analysis of variance. Differences between ad- 
jacent group means were then analyzed by 
the t test, using the within groups variances
obtained from the analysis of variance to de- 
termine the best estimate of the standard 
error of the difference. Table 2 provides a 
summary of the results of the analysis. It 
may be seen that both patient groups were 
significantly higher on D and either Pt vari- 
able, with Group A scoring significantly 
higher than Group B, considering the patient 
groups. Only in the case of K was there no 
significant difference found between Groups 
B and C. 
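
The procedure described above (a t test on adjacent group means that takes its error term from the within-groups variance of the overall analysis of variance) can be sketched as follows. This is only an illustrative sketch, not the author's computation: the function names are hypothetical, and it assumes that the standard deviations in Table 1 were computed with N rather than N - 1 in the denominator. Under that assumption the D scale values come out close to those reported in Table 2.

```python
import math

def within_groups_variance(sds, n, sd_uses_n=True):
    """Pooled within-groups mean square for equal-sized groups.

    Assumption (not stated in the paper): the tabled SDs divide by n,
    so each squared SD is rescaled by n / (n - 1) before pooling.
    """
    factor = n / (n - 1) if sd_uses_n else 1.0
    return factor * sum(sd ** 2 for sd in sds) / len(sds)

def t_adjacent(mean_high, mean_low, ms_within, n):
    """t for two group means, using the ANOVA within-groups mean square."""
    se_diff = math.sqrt(ms_within * (2.0 / n))
    return (mean_high - mean_low) / se_diff

# D scale means and SDs for Groups A, B, and C as given in Table 1.
n = 18
d_means = {"A": 80.94, "B": 68.33, "C": 48.72}
d_sds = [14.6, 14.9, 7.9]

ms_w = within_groups_variance(d_sds, n)
t_ab = t_adjacent(d_means["A"], d_means["B"], ms_w, n)
t_bc = t_adjacent(d_means["B"], d_means["C"], ms_w, n)
df_within = 3 * (n - 1)  # degrees of freedom of the pooled error term

print(f"MS within = {ms_w:.1f}, df = {df_within}")
print(f"t (A vs. B) = {t_ab:.2f}, t (B vs. C) = {t_bc:.2f}")
```

Pooling the error term across all three groups gives each comparison more degrees of freedom than a t test restricted to the two groups being compared, which is presumably why the within-groups variance was used.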

It is apparent that the short MMPI scale 
as used in this study does permit discrimina- 
tion between the two types of patient groups 
which differed originally in the degree of de- 
pression, as well as between the patient and 
normal adult groups. The results also tend 
to support the findings of Simon and Hales 
(1949) on their suicidal group in that a 
higher proportion of the A patients scored
above T score 70 on both D and Pt than did
the B patients. Seventy-two per cent of the A
patients scored this high on both scales as
[...] value of K drops as the degree of disturbance
increases. The Pt scale differentiated the
groups as well as Pt + K, although the latter
yielded higher T scores for each of the three
groups.


Table 2

Summary of Tests of Differences Among Patient and
Normal Groups on D, Pt, and Pt + K T Scores

                   Values of F for
                   Differences among     Values of t
                   all Three Groups      Group A vs. B    Group B vs. C
D                  26.88**               2.85*            4.43*
Pt                 33.55**               4.00**           4.12**
Pt + K             [illegible]           3.89**           4.98**
K (raw scores)     11.78*                3.35*            1.39

* p < .01.
** p < .001.


Table 3

Distribution of the ML Key Scores of Patient and
Normal Groups

Columns: Range of Scores; Group A (N = 18); Group B (N = 18); Group C (N = 18).
Score ranges run from 29-30 down to 0 in two-point intervals.
[Tally marks illegible in this copy.]

The fact that Pt shows high positive relationships
with D consistently and appears to
be sensitive to emotional disturbances even
where no dominant obsessive-compulsive or
phobic syndromes are apparent [...] the
point that Pt measures emotional disturbance
per se better than “psychasthenia.” Support
for this view was provided when the data of
the present study were analyzed for the [...]
differentiating [...] of the ML test [...]
would help separate Group A patients from B
patients and both from normals. A total of 30
items seemed to approximate an arbitrary criterion
whereby a response in one direction of
an item would be given by two thirds of
Group A to one third of Group B and less
than this by Group C.2 Table 3 summarizes
the distributions of the scores obtained by
such a key from the three groups. When the
items were identified as to which of the three
MMPI keys were involved, it was found that
22 of the 30 items are scored for Pt, 7 are
from D (with 2 scored oppositely), and 1
from K (also scored oppositely).

2 These items with their MMPI booklet numbers
and the direction scored for ML are as follows
(some entries are partly illegible):
8F, 32T, 39T, 41T, 67T, 76T, 86T, 84T, [...], 107,
14ST, 153F, 182T, 189T, 236T, 238T, 290T, [...],
337T, 340, 343T, 352T, 357T, 358T, 359T,
362T, 366T, 383T.


Table 4
Distributions and Means of ML Key Scores of Brucellosis and CMI Groups

                   I                  II                 III                IV
                   Chronic            Nonchronic         High-Positive      [heading partly
                   Brucellosis        Brucellosis        CMI                illegible] CMI
                   Group              Group              Group              Group
                   (N = 9)            (N = 8)            (N = 15)           (N = 26)
Mean ML Score      7.4                2.7                11.3               3.5

I vs. II: t = 3.41 (p < .01)
III vs. IV: t = 5.85 (p < .01)
[Tally marks for the distribution of ML scores are illegible in this copy.]


The likely meaning applied to Pt or the
key derived as above was explored further by 
the results obtained from other studies car- 
ried out by the writer. In one study (Im- 
boden, Canter, Cluff, & Trever, 1959) both 
D and Pt were found to differentiate between 
two groups of adults who had suffered acute 
brucellosis infections. One group had recov- 
ered completely from the illness while the 
other was chronically ill for at least several 
years and up to the time of the study. While 
the chronic group scored higher on D and Pt,
the psychiatric studies did not suggest prominence
of obsessive-compulsive or phobic symptoms.
However, the psychiatric studies and
a self-concept scale indicated they were de- 
pressed and suffering low self-esteem. The 
records of both groups were scored by the 
tentative key, labeled ML, made up of the 
30 items referred to above. 

In a separate study still in progress, Ss who 
had taken the Cornell Medical Health Ques- 
tionnaire (Brodman, Erdman, & Wolff, 1956) 
were divided into two groups and compared 
on the ML key. One group had extremely 
high CMI scores indicative of positive medi- 
cal illness histories, while the other group 
represented a sample having extremely low 
CMI scores. Table 4 gives the results of com- 
paring the various groups in these two studies 
on the ML key. As can be seen, the ML vari- 
able separates the groups as might be ex- 
pected if one assumes that chronically ill or 
frequently ill people are more likely to be low 
in self-esteem or morale than healthier people. 

From such findings, the high proportion of 
Pt items in the ML key, and other studies in 
the literature with respect to Pt (Welsh & 
Dahlstrom, 1956), it would appear that the 


Pt scale might be profitably reanalyzed with 
respect to its self-esteem components and 
those more uniquely associated with psy- 
chasthenia. The ML key is offered as only 
one tentative approach to this problem. 


Summary 


A short form of the MMPI made up of the 
D, Pt, and K items only was administered to 
two patient groups, which differed in their 
degree of depression and emotional disturb- 
ance as judged by their histories of suicidal 
attempts, and to a group of normal adults. 
The scores of the three groups on the three 
keys revealed that they could be differentiated 
from each other according to the degree of 
disturbance. Thus the effectiveness these keys 
have as ordinarily used in the entire MMPI 
scale was maintained when isolated in this 
fashion. The Pt scale was given further attention
when a key made up of the most discriminating
items of the short test was found
to be heavily weighted with Pt. The results
of follow-up studies and the long observed 
relationship between Pt and D suggest that 
Pt should be redesignated as a morale loss or 
self-esteem scale. An alternate key to Pt was 
offered by the study toward this end. 
THE EXPRESSION OF HOSTILITY AND GUILT IN
MELANCHOLIC AND PARANOID WOMEN 


T. M. CAINE 1
Runwell Hospital, Wickford, England 


That feelings of hostility and guilt, with the 
concomitant expression of extrapunitive and 
intropunitive attitudes, are important factors 
in diagnosis and psychopathology has been 
recognised for many years. The Freudian 
doctrine of superego formation, for example, 
involves aggressive and guilt components. In
discussing symptomatology in general psychiatry,
Henderson and Gillespie (1950) consider
that certain delusions (the idea that
one’s thoughts are being read, and self-accusatory
ideas of delusional intensity) may rest
on an affective basis of guilt. In clinical psy- 
chology, Foulds (1958) has adapted a tech- 
nique, devised by Rosenzweig, Clarke, Gar- 
field, and Lehndorff (1946), which elicits the 
expression of intropunitive and extrapunitive 
attitudes for diagnostic purposes. Fundamen- 
tally, Foulds’ adaptation measures the de- 
gree to which one is more critical or less criti- 
cal of oneself than of others. Siegel (1956) 
has developed a “hostility” scale from the 
Minnesota Multiphasic Personality Inventory 
(MMPI) which he has related to “authori- 
tarianism.” He found that groups scoring 
highly on his dimension of authoritarianism 
had greater overt hostility as measured by his 
hostility scale. In a further paper (Siegel, 
Spelka, & Miller, 1957) presented at the 
American Psychological Association Conven- 
tion in September 1957, evidence was pro- 
vided of the feasibility of measuring varying 
aspects of hostility such as “extrapunitive,” 
“intropunitive,” and “projected.” Alexander 
(1949) has distinguished between provocative 
hostile behaviour, projected hostility, guilt
feelings, and inferiority feelings. It seems
clear, then, that hostility and guilt are global
terms and that different authors have separated
out certain aspects for study.

In line with this tradition, the present study
is concerned with the measurement of [...]
aspects of hostility and of self-criticism/guilt
at various levels of expression. Since interest
was centered on measurement, persons known
to be extreme to a pathological degree in the
expression of these feelings, namely melancholics
(guilty) and paranoid states (hostile),
were used as subjects (Ss).

1 The author would like to express his indebtedness
to G. A. Foulds for his encouragement and advice
during the course of this investigation, and his
thanks to R. Ström-Olsen for permission to publish
this report.


Method 


Measurement of Extrapunitive and Intropunitive
Attitudes


The purpose, in testing, has been to move
from an obvious, overt statement of attitude
to an indirect, covert expression.

At the superficial level, two hostility scales
were made up from the MMPI, one consisting
of 21 acting out hostile statements such
as: “At times I have a strong urge to do something
harmful or shocking,” “I can easily
make other people afraid of me, and sometimes
do for the fun of it,” and “Horses that
don’t pull should be beaten or kicked.” A
second scale, implying criticism of others and
accusing others of hostility to oneself, was
devised as a measure of projected hostility.
This scale consisted of 26 items such as: “I
think most people would lie to get ahead,”
and “I believe I am being plotted against.”

As a measure of self-criticism/guilt at this
level, items implying criticism of self or admission
of guilt were drawn from the MMPI.
This scale included such statements as: “I
have one or more habits which are so strong
that it is no use fighting against them,” “I
have several times given up doing things because
I thought too little of my ability,” and
“I believe my sins are unpardonable.” The
items of this scale were allocated by the author
and agreed upon by the senior psychologist
at this department.

At a less direct level, a development of a 
method of measurement described by Watson, 
Pritzker, and Madison (1955) was employed. 
This method consisted of 60 sets of scrambled 
words which could be assembled in either of 
two ways by leaving out one word in each set. 
One completion led to a hostile sentence and 
the other to a neutral. These authors found 
that a neurotic group made significantly more 
hostile completions than did a nonneurotic 
control group. In the test constructed for the 
present study, 48 sets of four scrambled words 
were compiled. Paralleling the MMPI scales, 
in 16 of the sets one completion led to an 
acting out hostile response and the other to 
a neutral; in 16, one completion led to a pro- 
jected hostile response (as defined above for 
the MMPI scale) and the other to a neutral, 
and in 16, one completion led to a self-criti- 
cal/guilt sentence and the other to a neutral 


one. Examples:

acting out hostility
    HIT     YOU
    I'LL    TAKE

projected hostility
    THEY'D  ME
    CHEAT   FIND

self-critical/guilt
    COURAGE NO
    BOOKS   I'VE

The order of the words was randomized in 
making up the sets. The number of possible 
combinations was so great that the effect of 
a methodical approach by the S, such as al- 
ways reading along the top line from left to 

right, was reduced to a minimum.

In administering this Sentence Building 
test, the card with the four scrambled words 
on it was laid in front of the S, who was in- 
structed to give the first sentence he could 
make up as fast as he could, by leaving one 
word out. The emphasis was on reaction time,
with a stop watch prominently displayed and
each time noted down. The number of acting
out hostility, projected hostility,
and guilt constructions given first was noted
for each S. This may be regarded as a projec- 
tive type of test in which perceptual factors 
such as the “selection of personally relevant 
cues,” to use Bruner’s (1951) phrase, may 
be operative. The word or phrase frequency 
effect, the problem first raised by Solomon 
and Howes (1951), should be the same for 
both groups. 

At a more indirect level still, eight The- 
matic Apperception Test cards (TAT), 
namely 4, 3, 13MF, and 18GF of the stand- 
ard set, and four specially prepared pictures 
were used. These cards were selected be- 
cause they were thought to suggest aggressive 
themes. Administration was that described by 
Foulds (1953), involving the four questions, 
“what is the situation,” “what led up to it,” 
“how are they thinking and feeling,” and 
“how does it finish up?” The protocols were 
then scored for hostility and guilt content by 
two psychologists independently and without 
knowledge of the Ss’ identity. 

A detailed scoring system was devised for 
this purpose. For the hostility scoring, the 
mention of murder or death through accident 
or war was given a score of 3. Stories of physi- 
cal assault, injuries dangerous to life, or rape 
were given scores of 2. Minor injuries, verbal 
assault, or the mention of interpersonal fric- 
tions were given scores of 1. For the guilt 
scoring, the mention of suicide scored 3, re- 
morse with reparation 2, and remorse with- 
out reparation 1. If several of these themes 
were mentioned in one picture the highest 
score was taken. These highest scores were 
then summed for the series of eight pictures 
to get a total score. Since correlations of .98
for ratings of hostility and .95 for guilt were
obtained between the two raters, the ratings of
the senior psychologist were accepted as working scores.
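The scoring rule just described can be restated as a short computation. The following is only a minimal sketch in Python and is not part of the original procedure: the theme labels and function names are hypothetical, chosen for illustration, while the weights are those given in the text.

```python
# Weights follow the scoring system described in the text.
# The theme labels themselves are hypothetical shorthand.
HOSTILITY_WEIGHTS = {
    "murder": 3,
    "death_by_accident_or_war": 3,
    "physical_assault": 2,
    "injury_dangerous_to_life": 2,
    "rape": 2,
    "minor_injury": 1,
    "verbal_assault": 1,
    "interpersonal_friction": 1,
}
GUILT_WEIGHTS = {
    "suicide": 3,
    "remorse_with_reparation": 2,
    "remorse_without_reparation": 1,
}


def picture_score(themes, weights):
    """Return the highest weight among the themes mentioned for one picture."""
    return max((weights.get(theme, 0) for theme in themes), default=0)


def total_score(protocol, weights):
    """Sum the per-picture highest scores over the whole series of pictures."""
    return sum(picture_score(themes, weights) for themes in protocol)


# Illustrative (invented) protocol: one list of scored themes per picture.
example_protocol = [
    ["verbal_assault", "murder"],
    ["interpersonal_friction"],
    [],
    ["rape", "minor_injury"],
    [],
    [],
    ["physical_assault"],
    [],
]
hostility_total = total_score(example_protocol, HOSTILITY_WEIGHTS)
```

In this invented protocol the first picture counts 3 (murder outranks verbal assault), so only the highest theme per picture enters the total.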

These unusually high correlations may be 
explained by the avoidance of interpretation. 
The raters had, in the main, merely to check 
whether certain statements had or had not 
been made. Some difficulty arose, however, in 
deciding whether a particular word or expres- 
sion could be treated as synonymous with re- 
morse. 

At the most indirect and ambiguous level, a 
tapping psychomotor test was administered. In 
a study by Foulds, McClelland, McClelland,
and Creasy,² the Tapping test was found to
differentiate significantly between paranoid 
and nonparanoid schizophrenic groups. In a 
recent investigation by Foulds and Caine 
(1958), female dysthymic patients were suc- 
cessfully distinguished from hysterics on this 


2 G. A. Foulds, W. J. McClelland, Marilyn McClelland, and Monica Creasy. Personal communication. 1958.

test. Since, in these studies, hysterics and
paranoids scored relatively high on tapping
and on various measures of hostility, the
Tapping test was introduced here to investi-
gate further its possible relationship to hos-
tility. In the administration here employed,
the S was merely instructed to tap on a stand-
ard sized piece of plain white paper, with a
pencil, as fast as he could for 10 seconds.


Table 1

Mean Score Comparisons of Melancholics and Paranoid States on Age, Vocabulary, MMPI Scales,
Sentence Building Test, and TAT Hostility and Guilt Ratings

Item                               N     Mean         SD        t       p
Age
  Melancholics                    17    51.12        6.48
  Paranoids                       14    50.78        8.92     .119    [illegible]
Mill Hill Vocabulary
  Melancholics                    17    26.53        5.81
  Paranoids                       14    25.86        4.87     .333    > [illegible]
Sentence Building test
  Acting out hostility
    Melancholics                  17     7.12        1.62
    Paranoids                     14     7.00       [?]03     .088    > [illegible]
  Projected hostility
    Melancholics                  17     5.35        3.25
    Paranoids                     14     8.21        3.36    2.323    <.05
  Guilt
    Melancholics                  17    10.00        2.00
    Paranoids                     14     6.79        2.65    3.733    <.001
MMPI hostility
  Acting out hostility
    Melancholics                  17     4.88        2.47
    Paranoids                     14     5.86        3.44     .891    [illegible]
  Projected hostility
    Melancholics                  17     4.47        3.53
    Paranoids                     14    10.07        5.83    3.182    <.01
MMPI self-criticism and guilt
  Melancholics                    17    15.41        4.06
  Paranoids                       14     7.43        4.87    4.807    <.001
TAT
  Hostility
    Melancholics                  12     3.50        2.90
    Paranoids                     14     6.36        3.52    2.150    <.05
  Guilt
    Melancholics                  12     1.75        1.64
    Paranoids                     14     1.57        1.88     .079    > [illegible]
MMPI depression
  Melancholics                    17    84.65       12.74
  Paranoids                       14    [illegible]  7.18    8.392    <.001




This was done three times. The spread of the 
resulting pencil dots was then determined by 
counting the number of half inch template 
squares entered for each trial, and the aver- 
age for the three taken. 
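The spread score described above amounts to counting the distinct template squares that contain at least one dot and averaging that count over the three trials. A minimal sketch in Python follows. It assumes, purely for illustration, that each dot is recorded as an (x, y) position in inches and that the half inch template grid is aligned with the origin of the sheet; neither detail is given in the text, and the function names are hypothetical.

```python
import math


def squares_entered(dots, square_size=0.5):
    """Count the distinct template squares containing at least one dot.

    dots: iterable of (x, y) positions in inches (assumed representation).
    """
    return len({
        (math.floor(x / square_size), math.floor(y / square_size))
        for x, y in dots
    })


def tapping_score(trials, square_size=0.5):
    """Average the number of squares entered over all trials."""
    counts = [squares_entered(dots, square_size) for dots in trials]
    return sum(counts) / len(counts)


# Invented data for three 10-second trials.
example_trials = [
    [(0.1, 0.1), (0.2, 0.3), (0.6, 0.1)],   # two squares entered
    [(0.1, 0.1)],                           # one square entered
    [(0.1, 0.1), (1.2, 1.2), (0.7, 0.7)],   # three squares entered
]
example_score = tapping_score(example_trials)
```

A score of this kind reflects how widely the dots scatter rather than how many taps were made, which bears on the later remark that force and speed are not separately recorded.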

Since the present battery of tests was mainly 
of a verbal nature a minimum of Grade IV 
(i.e., above the 10th centile) on the Mill Hill 
Vocabulary Scale (Raven, 1954) was set. As 
a diagnostic check the depression scale of the 
MMPI was included. 


Subjects 

Three chronic hospitalized paranoids and 
all paranoid and melancholic female patients 
fulfilling the verbal level criterion who en- 
tered Runwell Mental Hospital during the 
period September 1957 to February 1958 
were considered for testing. Of those entering 
the hospital, three paranoids could not be in- 
cluded since one refused to cooperate, one 
was deaf, and one had been leucotomized. 
Two of the melancholics had to be rejected 
as untestable. Care was taken not to include 
any “mixed” cases of paranoids with depres- 
sive features or melancholics with paranoid 
features, since these are to form the basis of 
a separate investigation. In all, 14 paranoids 
and 17 melancholics were tested, but only 12 
of the latter took the TAT. 


Results 


It was considered that the distributions of
scores on age and tests, with the exception
of tapping, were sufficiently normal to warrant
the use of a t test comparison, and the results
are set out in Table 1. For the Tapping test
the nonparametric Mann-Whitney U Test
(for N2 between 9 and 20) was used.
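For readers who wish to check the tabled t values against the tabled means and standard deviations, a two-sample t ratio can be computed from summary statistics alone. The sketch below is an illustration, not the authors' stated computation. It assumes that the tabled SDs were calculated with N rather than N - 1 in the denominator, an assumption made here because it brings the result close to the published figures; the function name is hypothetical.

```python
import math


def t_from_summary(n1, mean1, sd1, n2, mean2, sd2):
    """Pooled-variance t ratio from group sizes, means, and SDs.

    Assumes sd1 and sd2 are 'population' SDs (divisor N), so the sums of
    squares are recovered as n * sd**2 before pooling on n1 + n2 - 2 df.
    """
    pooled_var = (n1 * sd1 ** 2 + n2 * sd2 ** 2) / (n1 + n2 - 2)
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error


# Sentence Building test, projected hostility (values from Table 1):
# paranoids N = 14, M = 8.21, SD = 3.36; melancholics N = 17, M = 5.35, SD = 3.25.
t_projected = t_from_summary(14, 8.21, 3.36, 17, 5.35, 3.25)
```

With these inputs the ratio comes out at approximately 2.32, in line with the tabled value of 2.323. Treating the SDs as N - 1 estimates would give a somewhat larger ratio.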

It will be seen from Table 1 that there
were no significant differences between the
groups in age or vocabulary level.

Of the specially devised MMPI scales, the
acting out hostility scale failed to differentiate
between the groups. On the other hand, the
paranoid group had a significantly higher score
on the projection of hostility scale and sig-
nificantly lower scores on the self-criticism/
guilt scale. Supporting this finding, at the
more covert level of response of the Sentence
Building test, the acting out hostility sen-
tences failed to differentiate, but both the
projected hostility and guilt sets have done
so in the same direction as the MMPI scales.
At the most covert level, the paranoid TATs 
were given higher hostility ratings than were 
the melancholics. At this level the guilt meas- 
ure failed to differentiate. The Tapping test 
was, surprisingly, the most successful diag- 
nostic test of the battery. The median score 
of the melancholics on this test was 2, that 
of the paranoids was 21. The score rankings 
gave a U value of 22, yielding a p of <.002. 


Discussion 


This study supports the contentions of 
Siegel that the varying directions of hostility 
(if guilt be regarded as introjected hostility) 
can be objectively measured. Siegel (1957) 
suggests that individuals may vary more in 
the manner and direction in which they ex- 
press hostility than in the amount of hostility 
they possess, and it may well be that the ap- 
proach used in the present study could be 
used to clarify this point. The value of dis- 
tinguishing various aspects of hostility and 
guilt has been demonstrated and further logi- 
cal divisions are indicated if the study were 
to be extended to other groups. Two further 
divisions would be to separate a general “criti- 
cism of others” from complaints of the direct 
receipt of hostile acts on the part of others, 
and a separation of self-criticism (Alexander’s 
inferiority feelings?) from admission of guilt 
and wickedness.³

The use of different levels of expression 
might well be a measure of the amount of 
hostility possessed. Normal, neurotic, and psy- 
chopathic groups would be expected to differ 
from psychotics and between themselves, not 
only in the aspects of hostility and guilt ex- 
pressed, but also in the level of expression. 
One suspects that the consistency at all levels 
of response might not hold for neurotics and 
normals, and that this may well indicate the 
profundity and pervasiveness of the feelings 
for the individual. Before this can be at- 
tempted, however, improvements in the pres- 
ent battery of tests would be required, par- 
ticularly at the covert levels. It was not 
possible in the TAT thus administered, for 

3 This has now been done for a number of neurotic and psychotic groups. A paper is being prepared for publication.

example, to isolate different aspects of hos-
tility satisfactorily, and since an expression
of guilt tended to accompany acts of hostility 
in the stories, this measure of guilt failed to 
differentiate between the groups. This may 
well be more culturally than emotionally de- 
termined. It is of considerable interest that 
although the paranoids avoided acting out 
hostility on the MMPI scales and Sentence 
Building test to the same degree as the melan- 
cholics, they gave free rein to their feelings 
on the TAT, in which stories of murder, rape, 
violence, and quarrelling were marked. 

With the Sentence Building test there was 
considerable loss of distance. Patients often 
related their response directly to themselves, 
denying or confirming the truth of a state- 
ment made out of the words. That many of 
the patients were aware of the possibility of 
alternative responses was obvious from the 
remarks to this effect, from such comments 
as, “Oh, I couldn’t say that,” or exclamations 
followed by a neutral sentence. It can be ar- 
gued that the paranoid patient believes that 
his only way of obtaining release from the 
hospital lies in convincing the staff of the 
justification of his position and attitude. An 
admission of inadequacy, failure, or guilt on 
his part may seem damaging to his case. 
Hence his reluctance to voice sentences ex- 
pressing these feelings, which seem to him, 
through loss of distance, to have personal 
relevance. The melancholics, on the other 
hand, are anxious to impress one with their 
wickedness, and, indeed, are often worried 
that the staff may disbelieve their protesta- 
tions of unworthiness, failure, and guilt. 
Hence their choice of guilt sentences. A ta- 
chistoscopic presentation, to facilitate a closer 
examination of perception and report with re- 
gard to this test, is indicated. 

Little can be said of the Tapping test be-
yond the observation that the group express-
ing least hostility had lower tapping scores,
as measured by the number of template
squares entered.


An explanation of this finding might be 
offered, perhaps, in terms of the psychomotor 
slowing of depressed patients, rather than of 
hostility. An elaboration of the scoring of this 
test might be helpful in clarifying this point, 
since the clinical impression in administering
the test was that force rather than speed had
more bearing on the results. Some means of 
recording the number of taps and the pres- 
sure of tapping would be a useful addition. 


Summary 


Seventeen melancholic patients and 14
paranoid states were given a battery of psy-
chological tests purporting to measure the
expression of some aspects of hostility and
guilt at various levels of response, from an
overt, direct expression of attitude to a covert,
indirect one. Consistent differences were found
between the groups at all levels. A number
of problems worthy of further investigation
have been discussed.
A sociological conception of a continuum
of socialization, running from persons of ex- 
emplary probity and rectitude at one end, 
through persons of more typical and less 
beneficent coadunations of positive and nega- 
tive propensities, to persons of frankly errant 
and wayward impulse can easily be deline- 
ated. The precise location of a particular in- 
dividual or group along this continuum may 
pose something of a problem, but the existen- 
tial nature of the continuum itself and the 
fact that persons can and do make ready and 
reliable judgments in reference to it cannot 
be gainsaid.

An adequate theory of socialization must 
pay attention to this continuum and be in 
consonance with it. This admonition also ap- 
plies to any procedures of measurement em- 
ployed in behalf of the theory. That is, a scale 
of measurement for socialization should po-
sition individuals in the “asocial,” “normal,”
and “supernormal” zones of the continuum in
general accordance with the verdict which the 
sociocultural environment has handed down 
concerning them.

If the scale of measurement is psychologi-
cal, appealing to factors of individual re-
sponse and calibrated on such reactions, then
the problem is a psychosociological one, as
it were, requiring the scaling of the psycho-
logical dimension in such a way as to covary
with the sociological one. Discrepancies are
of course to be expected in individual in-
stances between the sociological baseline and
the psychological measurement, if for no other
reason than that the culture will occasionally

Early work on this program of research was
supported by a series of grants from the National
Institute of Mental Health, U. S. Public Health
Service. Since 1953 the program has been supported
by two basic research grants awarded by the Ford
Foundation.


make mistakes in putting some men in prisons 
and others in positions of trust and responsi- 
bility. One of the benefits to be derived from 
a psychological method for assessing the so- 
cialization continuum is that errors of this 
kind can be identified. 

For a number of years the author has been 
concerned with a theoretical formulation of 
the socialization problem which attempts to 
do justice to this notion of the continuous 
nature of the function (Gough, 1948, 1954;
Gough & Peterson, 1952). Perhaps a brief 
summary of the theoretical position would be 
in order here. Socialized behavior is behavior 
based on a proper viewing of the self as a so- 
cial object, where the terms self and social ob- 
ject are used in the sense defined by Mead 
(1934). That part of the personality which 
links an individual to the social community 
is the “self.” The sense of self, or view of 
self, is a product of social interaction and of 
the capacity of the individual to view him- 
self as an object; that is, from the standpoint 
of the other. The self, therefore, has its roots 
in role taking, in a developing objectification 
of the critiques and evaluations of oneself. In 
early life these views of self (called “me’s” 
by Mead) may be discrete and unintegrated, 
but in time a certain communality and con- 
sistency in the patterns permits the evolve- 
ment of a conception of the “generalized 
other” which represents social reality as wit- 
nessed by the self. 

Such a process of introjection of societal 
standards is always coupled with a degree of 
uncertainty or independence in the expression 
of the self. This unpredictable element in the 
self is called the “I” by Mead. On the other 
hand, social adaptability, cooperation, and 
socialization itself are functions of the “me’s,” 
that is, of the role taking experiences and the 
role taking capacities. Thus the degree to
which the person will be able to govern in- 
ternally his thought and behavior in accord- 
ance with the imperatives of his culture will 
be a consequence of the depth and validity 
of the role taking experiences that he has 
enacted. 

In an earlier paper (Gough, 1948) the au- 
thor tried to show how this theoretical posi- 
tion could be brought to bear on the problem 
of psychopathy, and how the various symp- 
tomatic expressions of psychopathy could be 
deduced from the fundamental propositions of 
role theory. Later (Gough & Peterson, 1952) 
an attempt was made to develop a measuring 
instrument for asocial behavior using ques- 
tionnaire materials derived on the basis of 
the role theory and focusing on the domain 
of social interaction and role taking experi- 
ences. Diagnostic items pertaining to the self 
view and role psychology were assembled and 
then administered to experimental samples, 
male and female. These included high school 
students, nondelinquent disciplinary problems 
in high school, and institutionalized delin- 
quents. Statistical analysis permitted the 
identification of 64 items (from some 200 
evaluated) having significant correlations with 
the socialization criterion. 

These 64 items were grouped into a scale 
which was then cross-validated on a military 
sample of 1,092 inductees vs. 99 stockade 
prisoners, and a second military sample in
which 144 prisoners with two or more offenses 
were compared with 209 first offenders. The 
difference between the mean of the inductees 
and that of the prisoners was 7.26, critical 
ratio 11.52. For the first offenders vs. reci- 
divists the difference between means was 2.45, 
critical ratio 3.31. The signs in each instance 
were in the predicted direction, but even with 
use of the two-tailed test both critical ratios 
give probabilities well under .01. 

However, these two cross-validational cita- 
tions cover only two points on the socializa- 
tion continuum: delinquents vs. nondelin- 
quents and first offenders vs. repeaters. It 
must also be shown that socialized behavior 
can be brought under the purview of the
theory and its scale of measurement to the 
same extent as asocial behavior. A scale or 
theory which would apply only to the distinc- 
tion between delinquents and nondelinquents
would be a limited one indeed. The goal of 
the theory, on the contrary, is to encompass 
the full range of phenomena implied by the 
continuum and the goal of the scale is to lo- 
cate persons and groups in their proper places 
in any of its zones.

To test the validity of the scale in making 
such differentiations, a greater variety of sam- 
ples was needed than reported in the 1952 
paper, especially samples in the “more so-
cialized” regions of the continuum. Such sam-
ples have gradually been obtained over the
past seven years, in a sufficient number it
would now seem to warrant analysis and re-
porting.


Shortening of the Scale 


As a first step, the 64-item scale developed
in the 1952 study was item analyzed against
the first offender vs. repeater dichotomy. Such
an analysis follows logically from the theo-
retical conception of socialization-asocializa-
tion as a continuum, and from the contention
that individual items as well as the full scale
should possess validity at different regions of
this continuum. Ten items were eliminated
by the analysis, leaving 54 in the present
scale.² The following are representative of
those retained (shown with the scored re-
sponse for socialization).


1. Before I do something I try to consider how
my friends will react to it. (true)
2. I often think about how I look and what im-
pression I am making upon others. (true)
3. I would rather go without something than ask
for a favor. (false)
4. I find it easy to “drop” or “break with” a
friend. (false)
Because the purpose of the scale is to position
either individuals or groups along the
underlying socialization continuum, it has
been designated “So” for socialization. A defi-
nition of the psychological implication of the
scale can be given as follows: “To indicate
the degree of social maturity, integrity, and
rectitude which the individual has attained.”


2 This 54-item version is the one included in the
California Psychological Inventory (Gough, 1957).
The CPI Manual gives a great deal of clinical and
psychometric information about the scale.
Table 1

Means and Standard Deviations on the California Psychological Inventory So
(Socialization) Scale for Male Samples Indicated

More Socialized

Sample                                              N       M       SD
 1. Nominated high school “best citizens”          90   39.44     4.95
 2. Medical school applicants                      70   39.27     4.82
 3. Banking executives                            121   39.00     4.59
 4. Regional wholesale flour [illegible]           85   38.31     4.37
 5. City school officials                         200   37.58     4.19
 6. Psychiatric aides                             132   37.57     4.37
 7. Business executives                           116   37.47     4.72
 8. College students                             1745   37.41     5.28
 9. Civil Service supervisory personnel           122   37.25    [?]98
10. Electronic technicians                         55   36.93     5.66
11. Correctional officers                         620   36.72     5.47
12. Semiskilled workers                           108   36.62     5.17
13. High school students                         4474   36.46     5.95
14. Social work graduate students                 182   36.40     4.62
15. Military officers                             495   36.38     4.74
16. Machine operators                             105   35.99     4.98
17. Psychology graduate students                  142   34.60     4.13
18. Selective service inductees                   139   32.83     6.71
    Total                                        9001   36.74     5.61

Less Socialized

Sample                                              N       M       SD
 1. High school disciplinary problems              91   31.25     5.40
 2. County jail inmates                           177   29.27     6.44
 3. Prison inmates, New York                       94   28.28     5.80
 4. Young delinquents                             426   28.07     5.72
 5. Prison inmates, California                    177   27.76     6.03
 6. Training school inmates, New York             100   26.53     4.89
 7. Inmates, federal reformatory                  230   26.23     6.53
    Total                                        1295   27.98     6.08

Note. M1 - M2 = 8.76; CR = 48.94; P < .001; biserial r = +.73.


New Samples 


The present findings are based upon an en- 
tirely new series of 41 research samples, to- 
talling 1,295 male delinquents and 9,001 non- 
delinquents, and for females 784 delinquents 
and 9,776 controls. The samples cover a wide 
spectrum of the socialization continuum, rang- 
ing from nominated “best citizens” through 
various occupational and professional groups, 
through disciplinary samples, to known de- 
linquents and prison inmates. 

Table 1 lists the male samples tested and 
presents summary statistics on the 54-item So 
scale. The samples are ranked by mean score 
on So, with “more socialized” or nondelin- 
quent groups on the left, “less socialized” or 


delinquent on the right. The highest mean 
score among the 25 samples is observed in 
that of the nominated best citizens, and the 
lowest in the sample of federal reformatory
inmates. In general, the rank ordering of sam-
ples by the So scale seems to accord quite
well with what would occur if they were
ranked sociologically for socialization. It
should be noted that all seven of the sam-
ples carrying some explicit designation of
asocial behavior score below the mean of the 
lowest scoring of those samples not so desig- 
nated. Comparison of the 9,001 cases in the 
more socialized group of samples with the 
1,295 cases in the less socialized group gives 
a difference between the means of 8.76, criti-
cal ratio 48.94, P < .001. The biserial corre-
lation with the So scale for this same di- 
chotomy is +.73. 
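The critical ratio and biserial correlation quoted here can be recomputed from the totals in Table 1. The following sketch is illustrative only. It assumes the critical ratio is the difference between means divided by the unpooled standard error, and it uses the conventional biserial formula (mean difference over total SD, multiplied by pq over the normal ordinate at the point of dichotomy). The function names are hypothetical, and the source does not state which computing formulas were used.

```python
import math
from statistics import NormalDist


def critical_ratio(n1, m1, sd1, n2, m2, sd2):
    """Difference between means divided by its (unpooled) standard error."""
    standard_error = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (m1 - m2) / standard_error


def biserial_r(n1, m1, sd1, n2, m2, sd2):
    """Biserial correlation from the summary statistics of two groups."""
    n = n1 + n2
    p, q = n1 / n, n2 / n
    grand_mean = p * m1 + q * m2
    # Total variance = within-group variance + between-group variance.
    total_var = (p * (sd1 ** 2 + (m1 - grand_mean) ** 2)
                 + q * (sd2 ** 2 + (m2 - grand_mean) ** 2))
    # Ordinate of the unit normal curve at the point cutting off proportion p.
    ordinate = NormalDist().pdf(NormalDist().inv_cdf(p))
    return (m1 - m2) / math.sqrt(total_var) * (p * q / ordinate)


# Male totals from Table 1: more socialized vs. less socialized.
cr_males = critical_ratio(9001, 36.74, 5.61, 1295, 27.98, 6.08)
r_males = biserial_r(9001, 36.74, 5.61, 1295, 27.98, 6.08)
```

Under these assumptions the male totals give a critical ratio close to 48.9 and a biserial coefficient close to .73, agreeing with the reported figures.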

Quite clearly, the So scale locates male 
samples along the socialization continuum in 
the way required by the underlying theory. 
The question now is, can this same differen- 
tiation be demonstrated for women? Table 2 
presents summary statistics on the CPI So 
scale for 16 female samples. The progression
of mean scores for the female samples in 
Table 2 shows the same correspondence to the 
underlying socialization continuum as was 
previously observed for the male samples. 
There is again a perfect separation between 
the nine samples in the more socialized col- 
umn and the seven less socialized samples 
manifestly designated, in one way or another, 
for some defection from the socialization 
norms. For both the male and female sam- 
ples the validity of the specific placement of 
a sample on the socialization continuum by 
the average So score might be disputed, but 
the general correspondence between the psy-
chometric ordering and the sociological hier-
archy is remarkable. The statistical tests are
also significant for the female samples, with 
a difference between the means of the more 


Table 2

Means and Standard Deviations on the California Psychological Inventory So
(Socialization) Scale for the Female Samples Indicated


vs. less socialized cases of 9.52, critical ratio
37.83 (P < .001), and a biserial r of +.78.

If one is willing to accept these results as
indicating the validity of the So scale as a
measure of socialization, some interesting ob-
servations can be made about the relative
standings of the groups tested. For example,
there is the bemusing finding that psycholo-
gists (University of California, Berkeley, psy-
chologists, it might be cautioned) tend to
rank rather low on the socialization con-
tinuum, although fortunately not quite so low
as to fall into the outrightly troublesome re-
gion of the dimension. The sample of unmar-
ried mothers stands midway in socialization
between high school disciplinary problems
and county jail inmates. In both male and fe-
male rankings incarcerated persons score at
the low end of the scale. It should also be
noted that female samples in nearly all in-
stances score significantly higher than their
male counterparts.

The division of cases in Tables 1 and 2
into more socialized and less socialized is
maintained in Table 3, where percentage fre-
quency distributions are presented. An in-
dication of possible cutting scores and of
screening efficiencies for the usual delinquent

More Socialized

Sample                                              N       M       SD
1. Nominated high school “best citizens”           90   41.51     4.55
2. High school students                          5295   39.69     5.55
3. College students                              3452   39.37     5.05
4. Factory workers                                291   38.99     4.76
5. Psychiatric aides                               67   38.70     3.91
6. Nurses                                         142   38.24     4.89
7. Airline hostesses                               60   38.07     4.51
8. Social work graduate students                  320   37.99     4.38
9. Psychology graduate students                    59   36.44     3.93
   Total                                         9776   39.46     5.30

Less Socialized

Sample                                              N       M       SD
1. High school disciplinary problems               87   34.79     [illegible]
2. Unmarried mothers                              213   [illegible]  5.86
3. County jail inmates                             51   29.61     [illegible]
4. Prison inmates, Indiana                        127   28.37     5.68
5. Prison inmates, California                     135   28.36     [illegible]
6. Prison inmates, Wisconsin                       76   26.83     [illegible]
7. Young delinquents, California                   95   25.83     [illegible]
   Total                                          784   29.94     [illegible]

Note. M1 - M2 = 9.52; CR = 37.83; P < .001; biserial r = +.78.
Table 3

Proportion of Cases Scoring at or above the Indicated
Scores of the California Psychological Inventory So
(Socialization) Scale for Ss Classified as More
Socialized and Less Socialized

                       Males                       Females
So Scale        More          Less          More          Less
Score        Socialized    Socialized    Socialized    Socialized
             N = 9001      N = 1295      N = 9776      N = 784
   42           2[?]           1            40             4
   41           28             2            47             6
   40           34             3            55             9
   39           41             4            62            11
   38           48             6            68            15
   29           91            47            97            58
   28           94            53            98            64
   27           95            60            98            68
   26           97            65            99            72
   25           98            70            99            76
   24           98            75            99            81
   23           99            81            99            85
   22           99            86            99+           88

vs. nondelinquent dichotomy can be gained 
from this table. 
However, one might still ask whether any
practical validity of the So scale for making
the two-category classification in mass testing
might not be vitiated by a base rate limita-
tion. That is, if Category A is
quite rare, the rules of inverse probability
might make it more valid (accurate) in the
long run simply to predict that everyone
tested will be non-A. As Cureton (1957) has
shown, this problem can be handled by proper
setting of cutting scores so as to allow for
base rate asymmetries. The best cutting
score for predicting a dichotomy
(delinquent vs. nondelinquent here) from a
continuous variable (the So scale) is the point
of intersection of the smoothed frequency dis-
tributions of the two classes on the continuous
variable, where the areas of the two distribu-
tions are proportional to the base rates.
The question in our case is what base rate
frequencies to use? As an example, suppose
it is assumed that the true incidence of de-
linquent personalities is 10% in the popula-
tion. Using Cureton’s method for smoothing
the distribution curves for the groups
in Table 3 and checking for the in-
tersections leads to the specification of a cut-
ting score of 23 (and below) for delinquency
in males, and of 26 (and below) in females. 
If the true incidence of individuals sufficiently 
asocial to be classed as delinquent personali- 
ties is at least 1 in 10, then diagnostic classi- 
fications of all men with scores of 23 and be- 
low as delinquent types, and as nondelinquent 
types those with scores of 24 and above, will 
be more accurate than the 90% accuracy 
level which would automatically follow from 
classifying everyone tested as a nondelinquent. 

With a more misanthropic view that delin- 
quent personalities in reality constitute as 
much as 20% of the population, the optimum 
So cutting score for males is at 25 or below 
for delinquency. For females, the one in five 
assumption determines a cutting score of 28 
(and below). 
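The claim that these cutting scores improve on the base rate can be checked directly from the cumulative percentages in Table 3. The sketch below is an illustration and does not reproduce Cureton's smoothing procedure. It simply weights the percentage correctly classified in each group by an assumed base rate; the function and argument names are hypothetical.

```python
def overall_accuracy(base_rate, pct_more_at_or_above, pct_less_at_or_above):
    """Expected proportion correctly classified at a given cutting score.

    base_rate: assumed proportion of delinquent personalities.
    pct_more_at_or_above / pct_less_at_or_above: Table 3 percentages of the
    more and less socialized groups scoring at or above (cutting score + 1).
    """
    correct_nondelinquent = pct_more_at_or_above / 100
    correct_delinquent = 1 - pct_less_at_or_above / 100
    return ((1 - base_rate) * correct_nondelinquent
            + base_rate * correct_delinquent)


# 10% base rate: males cut at 23 and below (Table 3 row for score 24),
# females cut at 26 and below (row for score 27).
acc_males_10 = overall_accuracy(0.10, 98, 75)
acc_females_10 = overall_accuracy(0.10, 98, 68)

# 20% base rate: males cut at 25 and below (row for score 26),
# females cut at 28 and below (row for score 29).
acc_males_20 = overall_accuracy(0.20, 97, 65)
acc_females_20 = overall_accuracy(0.20, 97, 58)
```

On the rounded percentages of Table 3, each of these accuracies exceeds the figure obtained by calling everyone nondelinquent (.90 under the 10% assumption, .80 under the 20% assumption), though the margin for males at the 10% base rate is small.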

Perhaps two comments should be inter- 
jected before closing this section. The first is 
that the possibility of deriving dichotomies of 
the above type does not in any way alter the 
fundamental theory of the scale and of its 
psychosociology as representing a continuous 
socialization function. The second is that 
specifications for optimum use of the scale 
in other categorization problems could just as 
easily be made, for example, for super-citizens 


vs. all others. 


Work of Others 


A number of other investigators have car- 
ried out studies using the CPI So scale which 
help to illuminate the relationship between 
the measure and the theoretical continuum of 
socialization hypothesized to underlie it. One 
of these studies is that by Clark Vincent of 
unwed mothers.³ The So scale data were gath-
ered for 232 subjects (Ss), tested in several
public and private maternal care centers. For 
this sample the mean So score was 32.25, SD 
6.58. These figures closely approximate those 
for the smaller sample of 213 reported in 
Table 2 earlier. 

The theoretical rationale for the So scale, 
however, requires a more detailed analysis of 
this sample, an analysis attentive to the gra- 
dations of the continuum. In the present 

3 Permission to use these data was very kindly 


given by Vincent, Univer. of California, in his book, 
Unwed Mothers, to be published. 


instance socialization gradations can be re-
flected by the number of pregnancies of each
S. Classification on this basis gave the follow- 
ing So results: 


Group                          N       M      SD
1. One child                 201   33.60    5.65
2. Two children               18   24.39    5.98
3. Three or more children     13   22.23    4.16


The progression of means here is that speci- 
fied by the theory of the scale, and is sup- 
ported statistically by the significance of the 
F test over the three means. Groups 2 and 3 
can be combined into a sample of 31 Ss hav- 
ing had two or more illegitimate children. 
The biserial correlation for this sample vs. the 
201 females having only one child is +.83. 
This coefficient offers rather striking evidence 
of the power of the So scale to discriminate 
within a certain range of the continuum. 
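As a consistency check, the three subgroup means and standard deviations can be pooled to see whether they recover the total-sample figures of 32.25 and 6.58 reported above. The sketch below is illustrative. It assumes that the SDs were computed with N in the denominator, and the function name is hypothetical.

```python
import math


def combine_groups(groups):
    """Pool (n, mean, sd) triples into one overall (n, mean, sd).

    Assumes each sd is a 'population' SD (divisor N).
    """
    n_total = sum(n for n, _, _ in groups)
    mean_total = sum(n * m for n, m, _ in groups) / n_total
    # Sum of squares about the grand mean: within-group plus between-group.
    sum_sq = sum(n * (sd ** 2 + (m - mean_total) ** 2) for n, m, sd in groups)
    return n_total, mean_total, math.sqrt(sum_sq / n_total)


one_child = (201, 33.60, 5.65)
two_children = (18, 24.39, 5.98)
three_or_more = (13, 22.23, 4.16)

n_all, mean_all, sd_all = combine_groups(
    [one_child, two_children, three_or_more])
n_multi, mean_multi, sd_multi = combine_groups(
    [two_children, three_or_more])
```

The pooled values for all 232 Ss come out close to the reported mean and SD, and the same function gives the summary statistics for the combined sample of 31 Ss with two or more children.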

The mean scores listed above can also be 
reviewed for hypotheses about the proper 
location of each sample on the socialization 
continuum. The Ss having one child score at 
about the same point as high school discipli- 
nary problems; Ss having two or more chil- 
dren fall into a distinctly lower region of the 
dimension, into the range populated by the 
delinquent and criminal samples. 

The next study to be considered is that of 
Donald (1955). He administered the So scale
to 230 consecutively admitted federal re-
formatory inmates. The mean of 27.77 (SD
6.53) places the total sample in its proper
position on the continuum. The question for
analysis is again that of differentiation within 
the total sample. Five of Donald’s compari- 
sons can be reported. The first is one in which 
socialization theory would not predict a dif- 
ference, a comparison between white and Ne- 
gro inmates. This expectation was confirmed, 
as the two part-samples did not differ signifi- 
cantly. The mean for 56 Negro inmates was 
29.39, for 174 white inmates 27.75; however, 
the t test of the difference was not significant.

A second of Donald’s comparisons was be- 
tween 135 Dyer Act commitments, M = 25.96, 
SD = 6.14, and the remaining 95 cases, M = 
30.34, SD = 6.19. Dyer Act offenses are in 
general more serious, and this difference is 
therefore in the predicted direction; the t
ratio of 5.3 is significant well beyond the .01
level.

Comparison 3 pitted 111 boys with zero or
one previous commitment vs. 119 boys with
two or more. The means and standard deviations
were 29.72 and 6.27 vs. 25.95 and [SD illegible].
The difference is in the expected direction,
and the t ratio of 4.5 is again statistically
significant.

The fourth contrast was between boys
whose first commitment occurred at age 15 or
before vs. those whose first commitment came
at age 16 or later. Socialization theory would
predict lower So scores for the former group.
Findings were in accord with these expectations;
for the 96 boys in the first group M =
24.76, SD = 5.89, and for the 134 in the
second M = 29.93, SD = 6.09. The t ratio
for this difference was 6.4, p < .01.

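The t ratios reported for these comparisons can be approximately reproduced from the published summary statistics alone. The sketch below assumes a pooled-variance, independent-samples t and treats the printed SDs as sample standard deviations; the source does not say which formula Donald used, and the function name `pooled_t` is hypothetical.

```python
import math

def pooled_t(m1, sd1, n1, m2, sd2, n2):
    """Independent-samples t from summary statistics, pooled variance (assumed formula)."""
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / se

# Comparison 2: the 95 remaining cases vs. the 135 Dyer Act commitments
t_dyer = pooled_t(30.34, 6.19, 95, 25.96, 6.14, 135)

# Comparison 4: first commitment at 16 or later (n = 134) vs. at 15 or before (n = 96)
t_age = pooled_t(29.93, 6.09, 134, 24.76, 5.89, 96)

print(round(t_dyer, 1), round(t_age, 1))
```

Under these assumptions both values agree to one decimal place with the ratios of 5.3 and 6.4 given in the text.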
The last comparison is a novel one, contrasting
boys committed for “moonshining” [...] law
infractions vs. all others. Federal liquor law
violations lead to federal institution commitment,
but from a socialization standpoint
seem more akin to troublesome or moderately
wayward behavior than to the asocial nature
of most felonies. Therefore, one would expect
higher So scores for this special sample. Such
indeed was the case, for the 18 “moonshiners”
had a mean score of 32.44, SD 6.99, and the
212 remaining inmates had a mean of 27.[decimals illegible],
SD 6.33. This difference is also statistically
significant, the t ratio being 2.9. The absolute
level of the So mean, 32.44, should also be
noted. It would rank the “moonshiners” just
above “high school disciplinary problems” in
the socialization hierarchy (see Table 1).

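Because each of Donald's comparisons splits the same 230 inmates into two groups, the subgroup means weighted by their Ns should recombine to the total-sample mean of 27.77. The following is a minimal sketch of that consistency check, using only figures printed in the text; the helper name `combined_mean` is hypothetical, and the "implied" value at the end is an arithmetic inference, not a figure read from the source.

```python
def combined_mean(groups):
    """Weighted mean of a list of (n, mean) pairs."""
    total_n = sum(n for n, _ in groups)
    return sum(n * m for n, m in groups) / total_n

splits = {
    "Dyer Act vs. remaining cases": [(135, 25.96), (95, 30.34)],
    "0-1 vs. 2+ previous commitments": [(111, 29.72), (119, 25.95)],
    "first commitment by 15 vs. 16 or later": [(96, 24.76), (134, 29.93)],
}
recombined = {label: round(combined_mean(g), 2) for label, g in splits.items()}
for label, value in recombined.items():
    print(label, value)

# The racial comparison as printed in this copy of the text
racial = combined_mean([(56, 29.39), (174, 27.75)])

# Mean implied for the 212 non-"moonshiner" inmates by the total mean of 27.77
implied_rest = (230 * 27.77 - 18 * 32.44) / 212
print(round(racial, 2), round(implied_rest, 2))
```

The three splits with legible means each recombine to 27.77. The racial comparison as printed recombines to roughly 28.15, so one of those two means may be misprinted or garbled in this copy; the implied mean for the remaining inmates in the last comparison is about 27.37, which is at least compatible with the surviving "27".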
The third study to be mentioned is the
Pilot Intensive Counseling Organization
Project (1956) being conducted at the Deuel
Vocational Institution in California under the
auspices of the California Youth Authority
and the California State Department of Corrections.4
In the PICO project, youth authority
wards are assigned on a random basis
at intake to experimental and control samples,
the experimental cases participating in an intensive
counseling program. The aim of the
project is to discover the effects of this counseling
program, hoping to find that the experimental
cases will show better parole and
postinstitutional records. The data reported
here are all drawn from cases in the control
sample.

4 The data reported here were very generously
made available by Sherwood Blair, PICO project
supervisor, and Alvin Rudoff, research coordinator.

The CPI is administered to all Ss in the 
study at the time of admission to the training 
school. From the project records three sam- 
ples were established for our use: (1) parolees 
receiving suspensions during the first four 
months of parole; (2) parolees suspended 
during the fifth or a later month; and (3) 
parolees still free of suspension after five or 
more months on parole. The presumed sociali- 
zation gradient is from lowest (1) to highest 
(3). CPI So scale statistics for these samples 


are as follows: 


Sample                                N            M            SD
1. Suspended within four months       [illegible]  [illegible]  5.10
2. Suspended after four months        [illegible]  [illegible]  [illegible]
3. Five or more months of parole
   without suspension                 226          28.25        5.78


The progression of mean scores is in the direction
specified, but the F test falls short of
statistical significance. The follow-up period
in the PICO project was not far enough along
at the time of this writing to permit designation
of a “successful discharge from parole”
sample; the third group above is only an approximation
of such a sample. If comparison
is limited to early suspensions on parole
(Sample 1) vs. later suspensions
(Sample 2), a critical ratio of 1.[digits illegible] for the
difference between the two means is obtained.
Using the one-tailed test, appropriate
here because the difference is in the predicted
direction, the probability level is between
.05 and .10.
A fourth reference to the work of other
investigators concerns the program of research
being conducted by Reckless and Dinitz and
their associates at the Ohio State University
(Dinitz, Kay, & Reckless, 1957; Reckless,
Dinitz, & Kay, 1957; Reckless, Dinitz, &
Murray, 1957). In the Reckless, Dinitz, and
Murray study, 125 boys from high delinquency
areas in Columbus, Ohio, were
nominated by teachers as being “insulated”
against delinquency. These nominations were
confirmed by a brief social service check of
school and police records. The mean So scale
score for this sample was 39.43, SD 6.42. 
Only one sample in Table 1, the high school 
best citizens, had a higher average score. In 
another study (Reckless, Dinitz, & Kay, 
1957), these 125 boys were compared with 
101 boys from the same high delinquency 
areas nominated by their teachers as being 
delinquency prone. The So scale statistics for 
this second sample were M = 31.40, SD =
7.99. The difference between this and the 
earlier mean was highly significant. 
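The size of this difference can be gauged from the summary statistics given in the text. The sketch below uses Welch's unequal-variance t as one reasonable choice, since the two SDs differ noticeably; the source does not report which test was actually applied, and the function name `welch_t` is hypothetical.

```python
import math

def welch_t(m1, sd1, n1, m2, sd2, n2):
    """Two-sample t from summary statistics without assuming equal variances."""
    se = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (m1 - m2) / se

# 125 "insulated" boys vs. 101 boys nominated as delinquency prone
t_insulated = welch_t(39.43, 6.42, 125, 31.40, 7.99, 101)
print(round(t_insulated, 2))
```

Under these assumptions the ratio is a little above 8, which is in keeping with the description of the difference as highly significant.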

The paper by Reckless, Dinitz, and Mur- 
ray (1957, p. 568) also contains a comment 
bearing directly on the view of socialization 
as a continuous dimension and on the ca- 
pacity of the So scale to differentiate within 
regions of this dimension: 


The delinquency vulnerability and social responsi- 
bility scales did more than differentiate between the 
potentially delinquent and non-delinquent nominees. 
The scales also discriminated significantly, within 
the sample of potential delinquents, between those 
who had and those who had not experienced previ- 
ous police and court contact. 


The “delinquency vulnerability” scale referred 
to in the quotation is the CPI So scale discussed
throughout the present paper.5 The
“social responsibility” scale is another one of 
the scales included in the full set of 18 in the 
California Psychological Inventory. 


Summary 


Arguments in behalf of viewing socializa- 
tion as a continuous dimension rather than 
as merely a dichotomy of social vs. asocial 
behavior were presented. A theory of sociali- 
zation was summarized, and a method of 
psychological measurement consonant with 
this theory was described. 

The greater portion of the paper was then 
taken up with findings seeking to demonstrate 
the validity of the measuring scale along the 
full range of the socialization continuum. 

Evidence from the writer’s inquiries as well
as from those of other investigators gave
strong support to the systematic validity of
the scale.

5 In the Ohio State study the scale was scored in
the opposite direction, so as to emphasize the delinquency
vulnerability implications. The mean values
reported in the present paper, however, have been
reconverted to So scale form, so that higher scores
are indicative of higher socialization levels.

Most Rorschachers believe that the Rorschach
test is a fairly sensitive indicator of
anxiety in the subject (S). However, there is 
much disagreement on specific anxiety signs, 
with the possible exception of the shading re- 
sponse. Eichler (1951a) listed 15 factors, 
which one or another expert alleged to be an 
indicator, but almost every Rorschach factor 
which can be quantified has been employed 

in this fashion in some investigation. The
usual design of such a study is to contrast
performances of groups of psychiatric patients
and normal individuals. In general,
these studies have tended to produce equivo- 
cal and often conflicting results. Interstudy 
differences in administration and scoring tech- 
niques may have caused these discrepancies in 
part, but the most probable confounding factor
is sampling. There appear to be so many
characteristics of the S which can influence 
Rorschach performance that the matching of 
patients and normals to eliminate bias is 
extremely difficult to accomplish. Successful 
matching is likely to be accomplished by pure 
accident, and the experimenter cannot be expected
to know when this has occurred.
The most obvious way around the sam- 
pling problem is to use the same Ss for both 
the normal and anxiety performances. This 
introduces a new problem: how to induce
anxiety in normal individuals. Many tech- 
niques have been used (cf. May, 1950) but 
their efficacies are usually questionable, if 
not downright dubious. 
The use of hypnotic suggestion to produce
emotional states in normals has been reported
periodically in the literature, and recently by 
Eichhorn and Tracktir (1955a, 1955b). The 
results suggest that this approach is rela- 
tively successful, and that it could be utilized 
to induce anxiety. 


Procedure 


The basic design of the study was to compare the 
Rorschach records of normal Ss as they would be 
obtained under ordinary circumstances with those 
obtained in an hypnotically induced anxiety state. 
In order to control for the effects of an hypnotic 
state itself on test performance, a record in hypno- 
sis without anxiety was also obtained. 

The Ss were 12 medical and nursing students, six 
males and six females. All Ss were volunteers and 
were paid for participation in the study. They had 
been selected from a pool of such Ss on the basis of 
two criteria: capacity to develop amnesia for perfor- 
mance in hypnosis, and general psychological adjust- 
ment. All Ss scored within usually accepted normal 
limits on the Taylor Anxiety Scale and the Barron 
Ego Strength Scale and were screened by psychiatric 
and medical examinations. 

The three records were obtained from each S on 
a single day in the following order: in the hypnotic 
condition, in hypnotically induced anxiety, and, 
finally, in the waking state. The S was made amnesic 
for the blots after each of the first two performances. 

Diffuse, nonspecific anxiety was induced in each S 
by a psychiatrist (H. J. Grosz) using a detailed sug- 
gestion in which the words “anxiety,” “fear,” “ap- 
prehension,” and “panic” were used. This use of a 
number of general synonyms for anxiety represents 
an effort to circumvent the effects of different pri- 
vate meanings of the single word “anxiety” among 
the Ss. The Ss were told that they did not know 
what was making them anxious. After the induction 
of anxiety, all Ss verbalized anxiety feelings and all 
showed some physical manifestations commonly as- 
Table 1

Factors Treated by Covariance Analysis
(Influence of R Removed)

                                      Adjusted Condition Means
Factor     F ratio      r with R     Waking   Hypnosis     Anxiety
W          6.49**       -.10          7.16     9.81        [illeg.]
Pr         1.02         [illeg.]      2.12     1.85        [illeg.]
F (-FM)    2.88         [illeg.]     15.19    14.17        [illeg.]
F (+FM)    1.71         .79**        17.89    18.45        [illeg.]
F+         5.42*        .65**         8.63     8.51        [illeg.]
M          0.12         -.04          3.60    [illeg.]     [illeg.]
FM         5.36*        .23           2.70     4.28        [illeg.]
Sum Y      [illeg.]     [illeg.]      2.52     2.23        [illeg.]
Sum C      0.51         .34           5.24     5.99        [illeg.]
y          0.06         -.25          1.73     1.79        [illeg.]
Sp         0.88         .29           3.23     4.13        [illeg.]
P          1.83         .50           6.39     6.77        [illeg.]
Tot H      2.05         .52*          5.43     5.48        [illeg.]
Tot A      0.49         [illeg.]     10.04    11.23        [illeg.]
Anat       0.45         .63**         2.15     2.80        [illeg.]
RY         1.83         [illeg.]      3.83     3.75        [illeg.]
FC         1.97         .28           4.17     2.96        [illeg.]
CF         1.65         .24           3.03     4.12        [illeg.]
Hd         1.57         .58**         1.74    [illeg.]     [illeg.]
Ad         0.45         .33           2.24     1.72        [illeg.]

* Significant at the .05 level.
** Significant at the .01 level.

chiatrist was 3.04, SD = .82. The 
rating was 2.92 with an SD of .83.


Results 


A total of 25 Rorschach factors were analyzed.
For 20 of these, the treatment was a
covariance analysis in which the influence of
R, the number of responses, was held constant.1
The F ratios among mean scores for
the three conditions, the correlation with R,
and the adjusted condition means for these
20 factors appear in Table 1. Three factors
(R, F+%, and median reaction time of first
response) were treated by variance analyses;
the data are in Table 2. There were several
factors whose incidence was too low and
whose distributions were too skewed to permit
covariance analyses. Two of these, M-
and pure C, were components of broader
factors which were found in the covariance
analyses to be unrelated to R. These factors
were analyzed by means of the Cochran Q
Test (Siegel, 1956) for correlative proportions
with more than two conditions. These
results are shown in Table 3; the figures under
the respective conditions show the number
of Ss whose records had at least one of
the factors.

1 The use of a covariance analysis to control for
the effects of R has [...] (1951b) as the most
eff[...] when N is small. Since [...] is ordinarily
required before applying the covariance approach.
This was not true in the present study. Most of the
distributions tended to be relatively symmetrical
about a maximum ordinate near the center of the
distribution. Means and medians coincided [...].
A transformation was therefore not [...]. The
primary explanation of the absence of skew apparently
lies in the relatively limited [...]
range in our sample. Most Rorschach variables, especially
determinants, show some relationship to intellectual
factors; this is reflected in their correlations
with R, as shown, for example, by Fiske and
Baughman (1953). R is itself heavily dependent
upon intelligence; as Beck (1954, p. 53) [...],
“the response total is one of the test’s [...]
of intelligence in liberation.” The relatively high
mean R in our group is a second [...] the
absence of skewness. This mean would fall at [...]
the 70th percentile of the R distribution for [...]
the normals or patients in the Fiske and Baughman
study. What this means, simply, is that our group produced
more scores above 2 on various factors and
fewer of the scores of 0 and 1 which form an important
causal component of the skewed distributions
usually obtained with Rorschach factors.
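For readers unfamiliar with the Cochran Q Test, the following is a minimal sketch of the computation for three related conditions. The per-subject 0/1 patterns are hypothetical: the source reports only the column totals for M- in Table 3 (1, 2, and 6 of 12 Ss), and the pattern below is simply one arrangement consistent with those totals. The function name `cochran_q` is likewise an assumption for illustration.

```python
import math

def cochran_q(rows):
    """Cochran's Q for per-subject tuples of 0/1 scores, one entry per condition."""
    k = len(rows[0])
    col_totals = [sum(r[j] for r in rows) for j in range(k)]
    row_totals = [sum(r) for r in rows]
    n = sum(row_totals)
    numerator = (k - 1) * (k * sum(c * c for c in col_totals) - n * n)
    denominator = k * n - sum(r * r for r in row_totals)
    return numerator / denominator, k - 1

# Hypothetical (waking, hypnosis, anxiety) patterns for 12 Ss; 1 = at least one M-
m_minus = [(1, 1, 1), (0, 1, 1)] + [(0, 0, 1)] * 4 + [(0, 0, 0)] * 6

q, df = cochran_q(m_minus)
# Q is referred to chi square with k - 1 df; for df = 2 the upper tail is exp(-Q / 2)
p = math.exp(-q / 2)
print(round(q, 2), df, round(p, 3))
```

With this arrangement Q comes to 8.40 on 2 df, matching the chi-square value listed for M- in Table 3; that agreement depends on the assumed per-subject patterns, which the article does not report.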

There are significant differences among the 
three conditions on 6 of the 25 factors: W, 
F+, FM, Sum Y, median reaction time, and 
M-. The inferences are clear-cut with respect
to F+, Sum Y, reaction time, and M-,
since in each of these four cases the
significance is a function of a difference be- 
tween the anxiety state on one hand and the 
other two states on the other. In the case of 
W, the conclusion is that its incidence is fa- 
cilitated by hypnosis and is then depressed 
by the anxiety condition. We may conclude, 
then, that as far as the results of this study 
are concerned, these five indices reflect anx- 
iety. In the case of FM, the significance is 
due to a common increase in both hypnosis 
and anxiety, with no difference between the 
two. The inference must be that the increased 
incidence is a result of the hypnotic state itself,
and not of anxiety. It is of incidental interest
that the F+ which is significant is not
F+% in the conventional sense but F+ as
a function of the total number of responses.
The conventional F+%, which appears in
Table 2, was not significant, though it is in
the right direction. Of the 15 factors suggested
as anxiety indices by Eichler (1951a),

Table 2

Factors Treated by Variance Analyses

                                               Condition Means
Factor                          F ratio   Waking   Hypnosis    Anxiety
R                               1.61      34.3     31.0        [illeg.]
F+%                             1.09      89       [illeg.]    [illeg.]
Median reaction time (secs.)    4.83*     24.4     18.0        31.2

* Significant at .05 level.


Table 3

Factors Treated by Cochran Q Test

                          Factor Incidence (N of Ss)
Factor    Chi Square    Waking   Hypnosis   Anxiety
M-        8.40*         1        2          6
Pure C    3.43          1        3          5

* Significant at .05 level.


12 appear in this study. Of these, two—W 
and Sum Y—significantly differentiated the 
anxiety state (interestingly enough, this was 
also Eichler’s result). Four others (R, Dd,
Hd, and P) are in the right direction. The
remainder (M, Sum C, Ad, Total A, anatomy
responses, and F) are unpredictive. Of interest
is the fact that F, as in the Eichler study,
seems to be reduced rather than increased in 
anxiety. Also as in the Eichler study, this 
seems to be a function of the increase in 
shading responses. The data of Table 1 also 
show that it makes very little difference 
whether F is scored to include animal move- 
ment responses a la Beck, or without them as 
in the Klopfer system. However, the fact that 
FM seems to be affected by hypnosis suggests 
that it has some merit in the general analysis 
of personality. 

About half of the factors treated by co- 
variance analyses are significantly related to 
number of responses, despite the small num- 
ber of Ss. This is, of course, not unexpected. 
Fiske and Baughman (1953) reported that 
all factors were related to R in a large sam- 
ple of outpatients, and all but a few in the 
Spiegel normal sample on which Beck’s 
(1950) norms are based. The absence of cor- 
relation of many variables in our study is 
probably due to the attenuated range of R, 
which is itself due to the limited intelligence 
range in the sample. It is interesting to note 
that the correlation of .50 between R and 
number of populars is very close to the coeffi- 
cient of .48 reported by McCall and Doleys 
(1955) for the Spiegel sample. 

Even though our results are based on a 
small group of Ss, they may nevertheless be 
more revealing than those of larger scale, con- 


trasting groups studies. Despite other limita- 
tions, there is no sampling bias involved in 
the selection of the “contrasting groups” in 
our investigation. Whether the use of indi- 
viduals of relatively high intelligence and 
hypnotic susceptibility restricts generaliza- 
tions from our results is a question requiring 
further study. Certainly the mean scores are 
not representative. This does not, however, 
mean that the change in mean scores from 
state to state might not be characteristic of 
a broader population. 


Summary 


Rorschach records were obtained from 12 
carefully selected, normal individuals in the 
waking state, in hypnosis, and in hypnoti- 
cally induced anxiety. Appropriate analyses 
of 25 quantifiable Rorschach factors indicate 
that decreases in W and F+, and increases 
in Sum Y, reaction time of first response, and 
incidence of M— reflect the anxiety state. An 
increase in FM was found to characterize the 


hypnotic state per se, but not the anxiety 
state.


Eugene E. Levitt and Hanus J. Grosz
In the psychological assessment of cortical
damage, the Spiral Aftereffect Test (SAET) 
has become an important and controversial 
technique. This technique was first introduced 
into the literature in 1949 by Freeman and 
Josey. Further pioneering of the SAET was 
conducted by Standlee (1953) and Price and 
Deabler (1955). 

The illusion, or aftereffect, following rotation
of the Archimedes Spiral has long been
known and used in experimental psychology 
(Boring, 1942). Research on the SAET is 
summarized in Table 1. 

It is noted that the mechanical technique 
used in presenting this test has varied consid- 
erably. Some research indicates high validity 
in differentiating subjects (Ss) with cortical 
damage from normals and functional states 
(Price & Deabler, 1955). Some research raises 
questions as to the usefulness of this test in
diagnosing brain damage (Gilberstadt, Schein, 
& Rosen, 1958). Researchers have had to construct
their own apparatus, and no two studies
have utilized identical apparatus and presentation.
These factors, as well as differences
in sampling technique and criteria, may account
for the variation in research results.
For instance, in evaluating the apparatus
over a considerable period of time, it was discovered
that any deviation of the motor shaft
caused ambiguous aftereffect perception.

The importance of base rates in predicting
valid measurements of cortical damage using
the SAET has been reported in recent literature
(Stilson, Gynther, & Gertz, 1957; Gilberstadt
et al., 1958).

1 The authors wish to acknowledge the considerable
part played in this study by Arturo G. Gonzalez,
whose skill and interest made the detailed criterion
data possible.

2 Now in private practice, Tampa, Florida.
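The base-rate argument can be illustrated with a short calculation. The sketch below is not taken from the studies cited; the sensitivity, specificity, and base-rate values are hypothetical, chosen only to show how the same test behaves in populations with different proportions of brain-damaged cases. The function name `predictive_values` is also an assumption.

```python
def predictive_values(sensitivity, specificity, base_rate):
    """Return (positive predictive value, overall hit rate) for a screening test."""
    true_pos = sensitivity * base_rate
    false_pos = (1 - specificity) * (1 - base_rate)
    true_neg = specificity * (1 - base_rate)
    ppv = true_pos / (true_pos + false_pos)
    hit_rate = true_pos + true_neg
    return ppv, hit_rate

# Hypothetical test: detects 80% of damaged cases, clears 80% of undamaged cases
ppv_half, hits_half = predictive_values(0.80, 0.80, base_rate=0.50)
ppv_tenth, hits_tenth = predictive_values(0.80, 0.80, base_rate=0.10)

# Hit rate obtained by calling every case "undamaged" when the base rate is .10
baseline_tenth = 1 - 0.10
print(ppv_half, ppv_tenth, hits_tenth, baseline_tenth)
```

With these illustrative figures, a positive result is correct 80% of the time when half the population is damaged but only about 31% of the time when one case in ten is damaged, and in the latter population the test's overall hit rate of .80 falls below the .90 obtained by predicting "undamaged" for everyone.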

Berger, Everson, Rutledge, and Koskoff 
(1958) conducted an elaborate study using 
extensive clinical criteria. Results indicated 
negative correlation when the SAET was com- 
pared with EEG records, pneumoencephalo- 
grams, and skull X-rays. Three studies are 
reported using children as the experimental 
sample (Spivack & Levine, 1957; Davids,
Goldenberg, & Laufer, 1957; Harding, Glassman,
& Helz, 1957). The results seem to indicate
the usefulness of the SAET in the
measurement of cortical involvement in these 
children. 

Of 16 studies reported, 10 have used four 
trials, and the remainder, from three to six 
trials in presenting the revolving spiral. Yates 
(1954), in his evaluation of research measur- 
ing cortical damage, indicates this area has 
been studied most inadequately. He refers to 
such criticism as: lack of comparable groups 
for validation studies, classification of corti- 
cal damage, the control of relevant factors 
such as age, sex, intelligence, subjective scor- 
ing systems, and adequate control of the test- 
ing procedures. 

The present study gave particular attention
to the above criticisms.


Procedure
Subjects 


Four hundred and twenty children between the 
ages of 5 and 16 years were examined during a period
of two-and-a-half years at an outpatient psychological
clinic. The variety of complaints and disorders ranged
from requests for a general psychological checkup to
referrals for differential diagnosis


Table 1

Showing the Various Mechanical Features, Experimental Procedures, Predictors, Criteria,
and Results in Spiral Aftereffect Test (SAET) Research

[Degree of spiral: the values 1260, 920, 920, 920, and 920 survive for this part of the
table, but their row alignment is lost. Rpm and disc size are assigned below in the order
in which they appear.]

Freeman and Josey (1949)
  Apparatus: 100 rpm; 6-in. disc
  Subjects(a): 50 Normals; 85 Psychotics
  Criterion: Hospital diagnosis
  Results: 1. Memory impairment and no aftereffect correlated
           2. Test-retest reliability .94

Standlee (1953)
  Apparatus: No details of apparatus
  Subjects(a): 16 Normals; 25 Psychotics
  Criterion: Hospital diagnosis
  Results: 1. Electroshock therapy does not impair capacity to perceive aftereffect
           2. No relation between perception of aftereffect and memory

Price and Deabler (1955)
  Apparatus: 100 rpm; 6-in. disc
  Subjects(a): 40 Normals; 40 Psychiatric; 120 Organics
  Criterion: 1. Hospital diagnosis 2. Personal history
  Results: 1. Nonorganic psychotics and normals perceived aftereffect
           2. Cortical involvement cases unable to perceive aftereffect

Gallese (1956)
  Apparatus: 90 rpm; 6-in. disc
  Subjects(a): 30 Normals; 41 Schizophrenics; 47 Organics (CNS); 50 Organic-convulsive;
               12 Lobotomized Schizophrenics
  Criterion: Hospital diagnosis
  Results: 1. SAET differentiated organic from nonorganic patients
           2. Some organics more sensitive to aftereffect than others
           3. Lobotomized patients perceived aftereffect
           4. Perception of aftereffect unrelated to age, sex, length of hospitalization
           5. Test-retest reliability (.84)

Spivack and Levine (1957)
  Apparatus: 78 rpm; no details of disc size
  Subjects(a): 32 Brain injured adolescent boys; 35 Emotionally disturbed adolescent boys
  Criterion: APA Nomenclature
  Results: Emotionally disturbed adolescents perceived aftereffect more frequently than
           brain damaged adolescents

Page, Rakita, Kaplan, and Smith (1957)
  Apparatus: 100 rpm; 8-in. disc
  Subjects(a): 20 Organic cases; 20 Psychiatric cases
  Criterion: Hospital diagnosis
  Results: Organic patients less likely to perceive aftereffect than functional cases

Davids, Goldenberg, and Laufer (1957)
  Apparatus: 78 rpm; 8-in. disc
  Subjects(a): 24 Normal children; 29 Psychiatrically disturbed children; 15 Organic children
  Criterion: Medical and psychological evaluation
  Results: The SAET as well as the Trail Making Test possess promise as methods for
           assessing cortical impairment

Stilson, Gynther, and Gertz (1957)
  Apparatus: No apparatus
  Subjects(a): Critical review of previous research
  Results: 1. Stresses importance of the base rate in determining discrimination of any
              diagnostic instrument
           2. SAET can be used with considerable confidence to differentiate nonorganics

Garret, Price, and Deabler ([date illegible])
  Apparatus: [illegible]
  Subjects(a): 40 Organics; 30 Normals
  Criterion: Hospital diagnosis
  Results: 1. SAET falls into category of phenomenological movement (phi phenomena)
           2. SAET with the Graham Kendall Test form a valid battery for indicating
              cortical damage

Table 1 (Continued)

Harding, Glassman, and Helz (1957)
  Apparatus: 920 degrees of spiral; 78 rpm; 6-in. disc; 4 trials
  Subjects(a): 81 Normal children
  Criterion: Staff decision
  Results: 1. Normal children at 4.5 to 6 years of age perceive aftereffect
           2. No children below 55 months perceived aftereffect
           3. Mental age had more direct relationship to perception of aftereffect than
              chronological age

Goldberg and Smith (1958)
  Apparatus: [degree of spiral illegible]; 78 rpm; 10-in. disc; 4 trials
  Subjects(a): 30 No pathology; 24 Mixed neurologic; 38 Psychiatric
  Criterion: Hospital diagnosis
  Results: 1. All normals perceived aftereffect
           2. Results warn against indiscriminate use of SAET for differential diagnosis

Aaronson (1958)
  Apparatus: [illegible]; 4 trials
  Subjects(a): 65 Epileptics [remainder illegible]
  Criterion: Hospital diagnosis
  Results: 1. Poor performance on SAET associated with impairment of visual function
           2. SAET seems most sensitive to temporal lobe involvement

London and Bryan (1958)
  Apparatus: No details of apparatus
  Subjects(a): 22 Normals; 14 Brain injured
  Criterion: Hospital diagnosis
  Results: 1. Structured instructions aided organics to see aftereffect as frequently as
              normals
           2. Organics unable to see aftereffect with neutral instructions

Gilberstadt, Schein, and Rosen (1958)
  Apparatus: 920 degrees of spiral; 100 rpm; no details of disc size; 4 trials
  Subjects(a): 87 Psychiatric; 140 Neurologic
  Criterion: Hospital diagnosis
  Results: 1. All Ss considered, the SAET diagnoses brain damage no better than base rates
           2. Base rates can be helpful in determining cutting score in particular
              populations using the SAET as a screening technique
           3. The SAET has limited usefulness

Berger, Everson, Rutledge, and Koskoff (1958)
  Apparatus: 920 degrees of spiral; 78 rpm; 6-in. disc; 4 trials
  Subjects(a): 110 Neurologic cases
  Criterion: 1. Hospital diagnosis 2. EEG 3. Pneumoencephalogram 4. Snellen chart
             5. Skull X-rays 6. Visual field
  Results: 1. Visual acuity correlated with perception of aftereffect
           2. The aftereffect phenomena involves radial movement and gamma movement
           3. SAET performance not related to electroencephalogram, pneumoencephalogram,
              or skull X-rays
           4. SAET performance related to spinal fluid studies, visual field studies, and
              likelihood of brain damage

Spivack and Levine (1958)
  Apparatus: No details of apparatus; staggered trials
  Subjects(a): 24 Organics; 20 Normals
  Criterion: Staff diagnosis
  Results: 1. Performance on SAET differentiates between brain damaged and normal group
              with slight efficiency
           2. The duration of the aftereffect may be a fruitful area for research
           3. Lack of perception of aftereffect by organic cases may relate to well known
              perseveration phenomena in brain injury cases

(a) As designated in the original research.
of what was tentatively described as childhood schizophrenia.
All of these Ss were given various psychological tests,
but in each case a SAET of eight trials
was administered. Figure 1 presents graphically the 
distribution of SAET scores for the cases. 

Experimental group. Fifty of the children shown 
in Fig. 1 were selected as the “predicted abnormal” 
group. These children did not perceive the aftereffect 
phenomenon on eight trials of the SAET. Four of 
the original sample were dropped from the study be- 
cause EEG evaluation was not obtainable. This re- 
sulted in an experimental group of 46 cases. The 
mean age in months for the predicted abnormal 
group was 122.98. There were 34 males and 12 fe- 
males included in the group. The average grade place- 
ment was at the 4.0 grade level. Included in the 
group were 37 cases where one of the initial com- 
plaints was poor reading ability, 13 cases where 
speech defect was part of the presenting picture, 20 
cases where the child was overly aggressive, and 22 
cases where the child was said to be withdrawn. 
Total complaints exceed 46 since several symptoms 
were frequently described for a single case. 

Control group. A matched control group was se- 
lected for this study. Twenty cases, matched for age, 
were selected from the sample shown in Fig. 1. All 
Ss in this group achieved a full score of eight on the 
SAET. Mean age for this group was 119.65 months, 
and the group included 11 males and 9 females. The 
average grade placement was at the 4.4 grade level. 
Within this group were two children with severe 
reading problems, one with speech difficulty, three 
overly aggressive children, and one child who was 
said to be withdrawn. The remainder of the group 
were children who were seen for routine psychological
evaluation. This group was designated the “predicted
normal” group.


[Figure 1: frequency histogram; x-axis: Spiral score (0 to 8); y-axis: Frequency]

Fig. 1. Distribution of Spiral Aftereffect Test scores
for 420 children referred for outpatient psychological
services.


Table 2

Matching Data for the Experimental
Group and Control Group

                             Control Group        Experimental Group
                             (Predicted           (Predicted
                             Normal EEG)          Abnormal EEG)
Variable                     N = 20               N = 46
Mean age in months           119.65               122.98
Sex
  Males                      11                   34
  Females                    9                    12
School placement, grade      4.4                  4.0

Table 2 presents the data for the experimental
group and the control group.


Predictor Battery 


Although the primary purpose of this study was
the evaluation of the SAET, it was decided to include
some standard psychological tests thought
to be related to the presence or absence of cortical
damage.

The psychological tests selected for this study were
viewed as screening devices rather than diagnostic
tools. It was hypothesized that psychological measurements
can discriminate children of essentially normal
cortical function from children having disturbed
cortical function.

The Spiral Aftereffect Test. The stimulus for this
procedure was an Archimedes Spiral [degree illegible]
printed on a 7-in. white disc, presented to the S with
a rotation speed of 80 rpm (5 rpm variability). The
apparatus was battery operated and reversible (clockwise
and counterclockwise rotation).3

Each S was seated approximately eight ft from
the spiral. The following instructions were given:

This is a test of the way you see. Please keep
your eyes on the silver button in the center of the
wheel. In a moment I will turn on the machine.
The wheel will go around and I will ask you what
you see. Please keep your eyes on the silver button
even when I ask you questions. Do you understand?

At this point the machine was turned on. After
[interval illegible] sec., as timed with a stop watch,
the S was asked: “What is happening to the black
line?” [...] “Is the black line moving toward the
center or away from the center?” In all cases, Ss
were able to respond to this inquiry. At the end of
20 sec. the S was told: “In a moment I will turn the
machine off. Please watch the wheel very carefully
when I turn the machine off.” At the end of 30 sec.
the apparatus was stopped and the S was asked:
“What is now happening to the black line?” Any
reported aftereffect of opposite direction from the
original effect was given a score of 1. A zero score
was given if no aftereffect was reported.

3 Details of the apparatus are available from
Psychological Research and Development Corporation,
Tampa, Florida.

Eight trials were presented in an ABBABBAA 
order. 
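The scoring rule just described can be summarized in a few lines. This is only an illustrative sketch: the names `TRIAL_ORDER`, `saet_score`, and `predicted_group` are hypothetical, and reading A and B as the two directions of rotation is an assumption, since the text does not define the letters. The group rule follows the selection criteria given under Subjects (no aftereffect on any of eight trials vs. a full score of eight).

```python
TRIAL_ORDER = "ABBABBAA"  # eight trials; A and B assumed to be the two rotation directions

def saet_score(aftereffect_reported):
    """Total SAET score: 1 point per trial on which an opposite-direction aftereffect is reported."""
    if len(aftereffect_reported) != len(TRIAL_ORDER):
        raise ValueError("expected one report per trial (eight trials)")
    return sum(1 for reported in aftereffect_reported if reported)

def predicted_group(score):
    """Group assignment used in this study for the two extreme scores; None otherwise."""
    if score == 0:
        return "predicted abnormal"
    if score == len(TRIAL_ORDER):
        return "predicted normal"
    return None

no_aftereffect = saet_score([False] * 8)
full_score = saet_score([True] * 8)
mixed = saet_score([True, False, True, True, False, True, True, False])
print(no_aftereffect, full_score, mixed)
```

Only children scoring 0 or 8 were drawn into the experimental and control groups, so intermediate scores are left unclassified here.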

Reliability of this test is well known (Freeman & 
Josey, 1949; Gallese, 1956; Price & Deabler, 1955) 
with test-retest and interjudge reliability having been 
found to be between .84 and .94 by various experi- 
menters. 

The Bender Visual Motor Gestalt Test. This well- 
known test has been described elsewhere (Bender, 
1938). Although commonly evaluated by clinical 
judgment, a five-point rating scale was developed for 
this study. Standard instructions were followed in 
the administration of this test. 

The five-point rating scale was applied as an “in- 
spection technique.” The judges examined the entire 
Bender record of each S and noted such discrepancies 
as distortion, collision (one figure running into an- 
other) and rotation (a figure turned from the nor- 
mal axis). Thus the test productions were rated in 
terms of their apparent equivalence to the original 
cards. Each S was given a score ranging between 0 
and 4, with 0 representing what would be consid-
ered an essentially perfect production, with consid-
eration for the age of the S (Bender, 1946).

Two judges, each with several hours’ training on 
this technique, rated all of the cases in the study. 
Applying a raw data formula, a Pearson correlation 
of .77 was obtained. Thus, the reliability coefficient 
is significant well beyond the .01 level. A cutoff point
was established on the rating scale whereby scores of
0, 1, and 2 were considered to be normal and scores
of 3 and 4 were considered to be abnormal. In com-
paring the interjudge reliability using this method,
agreement was obtained in 93% of the cases. 
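
The two reliability checks described above (a Pearson correlation computed from the raw ratings, and percentage agreement once the cutoff is applied) can be illustrated with a minimal sketch. The ratings below are hypothetical and are not the study's data; the function names are illustrative only, and the cutoff of 0-2 versus 3-4 follows the paragraph above.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson product-moment correlation computed from raw scores."""
    n = len(x)
    sx, sy = sum(x), sum(y)
    sxx = sum(a * a for a in x)
    syy = sum(b * b for b in y)
    sxy = sum(a * b for a, b in zip(x, y))
    num = n * sxy - sx * sy
    den = sqrt((n * sxx - sx ** 2) * (n * syy - sy ** 2))
    return num / den

def percent_agreement(x, y, cutoff=2):
    """Share of cases that both judges place on the same side of the cutoff."""
    same = sum((a > cutoff) == (b > cutoff) for a, b in zip(x, y))
    return same / len(x)

# Hypothetical ratings (0-4) given by two judges to ten records.
judge_a = [0, 1, 2, 3, 4, 2, 1, 0, 3, 2]
judge_b = [0, 2, 2, 3, 3, 3, 1, 1, 4, 2]

r = pearson_r(judge_a, judge_b)
agreement = percent_agreement(judge_a, judge_b)
```

The two indices answer different questions: the correlation uses the full five-point scale, whereas the agreement figure reflects only whether the judges reach the same normal/abnormal decision.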

Draw-A-Person Test. This was the standard clini-
cal test in which the S was asked to “draw a per-
son.” No further directions were given other than to
encourage the S to complete the drawing.

Each drawing was rated on the single variable of
“tipping.” A perpendicular figure was considered to
have a score of 0, while a “floating” figure, lying
horizontal on the paper, was given a score of 4.
Scores of 1 through 3 were assigned to various de-
grees of rotation between the perpendicular and the
horizontal.

Two judges rated all of the drawings in the study.
Applying the raw data formula to the judges’ rat-
ings, a Pearson correlation of .75 was obtained. This
is significant beyond the .01 level. A second reli-
ability evaluation was done by setting a cutoff score
between 2 and 3 on the five-point scale. Thus, scores
of 0, 1, and 2 were considered to be predictive of
normal cortical function. Scores of 3 and 4 were
considered to be predictive of abnormal cortical func-
tion. Agreement was found on 85% of the ratings
using this method.

The Wechsler Scales. For the purposes of this 
study, several subtests of the Wechsler Intelligence 
Scale for Children (WISC) were administered to all 
of the Ss. The scales selected were Digit Symbol,
Block Design, Arithmetic, and Digit Span. An addi- 
tional Digits-Reversed Score was also included. These 
scales were administered and scored according to 
standard procedures. Except for the Digits-Reversed 
variable, weighted scores were used. 


Criterion 


Whereas all previous studies have dealt with known 
brain damaged groups or assumed non-brain-dam- 
aged groups, this study deals with a group of chil- 
dren who were supposedly non-brain-damaged, as a 
result of physical examination before coming to the 
outpatient clinic in which they were tested. Empiri- 
cal evidence indicates that about 10% of clinic pa- 
tients, even though physical examination is negative, 
have some cortical disruption which might be de- 
scribed as subclinical intracranial pathology. It was 
deemed necessary that this study would have an ob- 
jective neurologic evaluation, and the EEG was 
chosen as the preferred criterion measure. All mem- 
bers of the experimental and the control groups 
were referred for EEG evaluation. A two-hour EEG 
record was taken on each child. A standard eight- 
channel Medcraft instrument, CS7C Model D, was 
used. Tracings were obtained in all leads, and a por- 
tion of the recording was a sleep record. 

Only one judge was used in this experiment, since 
in the locality of the study there was a neuropsy- 
chiatrist available who had extensive experience in 
EEG with children. Each child was rated by the 
neurologist on a five-point scale ranging from 0 to 
4, on the factor of focal findings vs. no focal find- 
ings. A second rating scale was administered to meas- 
ure diffuse findings vs. no diffuse findings. In deter- 
mining the final criterion, those cases which showed 
any EEG anomalies were called the “abnormal EEG 
group.” Those Ss whose EEGs resulted in no find- 
ings were considered to have normal cortical function. 

Six months after the original study, the EEG trac- 
ings of 18 of the Ss, including both “predicted nor- 
mal” and “predicted abnormal,” were selected using 
a table of random numbers. These records were given 
to the original rater for blind re-evaluation. He was 
asked to rate each of these records in terms of find- 
ings vs. no findings. When these ratings were fin- 
ished, it was found that all 18 cases had been rerated 
exactly as the original ratings. Since an intrajudge 
correlation of 1.00 seemed highly unlikely, the re- 
sults were questioned. In order to insure a more 
careful evaluation of the reliability of the EEG rat- 
ings, 36 cases were selected for re-evaluation. Half of 
these were predicted normal and half were predicted 
abnormal. In all cases, identifying data were removed 
and the neuropsychiatrist rating the cases re-evalu- 
ated the records with no idea of which were normal 
and which were abnormal. Between 6 and 18 months
had elapsed between the original ratings and this
final rerating.

The same criterion of findings vs. no findings was
applied, and the resulting data yielded a phi coeffi-
cient of .77. The neuropsychiatrist was able to re-
rate the EEG tracings reliably in 87% of the cases.
This sort of reliability is unusual in view of what
has been found in the literature. It should be stated,
however, that the EEG records taken on these chil-
dren were more carefully done than one might expect [...]


Table 3

Correlation of Various Predictors with EEG Criterion
and Intercorrelation with Each Other

Predictor Variable      EEG          Spiral       Bender       DAP

EEG (criterion)         [illegible]
Spiral Aftereffect      .74a*        [illegible]
Bender Gestalt          .36a*        .42a*
Draw-A-Person           .30a*        .35a*        [illegible]
WISC
  Digit Symbol          .25          [illegible]  .36*         .09
  Block Design          [illegible]  .54*         .45*         .19
  Arithmetic            [illegible]  .43*         .32          .00
  Digit Span            .28          .30          [illegible]  .02
  Digits Reversed       .08          .01          .23          .10

a Phi coefficient.
* Significant at .01 level or better (.01 level = .345; .05
level = .166).


Results 
The Spiral Aftereffect Test


All Ss were given eight trials on the SAET, 
and scores of 6, 7, and 8 were considered to 
be predicted normal. A score of 0 through 5 
was considered to be predicted abnormal.
This dichotomy was compared with the EEG 
ratings of findings vs. no findings. 
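
The scoring rule described above (one point per trial on which an aftereffect is reported, eight trials, totals of 6 to 8 predicted normal) can be sketched as follows. The function names and the example record are hypothetical and are given only to make the dichotomy concrete.

```python
def saet_total(trial_scores):
    """Sum the eight trial scores (1 = aftereffect reported, 0 = none)."""
    if len(trial_scores) != 8 or any(s not in (0, 1) for s in trial_scores):
        raise ValueError("expected eight scores of 0 or 1")
    return sum(trial_scores)

def saet_prediction(total):
    """Totals of 6-8 are predicted normal; totals of 0-5 predicted abnormal."""
    return "predicted normal" if total >= 6 else "predicted abnormal"

# Hypothetical record: aftereffect reported on six of the eight trials.
example = [1, 1, 0, 1, 1, 1, 1, 0]
total = saet_total(example)
label = saet_prediction(total)
```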


The Bender Visual-Motor Control Test 


Scores on the five-point scale of the Bender
were separated into predicted normal (score
of 0 or 1) and predicted abnormal (score of
2, 3, or 4), and compared with the EEG
criteria.


The Draw-A-Person Test 


Scores on the five-point scale of the Draw-
A-Person test were dichotomized: predicted
normal (score of 0 or 1), and predicted ab-
normal (score of 2, 3, or 4), and compared
with the EEG criteria.
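
When a dichotomized test is compared with the dichotomous EEG criterion, the data form a 2 x 2 table, and Table 3 reports such comparisons as phi coefficients. A minimal sketch of that coefficient is given below; the cell counts are hypothetical and the function name is illustrative only.

```python
from math import sqrt

def phi_coefficient(a, b, c, d):
    """Phi for a 2x2 table laid out as:

                          EEG normal   EEG abnormal
    predicted normal          a             b
    predicted abnormal        c             d
    """
    den = sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if den == 0:
        raise ValueError("a row or column total is zero")
    return (a * d - b * c) / den

# Hypothetical counts, not the study's data.
phi = phi_coefficient(30, 4, 6, 20)
```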


The Wechsler Scales 


The raw scores on the Wechsler Scales were 
compared with the EEG criterion of findings 
vs. no findings using the point biserial corre- 
lation technique. 

Intercorrelations were calculated among the 
various predictor variables. Table 3 presents 
the correlations among the various predictors 
and the EEG criterion. Unless otherwise indi-
cated, point biserial correlation technique was
used.
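
The point biserial technique relates a continuous score (here, a Wechsler subtest score) to a two-valued criterion (findings vs. no findings). The sketch below uses the textbook form of the coefficient; the scores and group codes are hypothetical, and the coding of 1 and 0 is an assumption made for illustration.

```python
from math import sqrt

def point_biserial(scores, groups):
    """Point biserial r between a continuous score and a 0/1 grouping."""
    n = len(scores)
    g1 = [s for s, g in zip(scores, groups) if g == 1]
    g0 = [s for s, g in zip(scores, groups) if g == 0]
    if not g1 or not g0:
        raise ValueError("both groups must be non-empty")
    mean = sum(scores) / n
    # Standard deviation of all scores, with n in the denominator.
    sd = sqrt(sum((s - mean) ** 2 for s in scores) / n)
    if sd == 0:
        raise ValueError("scores have no variance")
    p = len(g1) / n
    q = 1 - p
    return (sum(g1) / len(g1) - sum(g0) / len(g0)) / sd * sqrt(p * q)

# Hypothetical subtest scores; 1 = no EEG findings, 0 = EEG findings.
scores = [12, 9, 11, 7, 10, 6, 8, 13]
groups = [1, 0, 1, 0, 1, 0, 0, 1]
r_pb = point_biserial(scores, groups)
```

The value is identical to the Pearson correlation obtained when the criterion is simply coded 0 and 1.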

It will be noted on Table 3 that the SAET 
had the highest correlation with the EEG cri- 
terion. The Bender, the Draw-A-Person test, 
the WISC Arithmetic, and the WISC Block 
Design all correlated beyond the .01 level of
significance with the EEG criteria.

Significant intercorrelations were found be-
tween the SAET and the Bender, between the
SAET and the Draw-A-Person test, between
the SAET and the WISC Digit Symbol, and
between the SAET and the WISC Arithmetic.

The Bender correlated significantly with
the SAET, the Draw-A-Person test, the WISC
Digit Symbol, and the WISC Block Design.

The Draw-A-Person test correlated signifi-
cantly with the SAET and the Bender.

Table 4 presents the percentage of hits
in predicting normal EEG records and abnor-
mal EEG records for all the predictor vari-
ables.


Table 4

Accuracy of Prediction of Normal and Abnormal EEG
Records by the Various Predictor Variables, in
Percentage Hits

                           Percentage of    Percentage of
                           Normal EEG       Abnormal EEG
                           Records          Records
                           Correctly        Correctly
                           Predicted        Predicted
Predictor Variable         (Hits)           (Hits)

Spiral Aftereffect test    1.00             .86
Bender Gestalt             .65              .70
Draw-A-Person              .60              [illegible]
Digit Symbol               .58              [illegible]
Block Design               .67              [illegible]
Arithmetic                 .54              .71
Digit Span                 .31              .81
Digits Reversed            .37              .72
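
The "percentage of hits" reported for each predictor is, for each EEG group, the proportion of records in that group that the predictor classified correctly. A minimal sketch with hypothetical classifications follows; it assumes both groups contain at least one record.

```python
def hit_rates(predicted, actual):
    """Proportion of normal and of abnormal criterion records correctly predicted.

    Both arguments are sequences of the strings "normal" / "abnormal".
    """
    rates = {}
    for label in ("normal", "abnormal"):
        preds = [p for p, a in zip(predicted, actual) if a == label]
        rates[label] = sum(p == label for p in preds) / len(preds)
    return rates

# Hypothetical predictions and EEG ratings for five children.
predicted = ["normal", "normal", "abnormal", "abnormal", "normal"]
actual = ["normal", "normal", "normal", "abnormal", "abnormal"]
rates = hit_rates(predicted, actual)
```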

An attempt was made to analyze the char- 
acteristics of the EEG records. This was 
found to be difficult in that the different rec- 
ords were described with such a variety of 
terminology that statistical analysis was ques- 
tionable. Eight characteristics of the records 
were chosen in order to present some defini-
tive data.3 Table 5 presents this brief analy-
sis of the EEG records.


Discussion 


The data presented above would seem to 
indicate that the SAET and other psycho- 
logical tests are variously effective for dis- 
criminating children exhibiting abnormal elec- 
troencephalographic records from children 
who exhibit normal records. These findings 
support some of the earlier work done with 
the SAET. The SAET seemed to be particu- 
larly effective as a discriminating technique. 
The sample was considered normal by gen- 
eral physical examination. This raises the 
hypothesis that psychological tests are per- 
haps more sensitive indicators of minimal 
cortical disruption than the usual medical- 
neurological examinations. 

The present study differs in its findings 
from some of the previous research on the 
SAET, particularly one study which used 
EEG as the criterion. The EEG record must 
be considered an intermediate criterion rather 
than an ultimate criterion. There are a va- 
riety of ways and means by which the EEG 
may be used as a validating instrument. In 
the present study, it was used with con-
siderable care, and the reliability data pre-
sented lend some support to this technique
as an adequate criterion. The fact that it is
not an ultimate criterion is evidenced by the
fact that one of the children whose EEG rec-
ord was rated “no findings” had a grand mal
seizure in the clinic. It is conceivable, in view
of suspected criterion fallibility, and less than 
perfect intrajudge EEG reliability, that the 
predictability of the SAET as well as other 
psychological measurements may actually be 
better than the present data indicate. Since,
however, we must be dependent upon an in-
termediate criterion, all we can do is attempt
to establish the criterion used with the great-
est of care and attention to questions of reli-
ability and validity.

3 Analysis of the EEG records continues and will
be presented in a later paper.


Table 5

A Brief Analysis of Some Characteristics of the EEG
Records of the Predicted Normal and the Predicted
Abnormal Groups

                                        Predicted   Predicted
                                        Normal      Abnormal
                                        Group       Group

Mean occipital alpha rhythm             8.7         8.2
  (per second)
Mean rating of paroxysmal               1.08        2.06
  activity (rated 1-4)
Mean rating of build-up under           2.0         2.31
  hyperventilation (rated 1-4)
Mean rating of dysrhythmia or           2.25        2.52
  disorganization (rated 1-4)
Percentage of records with              17          25
  focal slow activity
Percentage of records with
  spikes
    Single                              0           40
    Multiple                            0           18
    Spike & Dome                        0           14

The most significant question in the evalua- 
tion of the SAET, as well as any other diag- 
nostic technique, is the question of base rate. 
This matter has been previously discussed in 
the introduction to this study. It was decided 
not to apply the base rate formula to the 
present data since there is no information 
available on random samplings of unselected
EEG tracings in children between the ages of 
5 and 16 years. This would seem to be a very 
necessary area for further research. 
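
Why the base rate matters can be shown with Bayes' rule. The sketch below is illustrative only and is not an analysis carried out in the study: it takes the Bender hit rates from Table 4 (.65 for normal records, .70 for abnormal records) and, purely as an assumed base rate, the rough 10% figure mentioned earlier for clinic patients with negative physical examinations.

```python
def predictive_value_abnormal(hit_abnormal, hit_normal, base_rate):
    """Probability that a case predicted abnormal is in fact abnormal."""
    true_pos = hit_abnormal * base_rate
    false_pos = (1 - hit_normal) * (1 - base_rate)
    if true_pos + false_pos == 0:
        raise ValueError("no case would be predicted abnormal")
    return true_pos / (true_pos + false_pos)

# Assumed base rate of 10%; hit rates taken from Table 4 (Bender Gestalt).
ppv = predictive_value_abnormal(0.70, 0.65, 0.10)

# The same hit rates with an assumed base rate of 50%, for comparison.
ppv_even = predictive_value_abnormal(0.70, 0.65, 0.50)
```

Under these assumptions fewer than one in five "predicted abnormal" cases would show EEG findings at a 10% base rate, against two in three at a 50% base rate, which is why unselected base-rate data are needed before such a formula can be applied.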

In judging the effectiveness of any instru- 
ment, one must consider its application. Psy- 
chological tests, when considered in terms of 
the presence or absence of brain damage, are 
screening devices. In and of themselves they 
cannot be diagnostic. The diagnosis of the 
presence of brain damage would be a medi- 
cal problem, whether it be through labora- 
tory tests, histological examination, or re-
sults of treatment. In the present study, the
SAET was the best of a series of psychologi-
cal tests for the prediction of EEG findings.
Other psychological tests were significant but 
less accurate in their predictability. Undoubt- 
edly we will always be dealing with the ques- 
tion of false positives and false negatives. It 
is still a moot question whether errors in pre- 
diction are based on inaccuracies of the pre- 
dictor variables, or in the criterion itself. 
Much careful research is necessary to clarify 
these points. For the time being, however, 
since clinical psychologists are called upon to 
give tests and, as part of their testing pro- 
cedure, to make some determination of who 
shall be referred to the neurologist and who 
shall not be referred, the SAET, probably in 
combination with one or more standard psy- 
chological instruments, would seem to be the 
best available technique for this purpose. 
A COMPARISON OF SHADING RESPONSES OBTAINED
WITH TWO RORSCHACH METHODOLOGIES
FROM PSYCHIATRIC AND NON-
PSYCHIATRIC SUBJECTS 1


PATRICIA F. WALLER 2
University of North Carolina 


Baughman has developed a new inquiry 
procedure for the Rorschach Ink Blot Test 
(Baughman: 1954, 1958). This procedure 
utilizes a paired comparison presentation of 
a series of modifications of the original blot. 
For example, if a subject (S) reported a 
flower garden on Card X, instead of the usual 
questioning on the inquiry, the S is presented 
the original card and an identical card in 
which the color has been eliminated. He is 
then asked if he can still see the flower gar- 
den in the achromatic card, in which card is 
it clearer, and why. The original card is then 
removed and replaced by a card in which the 
shading has been eliminated, and the ques- 
tioning proceeds as before. The cards con- 
tinue until there is only an outline of the blot 
remaining, but it is rarely necessary to con- 
tinue questioning throughout the entire series. 

It has been shown that this paired com-
parison inquiry procedure elicits from non-
psychiatric Ss many more shading responses
than does the regular inquiry procedure
(Baughman, in press; Marimon, in press).
It appears that the questions involved in the
paired comparison inquiry are easier to an-
swer than those in the conventional inquiry.
Rather than asking, “What is it about the
blot that makes it look like a bat?” one asks
“Do you still see the bat here? Is it clearer in one?
What makes it clearer over here?” Although
this technique takes slightly longer, it ap-
pears that it does not demand so much from
the S. The questions do not seem so difficult
to answer.


1 This article is based upon a doctoral dissertation
completed at the University of North Carolina, Au-
gust, 1958. This paper was presented at the 1958
meeting of the American Psychological Association,
Washington, D. C.

2 Now at R. B. Jackson Memorial Laboratory,
Bar Harbor, Maine.


Method 
Problem 


If shading is truly related to anxiety, as has often 
been hypothesized (Beck: 1944, 1951; Binder, 1937; 
Hertz, 1940; Klopfer, 1937; Rorschach & Oberholzer, 
1942), then it would seem reasonable to assume that 
shading responses would be more prominent in a 
psychiatric population than in a nonpsychiatric
population. However, if it is also assumed that psy- 
chiatric patients are under greater stress, then it 
may be reasonable to assume that it would be more 
difficult for them to identify the stimuli giving rise 
to their behavior. Although they may be reacting 
to shading, the stress they are under may make it 
more difficult for them to report it. Cox and Sara- 
son (1954) have made this point and have actually 
found fewer texture responses in high anxiety Ss 
than in low anxiety Ss. An easier, more objective in- 
quiry may enable such Ss to express the shading de- 
terminant whenever they are using it. On the basis 
of this reasoning, it was predicted that psychiatric 
patients would show a greater difference in shading 
responses between the regular and the Baughman in- 
quiries than would nonpsychiatric Ss. 


Subjects 


Data from a previous study by Baughman (1959) 
provided information from a nonpsychiatric popu-
lation. Ss were 162 employees of three insurance
companies in a southern city. The psychiatric Ss 
were 60 patients from the inpatient and outpatient 
services in a hospital in a nearby town. Psychi- 
atric Ss were selected according to three criteria. 
First, they were to be showing no obvious signs of 
psychosis. Second, there should be no question of or- 
ganic impairment, and third, they should be literate. 


3 The author is indebted to Emmett Earl Baugh- 
man, who made available data from his research. 
Fig. 1. Cumulative distributions of shading re-
sponses obtained from psychiatric and nonpsychi-
atric populations, using two Rorschach methodol-
ogies.


Whether a patient was considered psychotic was de- 
cided by the psychiatrist in charge. In some cases, 
later evaluation led to a psychotic diagnosis, but in 
such instances the diagnosis could be termed debat- 
able, and the S was retained in the study. The psy- 
chiatric and the nonpsychiatric groups were compa- 
rable in terms of mean number of responses to the 
Rorschach (20.10 and 22.05, respectively) and me- 
dian education (high school graduate). 


Procedure 


The nonpsychiatric population consisted of two 
groups of 81 Ss each, one group receiving the regular 
inquiry and the other the paired comparison. The 
psychiatric population consisted of two groups of 30 
Ss each, one group receiving the regular inquiry and 
the other the paired comparison. The same examiner 
tested all the psychiatric Ss and many of the non- 
psychiatric, so that the method of administration 
was comparable. All records were scored by two 
scorers. Although scoring included the various forms 
of Y, V, and T, interscorer reliability was based on 
the sum of shading responses for each record, that 
is, the total number of responses involving shading of 
any kind. Interscorer reliability (Pearson r) ranged 
from .90 to .98. 




Results 


A comparison of the groups on the basis of
the sum of shading responses is shown in
Fig. 1. Because of the unequal Ns for the
two groups, the results are depicted in cumu-
lative percentages. Table 1 lists the means
and standard deviations of number of shad-
ing responses for the two inquiries with the
psychiatric and the nonpsychiatric popula-
tions. On the basis of the sum of shading re-
sponses, both the psychiatric and the nonpsy-
chiatric groups show significantly more shad-
ing on the Baughman inquiry procedure (p
< .01). However, the psychiatric population
shows even more than does the nonpsychiatric
population. A t test of the difference between
the differences was significant at <.01 level,
confirming the hypothesis that the psychi-
atric population would show a greater differ-
ence between the regular and the paired com-
parison inquiries on the basis of the shading
response.

The relative efficiency of the two inquiry
procedures for discriminating between psy-
chiatric and nonpsychiatric populations was
tested. As can be seen in Fig. 1, the regular
inquiry shows virtually no difference between
the two populations, and where the differ-
ences occur, they are the result of the non-
psychiatric population giving more shading
than the psychiatric population. The direc-
tion of difference supports the theory of Cox
and Sarason but is contrary to expectations
based on orthodox Rorschach theory. How-
ever, the differences between the groups on
the basis of the regular inquiry are not sig-
nificant (t = .69, 109 df). On the basis of
the Baughman paired comparison inquiry, the
psychiatric population gives more shading, as
would be predicted from Rorschach theory.
A t test of the differences was significant at
<.02.


Table 1

Shading Responses for Psychiatric and Nonpsychiatric
Populations on Each of Two Inquiry Procedures

Inquiry       Psychiatric         Nonpsychiatric

Regular       X = 5.5             X = [illegible]
              SD = [illegible]    SD = [illegible]
              N = 30              N = 81
Baughman      X = 14.8            X = [illegible]
              SD = 8.6            SD = [illegible]
              N = 30              N = 81
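
The "difference between the differences" compares the gain in shading responses from the regular to the paired comparison inquiry in one population with the corresponding gain in the other. The report does not give the formula used, so the sketch below is only one plausible form, assuming four independent groups and an unpooled standard error. The group sizes follow the Procedure section; the means and standard deviations are placeholders, not the values of Table 1.

```python
from math import sqrt

def diff_of_diffs_t(groups):
    """t-like statistic for (psychiatric gain) - (nonpsychiatric gain).

    `groups` maps (population, inquiry) -> (mean, sd, n) for four
    independent groups.
    """
    m = {key: stats[0] for key, stats in groups.items()}
    gain_psy = m[("psychiatric", "paired")] - m[("psychiatric", "regular")]
    gain_non = m[("nonpsychiatric", "paired")] - m[("nonpsychiatric", "regular")]
    # Unpooled standard error: sum of the four squared standard errors.
    se = sqrt(sum(sd ** 2 / n for _, sd, n in groups.values()))
    return (gain_psy - gain_non) / se

# Placeholder summary statistics (hypothetical means and SDs).
example = {
    ("psychiatric", "regular"): (5.0, 5.0, 30),
    ("psychiatric", "paired"): (15.0, 9.0, 30),
    ("nonpsychiatric", "regular"): (6.0, 6.0, 81),
    ("nonpsychiatric", "paired"): (10.0, 7.0, 81),
}
t_stat = diff_of_diffs_t(example)
```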


Discussion 


The hypothesis that psychiatric Ss would 
show a greater difference between the regular 
and the paired comparison inquiries in regard 
to the shading response was supported by the 
results. Furthermore, the differences found in- 
dicated that the paired comparison inquiry 
differentiates between psychiatric and nonpsy- 
chiatric populations significantly better than 
does the regular inquiry. These results lend 
support to the assumption that psychiatric 
patients are more sensitive to the shading de- 
terminant but have more difficulty reporting 
it. In other words, the regular inquiry may 
not differentiate between Ss who give little 
shading because they are not utilizing it and 
Ss who give little shading because they are 
having difficulty determining the basis of their 
response. The Baughman paired comparison 
inquiry procedure makes it easier for both 
psychiatric and nonpsychiatric Ss to report 
the determinants involved in their responses. 
The fact that on the regular inquiry pro- 
cedure, psychiatric and nonpsychiatric Ss give 
essentially the same amount of shading, and 
on the easier Baughman inquiry psychiatric 
Ss give significantly more shading than non- 
psychiatric Ss, lends support to the common 
contention that shading is in some way re- 
lated to anxiety and should therefore be more 
prominent in a psychiatric population. 


Summary 

Sixty psychiatric and 162 nonpsychiatric Ss
were compared on the basis of Rorschach
shading responses obtained with two inquiry
procedures, the regular and the Baughman
paired comparison. It was predicted that the
psychiatric group would show a greater dif-
ference between the two inquiry procedures,
since they should be responding more to shad- 
ing but have greater difficulty reporting it, 
because of the stress they would presumably 
be under. The results supported this hypothe- 
sis at <.01 level of significance. The results 
also indicated that on the basis of total num- 
ber of shading responses, the regular inquiry 
procedure failed to differentiate between the 
psychiatric and the nonpsychiatric groups, 
whereas the Baughman paired comparison in- 
quiry differentiated at <.02 level of signifi- 
cance. 
THE RELATIONSHIP BETWEEN KLOPFER’S RORSCHACH
PROGNOSTIC RATING SCALE AND PHILLIPS’ CASE
HISTORY PROGNOSTIC RATING SCALE 1

CLAUDENE SEIDEL 


George Washington University 


Clinical use of the Rorschach test has led 
to the observation that there is often an ex- 
treme discrepancy between the general noso- 
logical picture that a patient presents overtly 
and symptomatically, and the ego structure 
as revealed by the Rorschach. Some of the 
psychological implications of this observa- 
tion, according to Klopfer, are that the Ror- 
schach record usually reflects the individual's 
ego organization as it is shaped by his pre- 
morbid level of maturity as well as his ego 
defenses and character formation, and that 
the ego organization may not always be easily 
recognizable in overt symptoms (Klopfer, 
Ainsworth, Klopfer, & Holt, 1954).

The maturity of the male patient may also
be reflected in his history of sexual adjust-
ment and marital responsibilities as well as
in his relationships to parents and peers from
[...] adulthood. The psychiatric pa-
tient’s premorbid maturity may not be readily
apparent from his observable behavior at the
time of admission to the hospital. Scales have
been developed to deal with the patient’s un-
derlying ego organization and premorbid so-
cial and sexual history.

Phillips (1953) [...] which [...] schizo-
phrenia. This scale includes three areas of
case history data: the premorbid history
(sexual and social adjustment), possible pre-
cipitating factors, and signs of the disorder
(affect and thought). Phillips found that, in
general, case history items pertaining to the
recent premorbid past (found in Sec. I of the
scale) showed the clearest relationship to out-
come with his group of patients. These items
are: recent sexual adjustment, social aspects
of the recent sexual life, past history of per-
sonal relations, and the recent adjustment in
personal relations.

It was thought that Phillips’ scale and
Klopfer’s Rorschach Prognostic Rating Scale
(RPRS) (Klopfer, Kirkner, Wisham, &
Baker, 1951) might be of value in determin-
ing whether there is a relationship between
the level of maturity reflected in a schizo-
phrenic patient’s case history and his Ror-
schach record, and also might be useful in
predicting outcome of the illness.


1 This paper is based on a thesis submitted to the
faculty of the Columbian College [...] George
Washington University [...] Psychology, St. Elizabeths
Hospital, for his assistance in setting up [...]
throughout the course of the investigation, and his
ratings on the Rorschach Prognostic Rating Scale.


Method
Sample

The sample was selected from the file of
referrals (for diagnostic evaluation) to the Psychol-
ogy Branch of Saint Elizabeths Hospital, Washing-
ton, D. C. (This is a mental hospital of [...]
patients made up of voluntary admissions and com-
mitments from all socioeconomic levels.) The cri-
teria set up for the selection of the sample were:

1. Only white, male schizophrenic patients were
selected because the sample on which Phillips [...]
his scale was comprised of males.

2. The patients in the sample group should have
been given a Rorschach test within four months
after their admission date. (Two patients in the
sample group did not strictly meet this criterion:
one was tested within five months of admission and
the other within six months.)

3. The patients should have been hospitalized on
or before May 1, 1954, so that an evaluation could
be made of the outcome of their illness after three
years (i.e., on May 1, 1957).

4. The Rorschach records used in this study were 
obtained before these patients had received tranquil- 
izing drugs. 

5. Only legal residents of the District of Columbia 
and Veterans Administration cases were included so 
that comparably complete case histories could be 
obtained for all members of the group. 

6. The age range was between 20 and 50. 

7. Only patients whose Rorschach records con- 
tained at least five responses were chosen for study. 

In order to select 100 cases it was necessary to 
choose all patients conforming to these criteria that 
were referred for testing between February 1948 and 
December 1954. 

The distribution of the sample group according to
diagnosis (type of schizophrenia) is as follows: cata-
tonic—33, paranoid—32, hebephrenic—5, simple—2,
schizo-affective—2, undifferentiated—22, and type
undetermined—4. The mean age of the sample was
31.9 years.


Rating Procedure 


The case histories were rated on 
the author. Each rating was made 
f the case history that was obtained 
study interview and from rela- 
the time of admission. Later 
ospitalization, was 


Phillips’ Scale. 
Phillips’ scale by 
on that portion o 
at the psychiatric case 
tives and friends at 
data, pertaining to the period of h 
not included in the scale ratings. ; 

The ratings on premorbid social and sexual his- 
tory (Sec. I of Phillips’ scale) were used in this 
study rather than the ratings on the remainder of 
the scale, because in Phillips’ study, Sec. I showed 
the most significant relationship to outcome of ill- 
ness. Preliminary correlations with the Klopfer rat- 
ings were found to be as high or higher when rat- 
ings on Sec. I of Phillips scale were used as when 
the ratings on the total scale were used. The cor- 
relation between Phillips I and Phillips total was 
found to be .96 for the sample group used in this 


tudy. p 
à th patient was rated on five subsections of 
Sec. I and the average of these ratings was deter- 
mined. There are actually six subsections in Sec. I, 
but since two of these (social aspects of recent 
sexual life—30 years and above; social aspects of 
recent sexual life—below 30 years of age) are mutu- 
ally exclusive, each patient could be rated on only 
‘five subsections. 

Klopfer’s Scale. The ratings on K 
schach scale were made without knowledge © 
scores obtained on Phillips’ scale.* 


ethod for determining scores was fol- 
exception. Klopfer subtracts 1.0 from 
evel score when there is a dis- 
more between the lowest and 
tings, provided the lowest form 
In this study 1.0 was 


lopfer’s Ror- 
f the 


2 Klopfer’s mi 
lowed with one 
the weighted form 1 
crepancy of 3.0 oF 
highest form level ra 
level rating is a minus score. 


Two Prognostic Scales 


Outcome Criterion 


One of the stated purposes of this study was to 
learn whether Phillips’ scale and Klopfer's scale 
might be useful in predicting the outcome of illness 
of schizophrenic patients. Two criteria were set up 
to investigate this question. The first of these was 
admission to the hospital on or before May 1, 1954, 
so that an evaluation of outcome of illness could be 
made after a three-year period. Secondly, those pa- 
tients that left the hospital prior to three years had 
to receive a psychiatric evaluation of Social Re- 
covery or Recovery. It was found in checking the 
records that 7 patients did not strictly meet the first 
criterion, so they had to be excluded from the prog- 
nostic evaluation. This left 93 patients that met the 
first criterion. Thirty patients did not meet the sec- 
ond criterion because they left the hospital prior to 
three years with a psychiatric evaluation of Im- 
proved or Unimproved. These 30 patients are a 
heterogeneous group. Some of them were released 
against medical advice, others were transferred to 
another hospital, and others were discharged be- 
cause they were considered to have received maxi- 
mum hospital benefit. Since these 30 patients could 
not be analyzed properly as to outcome of illness, 
they also had to be excluded from the prognostic 
evaluation (for reference purposes, the 37 patients 
that did not meet the criteria are called the “other” 
group). This left 63 patients who met the criteria: 
31 that were hospitalized continuously for three 
years or longer, and 32 that were discharged in less 
than three years with a psychiatric evaluation of 
Social Recovery or Recovery. 


Results 


Product-moment correlations were com- 
puted for the original sample group of 100 
cases and the prognostic evaluation group of 
63 cases. Table 1 gives the correlations be- 
tween the social and sexual history ratings 
(Phillips I) and: (a) Klopfer’s total (final) 
prognostic score ratings, (b) Klopfer’s form 
level ratings, and (c) Klopfer’s determinant 
ratings. The term “determinant rating” re- 
fers to the sum of the ratings on the first five 
sections of the RPRS, i.e., everything except 
form level. 

Point biserial coefficients were computed to 
determine the correlation between the Klopfer 
ratings and the dichotomous variable of So- 
cial Recovery or Recovery in less than three 
years, as against continuous hospitalization 


subtracted only when the lowest form level rating 
was —1.5 or lower. This exception was made be- 
cause with psychotic subjects it is often difficult to 
obtain a satisfactory inquiry and therefore specifica- 
tions of responses are difficult to evaluate. 


Table 1
Pearson Correlations between Klopfer’s RPRS Ratings
and Phillips’ Social and Sexual History Ratings

                                Prognostic      Original
                                Evaluation      Sample
                                Group           Group
                                (N = 63)        (N = 100)
Between RPRS total and
  Phillips I                    .30             .24*
Between RPRS form level
  and Phillips I                .46**           .36**
Between RPRS determinants
  and Phillips I                .19             AS


* Significant at .05 level.
** Significant at .01 level.


for three years or more. Point biserial coeffi- 
cients were also computed to determine the 
correlation between premorbid social and 
sexual adjustment and the same dichotomous 
variable. These coefficients are presented in 
Table 2. 
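
The point biserial coefficient described above can be illustrated with a minimal sketch. The function name and the six ratings below are hypothetical and are not data from the study; the formula is the standard one, which equals the Pearson product-moment correlation between a continuous rating and a 0/1 outcome.

```python
from math import sqrt

def point_biserial(scores, outcomes):
    """Point biserial r between continuous scores and a 0/1 outcome."""
    group1 = [s for s, o in zip(scores, outcomes) if o == 1]
    group0 = [s for s, o in zip(scores, outcomes) if o == 0]
    n1, n0 = len(group1), len(group0)
    n = n1 + n0
    mean1 = sum(group1) / n1
    mean0 = sum(group0) / n0
    mean_all = sum(scores) / n
    # Population standard deviation of all scores taken together
    sd = sqrt(sum((s - mean_all) ** 2 for s in scores) / n)
    return (mean1 - mean0) / sd * sqrt(n1 * n0 / n ** 2)

# Hypothetical ratings (not from the study); 1 = recovered within three
# years, 0 = continuously hospitalized for three years or more.
ratings = [2.0, 3.5, 4.0, 5.5, 6.0, 7.5]
recovered = [0, 0, 0, 1, 1, 1]
r_pb = point_biserial(ratings, recovered)
print(round(r_pb, 3))
```

A positive coefficient here means that higher ratings go with the outcome coded 1.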

Chi square values were determined to learn 
whether there is a relationship between the 
Rorschach ratings and diagnosis, and between 
the social and sexual history ratings and diag- 
nosis, and between these ratings and age. The 
Klopfer total ratings were divided into five 


[...] Categories were [...] Klopfer’s Groups I, [...] 1951). The [...]
High categories [...] III (better than 50-[...] will be of some help [...]
and Medium Low [...] Group IV (50-[...] category to Klopfer’s [...]
median, [...] variable of age [...] and 30-50). This 
point of division was chosen because Phillips’ 
scale is divided in this manner for one sub- 
section of Sec. I. 

The computed chi square for the relation- 
ship between the Klopfer total categories and 
diagnosis is 16.35, which is significant at the 
.05 level. The relationship between the social 
and sexual history ratings (Phillips I) and 
diagnosis shows a chi square value of [...], 
which is also significant at the .05 level. The 
computed chi square for the relationship be- 
tween the Klopfer total groupings and age is 
.70, which is not sufficient to be statistically 
significant. Neither is the relationship be- 
tween the social and sexual history ratings 
and age statistically significant; here the chi 
square value is 1.00. 
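
As an illustration of the chi square computation for ratings split at the median against two age groups, a minimal sketch follows. The 2 x 2 counts are hypothetical (the study's cell frequencies are not reported here), and the critical value 3.841 is the standard .05 value for one degree of freedom.

```python
def chi_square(table):
    """Pearson chi square statistic and degrees of freedom for a table of counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed - expected) ** 2 / expected
    dof = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, dof

# Hypothetical counts: rows = ratings above / below the median,
# columns = younger / older age group.
table = [[18, 14],
         [13, 18]]
stat, dof = chi_square(table)
CRITICAL_05_DF1 = 3.841  # .05 level, 1 degree of freedom
print(round(stat, 2), dof, stat > CRITICAL_05_DF1)
```

With these illustrative counts the statistic falls well below the critical value, which is the pattern the text reports for the age comparisons.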


Discussion 


The present study represents two different 
approaches to the study of the schizophrenic 
patient. One approach stresses the importance 
of the past history of the individual, especially 
aspects of his premorbid sexual and social ad- 
justment. The other approach is primarily 
concerned with the patient’s present and po- 
tential ego strength, or adjustment level, as 
reflected by the Rorschach. 

The significant correlation between the rat- 
ings on Sec. I of Phillips’ scale and the total 
ratings on Klopfer’s scale supports the hy- 
pothesis of a relationship between the social 
and sexual history and the ego strength of 
schizophrenic patients. These findings are in 
agreement with the related findings of [...], 
Fulkerson, and Phillips (1955) with [...] 
subjects. These investigators found that the 
lower the social adequacy of the [...], 
the higher the degree of his maladjustment. 


Table 2 

Point Biserial Correlations between Ratings and Social 
Recovery or Recovery in Less Than 3 Years/Contin- 
uous Hospitalization for at Least 3 Years 

Ratings                         Correlations with Outcome 

Social and sexual history 
  (Phillips I)                  .46** 
Klopfer total                   .40** 
Klopfer form level              .44** 
Klopfer determinants            .34** 


** Significant at .01 level. 




In the present study, the poorer the pre- 
morbid social and sexual adjustment, the 
lower the ego strength. 

According to the results obtained in this 
study, the social and sexual history of the 
schizophrenic patient is significantly related 
to his reality ties (as reflected by RPRS form 


level ratings) but is not significantly related 


to aspects of ego strength other than reality 
testing (reflected by determinant ratings on 
the RPRS). It seems likely that form level, 
as rated on the RPRS, reflects the more di- 
rect manifestation of a psychotic breakdown 
in ego strength, whereas the determinant rat- 
ings of the RPRS reflect, to a greater extent, 
the type of ego defenses and potential emo- 
tional resources of the individual. It is likely 
that the Phillips case history data reflect a 
manifest record of poor sexual and social con- 
tacts without adequate consideration of the 
underlying psychodynamics of sexual and so- 
cial disturbance which the determinant score 
would be more likely to reflect. It is therefore 
not surprising that a significant relationship 
was found between the premorbid social and 
sexual data and form level on the RPRS, but 
not between the sexual and social history and 
the Rorschach determinants. 

The results indicate that the RPRS deter- 
minant scores as well as the RPRS form level 
ratings are significantly related to the out- 
come of the schizophrenic illness. This sug- 
gests that the underlying defenses and poten- 
tial emotional resources of the individual as 
well as the extent of his deficiencies in reality 
testing are important in influencing the out- 


come of the illness. 
The significant correlations with outcome 
of illness suggest that ratings on social and 
sexual history, Klopfer form level, Klopfer 
total, and Klopfer determinants would all be 
useful in predicting the outcome of illness of 
schizophrenic patients.³ The present study 
therefore contributes toward the validation of 
the RPRS by supplying further data regard- 
ing its use with schizophrenic patients. This 
study also confirms the results obtained by 
Phillips. 

3 It should be pointed out that the revision of 
Klopfer’s form level method (see fn. 2) in the pres- 
ent study probably slightly increased the correlations 
in which Klopfer form level or Klopfer total ratings 
are involved. 

Both the Klopfer total ratings and the so- 
cial and sexual history ratings show a sig- 
nificant relationship to diagnosis. The dis- 
tributions suggest that paranoid schizophren- 
ics received more favorable scores on both 
measures than did other subclassifications of 
schizophrenia. 


Summary 


One hundred male schizophrenic patients 
were rated on Phillips’ Prognostic Rating 
Scale and Klopfer’s Rorschach Prognostic 
Rating Scale and 63 could be followed up to 
determine their status as either discharged, 
recovered, or still hospitalized. 

1. Section I of Phillips’ scale was signifi- 
cantly related to RPRS ego strength (total) 
and reality testing (form level). 

2. No significant relationship was found be- 
tween Sec. I of Phillips’ scale and RPRS de- 
terminants. 

3. Section I of Phillips’ scale and RPRS 
ratings for ego strength (total), form level, 
and determinants were significantly related 
to social recovery. 

4. Section I of Phillips’ scale and RPRS 
ego strength were significantly related to diag- 
nosis (type of schizophrenic reaction). 

5. No significant relationship was found 
between the age of the patient and RPRS ego 
strength or between age and Sec. I of Phillips’ 
scale. 

6. Implications for predicting the outcome 
of illness of schizophrenic patients were dis- 
cussed. 


From the results of many well controlled 
investigations of the effects of ataractic drugs 
on psychiatrically hospitalized patients, there 
can be little doubt but that many of these 
drugs act as a stimulus in modifying the 
psychotic’s behavior and symptomatology. As 
Staudt and Zubin (1957) point out, however, 
the important question is not so much whether 
the therapeutic agent stimulates an immedi- 
ate change, but whether the agent has any 
significant effect on longitudinal recovery 
rates. The use of control and experimental 
groups in an adequate double blind design, 
in which the experimental (drug) group was 
found to have improved significantly in com- 
parison with the control group, does not be- 
gin to answer the important question raised 
by Staudt and Zubin. In such studies, the use 
of the control group allows the researcher to 
reach conclusions regarding only the immedi- 
ate stimulus effect of the drug under investi- 
gation. 

Perhaps the best way to evaluate the over- 
all contribution of these drugs would be to 
compare the release rates, length of hospitali- 
zation, and return rates of two identical hos- 
pitals; one committed to a therapeutic in- 
volvement with the psychotic patient without 
drugs, the other relying on drugs as the ma- 
jor treatment method. Another possibility 
would be to compare the same hospitals’ dis- 
charge and return rates before and after the 
introduction and extensive use of drugs as a 


1 The data for this study were gathered at the 
VA Hospital, Salt Lake City, Utah, under the sup- 
port of the Psychiatric Evaluation Project, Richard 
L. Jenkins, Director. 


treatment method. It is the purpose of this 
paper to present such comparative data in an 
effort to evaluate the effect of ataractic drugs 
on the over-all rehabilitation of the psychotic 
patient. 


Procedure 


The hospital under consideration is a small, well 
staffed Hahn-type Veterans Administration Hospital. 
The Hahn-type hospital was designed so that each 
ward operated as a small treatment unit (24 to [...] 
beds), each with its own staff, dining facilities, and 
day rooms (Anderson & Fechner, 1956). Until very 
recently, each treatment ward in this hospital was 
staffed by a treatment team consisting of a psychia- 
trist, psychologist, social worker, and nurse. Only 
rarely did a treatment team have responsibility for 
more than two treatment units. The aide-patient 
ratio ranged from 1:2 for closed wards to 1:8 for 
some open wards (all 3 shifts). By the minimum 
suggested standards for NP hospital staffing ratios 
(American Psychiatric Association), this hospital was 
considered to be very adequately staffed. 

The two periods used to evaluate the effectiveness 
of the widespread use of ataractic drugs in this well 
staffed hospital were: July 1, 1953 to July 11, 1954 
(1 to 2 years prior to the introduction of ataractic 
drugs to the hospital); and July 1, 1956 to April 
29, 1957. A recent survey revealed that 89% of all 
psychotic patients were on ataractic medication con- 
tinuously from the time of their admission up to 
(and often beyond) the time of their discharge, and 
that 96% of all psychotics received ataractic medi- 
cation sometime during the course of their hospitali- 
zation. Thus, the major change in this hospital be- 
tween 1953-54 and 1956-57 was the introduction 
and widespread use of ataractic drugs in the treat- 
ment of the psychotic patient. 

The patients under consideration were the 200 con- 
secutive psychotic admissions between July 1, 1953 
and July 11, 1954; and the 200 consecutive psychotic 
admissions between July 1, 1956 and April 29, 1957. 
All patients were male, nongeriatric veterans who 


Table 1 

Length of Hospitalization and Return Rates of 1953-54 (predrug) 
Psychotic Admissions and 1956-57 (drug) Psychotic Admissions 

                                      1953-54     1956-57     Significance of 
                                      Predrug     Drug        Difference* 
Median length of hospitalization 
  of all patients (N = 200)           116 days    110 days    N.S. 
Return rate to hospital within 1 
  year of discharge of all 
  patients (N = 150)                  25.3%       31.3%       N.S. 
Median length of hospitalization 
  of 134 1953-54 patients and 134 
  1956-57 patients matched on 
  marital status and residence        112 days    92 days     N.S. 


* Median test, chi square analysis. 


were diagnosed as functionally psychotic at the time 
of admission. The admission diagnosis of functional 
psychosis was used since an admission diagnosis of 
psychotic was 86% accurate as compared with the 
discharge diagnosis. Actually, the diagnosis of a re- 
covered patient, who was clearly psychotic at the 
time of admission, was often changed to one of non- 
psychosis upon discharge. The admission diagnosis 
thus appeared to be adequate for the purposes of the 
[...]iability of the subdiagnoses, the diagnoses of 
psychoses were not further broken down, although 
there appeared to be about the same proportion of 
affective psychotic states in both of the predrug and 
drug periods. 


The conclusions regarding the over-all effectiveness 
of the ataractic drugs on the rehabilitation of the 
psychotic patient are based on both a comparison of 
the median length of hospitalization and the return 
rates of predrug patients and [...]. The 
median length of hospitalization comparison demonstrates the 
efficiency of the hospital system but not necessarily 
the effectiveness of treatment. The return rates within 
a year following discharge of the predrug and 
drug groups are used, therefore, to evaluate the effec- 
tiveness of ataractic drugs on the over-all rehabilita- 
tion of the psychotic patient. 

[...] patients to be discharged and seen for one year 
after discharge would have involved [...]. 
[...] follow-up data were [...] 
75% of patients who left the hospital within 292 
days of admission [...] the predrug era, and 
75% of drug group patients who had left the hos- 
pital by 252 days from the time of admission. This 
40-day difference between the time it took for 75% of 
the predrug and 73% of the drug group to be dis- 
charged represented a difference of only four pa- 
tients. In other words, on the 252nd day from the 
date of admission, 150 1956-57 patients had been 
discharged, while 146 1953-54 patients had been dis- 
charged. Thus, the rates of return to this hospital 
are based on a total of 150 1953-54 patients and 150 
1956-57 patients. The return rates to this hospital 
were found to underestimate the return rates of psy- 
chotic patients to this and other hospitals within 
one year of discharge by 7%.² A careful personal 
contact follow-up of almost 100 psychotic patients 
revealed that of the patients who were rehospitalized 
within one year after discharge, the vast majority 
returned to the discharging hospital. 

The length of hospitalization and return rates to 
the hospital may be related to population character- 
istics and not necessarily to the type of treatment re- 
ceived. On all 400 patients in this study, data with 
regard to marital status, residence, service connec- 
tion, type of admission, etc. were analyzed in rela- 
tion to length of hospital stay and return to the 
hospital within one year of discharge. Married pa- 
tients were found to leave the hospital significantly 
sooner than single patients. Out of state residents 
remained significantly longer than residents. An ex- 
amination of the 1953-54 and 1956-57 psychotic pa- 
tient groups revealed that 47% of the 1953-54 ad- 
missions were married and 32% of the 1956-57 pa- 
tients were married. On the other hand, 2% more of 
the 1953-54 group were out of state residents. Con- 
sideration of both these factors indicated that the 


2 This follow-up is part of the research procedure 
of the VA Psychiatric Evaluation Project. 

1953-54 group was likely to remain in the hospital 
a shorter period of time, based on the difference in 
the population characteristics alone. In the later 
presentation of the median length of hospitalization 
figures, 134 1953-54 patients are matched on marital 
status and residence with 134 1956-57 patients. This 
matching allows for a more accurate estimate of the 
effect of the introduction of ataractic drugs on the 
length of hospitalization. None of those variables 
were related to the return rates. The married patient, 
for instance, was just as likely to return to the hos- 
pital within a year of discharge as the single patient. 


Results 


From data presented in Table 1, there were 
no statistically significant differences in the 
median length of hospitalization of 200 pre- 
drug and 200 drug psychotic admissions, nor 
in the return rates of the 150 1953-54 (pre- 
drug) and 150 1956-57 (drug) discharged 
patients. At any one given time an average 
of 3% (total group) to 5% (matched group) 
more drug patients had been discharged from 
the hospital compared with the number of 
predrug patients. For example, within 100 

days of admission, 86 1953-54 patients had 
been discharged, while 92 1956-57 patients 
had been discharged from the hospital. On 
the average of 6 to 7 days later, however, an 
equal number of 1953-54 patients had been 
discharged from the hospital. 

The direction of the differences in Table 1 
suggests that the ataractic drugs resulted in 
a more efficient hospital operation but a less 
effective treatment program.³ Since these dif- 
ferences were not significant, however, it was 
necessary to conclude that the use of ataractic 
drugs in the treatment of psychotic patients 
in a well staffed hospital has no demonstrable 
effect on the rehabilitation of the patient. 
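
The return-rate comparison can be checked with a short worked sketch. The counts 38 and 47 are not given in the text; they are inferred here as the only whole numbers out of 150 that round to the reported 25.3% and 31.3%. The uncorrected 2 x 2 chi square shortcut is used, which is an assumption about the exact form of the analysis.

```python
def chi_square_2x2(a, b, c, d):
    """Uncorrected chi square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

N_PER_GROUP = 150
returned_predrug = 38  # inferred: 38/150 = 25.3%
returned_drug = 47     # inferred: 47/150 = 31.3%

stat = chi_square_2x2(
    returned_predrug, N_PER_GROUP - returned_predrug,
    returned_drug, N_PER_GROUP - returned_drug,
)
CRITICAL_05_DF1 = 3.841  # .05 level, 1 degree of freedom
significant = stat > CRITICAL_05_DF1
print(round(stat, 2), significant)
```

Under these inferred counts the statistic is about 1.33, below the .05 critical value, which agrees with the N.S. entry in Table 1.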


Discussion 


The tangible within-hospital results of the 
use of ataractic drugs, however, are apparent. 
The wards are quieter, and ECT has been re- 
duced from 54 patients treated in 1953-54 to 
16 patients treated in 1956-57. These changes 
have made it a more comfortable place for 


3 Many of the 1956-57 patients were discharged 
and continued taking medication after leaving the 
hospital (based on trial visit reports). Even this ad- 
ditional factor of posthospital medication apparently 
made no difference in the over-all return rate of this 


group. 

the staff to work, and perhaps a more desir- 
able place for patients to come. The fact re- 
mains, however, that the tendency of this 
well staffed hospital to rely so heavily on the 
ataractics has resulted in no measurable in- 
crease in over-all patient rehabilitation. On 
rounds and in staff conferences, the patient 
who is not improving is usually treated by a 
change in the dosage of his medication or by 
a change from one drug to another. 

One possible explanation of the suggested 
higher return rate for patients treated with 
drugs is that the hospital was admitting more 
chronic patients in 1956-57, and, therefore, 
could expect a higher return rate following 
discharge. In a recent study on the rehabilita- 
tion of “hard core” chronic patients who had 
been hospitalized elsewhere at least five years 
prior to transfer (Ellsworth, Mead, & Clay- 
ton, 1958), it was reported that over 50% of 
these patients, with extremely poor prognoses, 
had been discharged from the hospital. The 
return rate, to any hospital within one year 
after discharge, was 31%. Since the return 
rate to this hospital reported for admission 
patients treated in the drug era (1956-57) 
was 31.3%, it would appear that chronic pa- 
tients have at least as good a chance of re- 
maining out of the hospital for one year as 
the more acute admission patients. It must be 
remembered that the 31.3% return rate of the 
1956-57 psychotic to this hospital underesti- 
mated the actual return rate to this and other 
hospitals by 7%. In light of this evidence, 
the conclusion that there is no essential dif- 
ference in patient rehabilitation as a result of 
the introduction of ataractic drugs is main- 
tained. 

Why are these drugs used and relied on so 
heavily in a well staffed psychiatric hospital? 
From the data of the present study, it is ap- 
parent that patients improved before the ad- 
vent of ataractic drugs at about the same 
rate as patients treated with these drugs. The 
heavy utilization of these drugs reflects the 
staff’s conviction of their effectiveness, which 
can perhaps best be attributed to the error 
of post hoc ergo propter hoc logic. In other 
words, the staff assumes that since many pa- 
tients improved following the administration 
of an ataractic drug, the drug must be the 
causative agent of that improvement. Actu- 
ally, of course, the administration of the drug 
was only one of several changes in the pa- 
tient’s life following his admission to the 
hospital. 


Summary 


1. The length of psychiatric hospitalization 
and return rates of two large groups of psy- 
chotic, male, nongeriatric patients admitted 
to a well staffed hospital were compared. One 


group (1953-54) was treated before the in- 
troduction of ataractic drugs. The other group, 
over 90% of whom were given ataractic drugs, 
was treated during the period 1956-57. 

2. Aside from such in-hospital effects as 
quieter wards and less ECT, there were no 
statistically significant differences in the me- 
dian lengths of hospitalization and the return 


rates of the two groups. 

3. A careful examination of the differences 
in population characteristics of patients ad- 
mitted during both years failed to account for 
the similarity of release and return rates dur- 
ing the drug and nondrug era. 


The purpose of this study is to predict the 
performance of brain damaged individuals on 
the Rorschach test by utilizing a recently 
developed theoretical approach to Rorschach 
analysis. This approach is based on the ap- 
plication of the Haskell Coaction Compass 
(1949) to personality dynamics and has 
proved fruitful in a number of studies (Clarke 
& Lodge, 1955; Lodge, 1953; Lodge & Gib- 
son, 1952; Lodge & Guyer, 1953; Lodge & 
Steenbarger, 1953). The application of Has- 
kell’s Coaction method to psychology is based 
on the observation by Lodge and Steenbarger ² 
that: 


Personality theory, historically as well as recently, 
has inclined to rest largely upon assumptions of dy- 
namic interplay between two sets of intrapsychic 
power systems... . Such standard dichotomies as 
thought and feeling, reason and instinct, ego and id, 
inhibition and facilitation, socialized and primitive 
reactions, “consciousness” and “unconsciousness,” etc., 
all suggest interdependent power systems of roughly 
comparable meaning. 


Interpretation of the Rorschach test, derived 
as it was from Rorschach’s views on Jung’s 
psychoanalytic formulations, has also tradi- 
tionally relied on the assumption of two such 
intrapsychic power systems. Lodge and Steen- 


1 An earlier version of this paper was presented at 
the Southeastern Psychological Association Meeting, 
Atlanta, Georgia, April 29, 1958. This research was 
supported, in part, by grant B-1459 from the Na- 
tional Institute of Neurological Diseases and Blind- 
ness, United States Public Health Service. The au- 
thors gratefully acknowledge the helpful suggestions 
of G. T. Lodge. 

2 From “Relations of the Personalities of Scientists, 
Normal Persons, Neurotics, and Psychotics: A Geo- 
metric Analysis,” by G. T. Lodge and C. J. Steen- 
barger to be published in Volume II of E. F. Has- 
kell (Ed.), An Introduction to Unified Science. 

barger (1953) have identified these variables 
as Control and Affect and provided opera- 
tional definitions for them in terms of Ror- 
schach responses. 

The determinants of any given Rorschach 
response are considered to reflect the inter- 
action of these two variables, which are de- 
fined so that form level represents the strength 
of the control component, while color, shad- 
ing, vista, and movement determinants are 
considered expressive of affect.³ Three levels 
of each are specified: weak (−), neutral (0), 
and strong (+), thus yielding nine possible 
qualitatively different relationships or types 
of coaction. 

Relationships between these variables can 
be visually represented conveniently on a 
cartesian coordinate system called the coac- 
tion compass (Fig. 1), whose X and Y axes 
represent, respectively, Control and Affect. 
Each response becomes a resultant vector of 
unit length, with its direction determined by 
the relative strength of the two coacting proc- 
esses. All response vectors in a given protocol 
may be geometrically combined to yield a 
single overall vector representing the deter- 
minant pattern of the entire record. This vec- 
tor may fall in any direction on the compass, 
due to the infinite number of possible inter- 
gradations between the nine coaction cate- 
gories, and its length may range between 0 
and 100% of the radius (the longer the vec- 
tor, the more consistent is a given response 
tendency and vice versa). 
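
The geometric combination of response vectors can be sketched as follows. This is only an illustration under stated assumptions and is not the published Lodge and Steenbarger procedure: each response is coded by hypothetical control and affect levels of -1, 0, or +1, a (0, 0) response is treated as contributing no direction, and the protocol vector is taken as the mean of the unit vectors, expressed as a percentage of the radius.

```python
from math import hypot

def response_vector(control, affect):
    """Unit vector for one response; levels are -1 (weak), 0, or +1 (strong).

    Assumption: a (0, 0) response has no direction and is a zero vector.
    """
    length = hypot(control, affect)
    if length == 0:
        return (0.0, 0.0)
    return (control / length, affect / length)

def protocol_vector(responses):
    """Mean of the response vectors, as percentages of the compass radius."""
    vectors = [response_vector(c, a) for c, a in responses]
    n = len(vectors)
    x = 100 * sum(v[0] for v in vectors) / n  # projection on Control (X)
    y = 100 * sum(v[1] for v in vectors) / n  # projection on Affect (Y)
    return x, y, hypot(x, y)

# Hypothetical protocol of four responses as (control, affect) levels.
responses = [(1, 1), (1, 0), (1, -1), (0, 1)]
x, y, length = protocol_vector(responses)
print(round(x, 2), round(y, 2), round(length, 2))
```

Under this sketch a record in which every response has the same coding yields a vector of 100% of the radius, while mixed codings shorten it, in line with the statement that longer vectors reflect a more consistent response tendency.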

3 The [...] orthogonality or independ- 
ence of [...] control and affect 
has been supported by the finding of substantially 
zero correlations between them in a [...] 
sample of protocols selected from previous coaction 
studies. 

Vectors falling in the right half of the com- 
pass reflect good form level, those in the left 
half, poor form level. Vectors falling in the 
upper half of the compass reflect the presence 
of affect, those in the lower half result from 
the absence of affect determinants. Axes of 
Integration and Rigidity are defined by bi- 
secting the right angles formed by the control 
(X) and affect (Y) axes. Positive integration 
(+ +) thus refers to strong control combined 
with strong affect, while weak control together 
with weak affect (— —) defines its negative, 
i.e., lack of integration. Similarly, Rigidity 
(+ —) refers to the ratio reflecting strong 
control with weak affect, while its opposite, 
Fluidity (— +), refers to weak control with 
strong affect. 
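
Because the Integration and Rigidity axes bisect the right angles formed by the X and Y axes, projections on them follow from a 45-degree rotation of the coordinates. The sketch below uses a hypothetical function name and assumes the sign conventions stated above: Integration is positive toward (+ +) and Rigidity is positive toward (+ −).

```python
from math import sqrt

def rotated_projections(x, y):
    """Project a compass vector (x, y) onto the Integration and Rigidity axes."""
    integration = (x + y) / sqrt(2)  # positive toward strong control, strong affect
    rigidity = (x - y) / sqrt(2)     # positive toward strong control, weak affect
    return integration, rigidity

# A vector lying on the Integration axis has no Rigidity component.
integration, rigidity = rotated_projections(50.0, 50.0)
print(round(integration, 2), round(rigidity, 2))
```

A negative Rigidity projection in this sketch corresponds to Fluidity (− +), and a negative Integration projection to lack of integration (− −).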

Previous studies have demonstrated the ef- 
fectiveness of coaction technique in differen- 
tiating the records of normal, superior (sci- 
entists), neurotic and psychotic individuals 
(Fig. 1) (Clarke & Lodge, 1955; Lodge, 
1953; Lodge & Gibson, 1953; Lodge & Guyer, 
1953; Lodge & Steenbarger, 1953). Briefly, 
some of the conclusions of these studies are 
(Clarke & Lodge, 1955; Lodge & Steenbarger, 
in press): (a) Psychologically healthy indi- 
viduals, by their greater strength in both the 
control and affect components, can be differen- 
tiated from the psychologically ill. (b) Weak- 
ness in the affect component clearly differenti- 
ates neurotics as a group from both normal 
and superior individuals. (c) Weakness in 
the control component definitely differentiates 
hebephrenics and catatonics from nor- 
mals and neurotics. Most psychotics also show 
a deficit in level of affect. (d) There is a con- 
tinuum of psychological health [...] 
to the integration axis and treatment [...] 
Rorschach determinant scores according to co- 
action principles effectively differentiates 
this [...]. 

The present study was initiated to [...] 
explore the value of [...] in personality 
research by testing [...] 
to the performance of [...], 
i.e., brain damaged. In light of the previously 
cited results of earlier coaction studies, 
known psychological effects of brain damage 
(Goldstein, 1942) and other Rorschach stud- 
ies of the brain damaged (Fisher & Gonda, 
1955; Klopfer & Kelley, 1946; Peña, 1953; 
Piotrowski, 1937, 1940; Reitan, 1955; 
Schreiber & White, 1954), two interrelated 
hypotheses were developed. 


[Figure 1: The Coaction Compass] 

Fig. 1. Mean vector tip positions for brain dam- 
aged and control Ss, as well as original and cross- 
validation population samples of normals, neurotics, 
and psychotics (Clarke & Lodge, 1955). 

1. Because cortical damage often results in 
intellectual and perceptual impairment, a 
relative deficit in form level is predicted, with 
a consequent low score on the control or X 
axis for a brain damaged (BD) group. 

This postulated deficit in form level or, at 
a behavioral level, control, is further assumed 
to result in a disturbed balance between forces 
of control and affect in the personality of the 
BD individual. Behaviorally, this disturbance 
might be expected to take the form of rela- 
tively consistent overcontrol, extreme lability 
or fluctuations between rigid control and un- 
controlled emotional outbursts. In the analo- 
gous Rorschach situation, such problems of 
behavioral integration would result in unsuc- 
cessful combination of control and affect de- 
terminants, i.e., minus form level when form 
is used in combination with other determi- 
nants (e.g., CF—, FC—), and the use of only 
one kind of determinant, such as F alone or 
pure C. These considerations lead to the sec- 
ond hypothesis. 


TABLE 1 

AGE, SEX, AND EDUCATIONAL CHARACTERISTICS OF CONTROL AND BD SUBJECTS 

                         Age             Sex        Education 
Subjects         N     Mean   Range     M    F     Mean   Range 
Controls         20    34.0   15-55     10   10    10.9   3-18 
Brain damaged    20    31.6   15-55     10   10     9.9   5-14 


2. BD individuals will have a shorter pro- 
jection on the axis of integration (X’). 

There would seem to be little psychological 
basis for a differentia] prediction concerning 
the expression of affect determinants in a BD 
group other than to anticipate considerable 
variability reflecting the reaction of different 
personalities to various kinds of damage. 


METHOD 


[...] in differential diagnosis with [...] 
damage as one possibility. As Piotrowski (1940) has 
[...] group would appear 
to be the most relevant. From a pool of 80 patients 
[...], 20 patients with unequivocal brain 
damage were matched for age, [...] 
[...]nosis ruled out brain damage (Table 1). 
[...] for inclusion in the BD [...] 
syndrome following a variety of CNS disease and 
[...] primarily, tumor removal, CVA, [...] 

members of the Control group, thus determined, con- 
sisted largely of neurotic patients with considerable 
depression, anxiety, and psychosomatic or hysterical 
symptomatology. Five of the Control group patients 
were diagnosed schizophrenic. All information rele- 


vant to the selection of subjects (Ss) was obtained 
from their medical charts rather than their psycho- 
logical test records. 

As seen in Table 1, the age range was 15 to 55 for 
both groups. The mean age of the BD group was 
31.6 as compared with 34.0 for the Control group, 
a nonsignificant difference which, in any event, would 
operate against a prediction of greater deficit in the 
BD group. The difference between the mean levels of 
education, 9.9 for the BD group and slightly higher 
(10.9) for the Control group, was also insignificant. 

All of these Ss had scorable Rorschach records in 
their psychological file. No restrictions were placed 
on the number of responses or rejections since this 
was considered to reflect realistically the clinical 
situation with respect to the types of cases under in- 
vestigation. Although previous coaction studies have 
utilized Beck’s scoring, Phillips’ Revised Develop- 
mental Scoring System (Phillips, Kaden, & Waldman, 
1957) was used here. Phillips’ system is considered 
particularly suited for the analysis of organic rec- 
ords because it gives credit for the positive elements 
in a percept whose over-all organization is inade- 
quate. In adapting Phillips’ definitions of adequacy 
and specificity of form to coaction levels of +, 0, 
and —, the only significant departure from previous 
studies was to equate responses scored “vague” in 
Phillips’ system with 0 form level on the coaction 
compass. Since some previous studies have empha- 
sized the prevalence of vague W responses in proto- 
cols of organics, it seemed this might help differenti- 
ate the records (Peña, 1953; Schreiber & White, 
1954). It should also be noted that Phillips’ system 
tends to give credit for a greater number of determi- 
nants as a result of the conditions under which a re- 
sponse is broken down into separately scored per- 
cepts. In scoring + and − form level, Phillips’ re- 
vised rules were supplemented by the tables in Phillips 
and Smith (1953). 

The records were rescored in the Phillips system 
by the senior author, using the original scoring by 
the examiners of these patients as a guide for deter- 
minants and form level ratings. The coaction com- 
pass vector for each record was then computed ac- 
cording to the method outlined by Lodge and Steen- 
barger (1952). To guard against possible bias due to 
knowledge of the composition of the groups as well 
as to establish the reliability of the scoring system, 
the 40 records of both the BD and Control groups 
were again scored by a research technician familiar 
with the Phillips system but unfamiliar with the in-
tent of the study and the nature of the population.*
The reliability scoring of the variables which affect
coaction analysis revealed 92% agreement per proto-
col regarding the definition of responses, 997% agree-
ment concerning the presence of determinants other
than form, and 88% agreement on the scoring of
form level. This degree of agreement was essentially
the same for both groups and indicates satisfactory
reliability of scoring.

As a further estimate of the representativeness of
the Ss, the records were rated for Piotrowski’s signs
(1937). The BD group averaged 3.9 Piotrowski signs
per record, with 9 records scoring beyond the recom-
mended cutoff for diagnosing organicity of five or
more signs. The Control group averaged 1.95 signs
per record, with none showing more than four signs.
This detection of 9 out of 20 “true” positives and
no “false” positives is similar to Fisher and Gonda’s
findings (1955).

* The authors gratefully acknowledge the assist-
ance of Shirley Crooke, who carried out the reli-
ability study.
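
The detection figures for the Piotrowski signs can be restated as rates. The short sketch below is only an illustration: it uses the counts given in the text (9 of 20 BD records beyond the cutoff, none of the 20 Control records), and the function name and the terms "sensitivity" and "specificity" are our own labels rather than the authors'.

```python
def detection_rates(true_pos, n_cases, false_pos, n_controls):
    """Return (sensitivity, specificity) for a sign-count cutoff."""
    sensitivity = true_pos / n_cases
    specificity = (n_controls - false_pos) / n_controls
    return sensitivity, specificity

# Counts reported in the text: 9 of 20 BD records reached the cutoff,
# 0 of 20 Control records did.
sensitivity, specificity = detection_rates(9, 20, 0, 20)
print(f"sensitivity = {sensitivity:.2f}, specificity = {specificity:.2f}")
```
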


RESULTS 


Figure 1 illustrates the resultant mean vec-
tor positions with respect to the X and Y axes
for both the BD and Control groups. For the
Control group, the mean projection on the X
axis was 35.84 (percentage of radius). This
is similar to projections previously reported
for neurotic groups of 32.59 and 48.56 (see
Table 2). However, this group showed consid-
erably less variation than the previous groups
with a standard deviation of only 4.12. This
suggests that as regards form level and its
suggested analogue of control, these Control
Ss (who were chosen on the basis of a cer-
tain behavioral similarity, namely, organiclike
symptoms) are a relatively homogeneous
population in terms of this personality factor.


TABLE 2

SUMMARY OF THE MEANS, SIGMAS, AND SIGMAS OF THE MEANS OF PROJECTIONS
UPON THE AXES OF CONTROL AND AFFECT, INTEGRATION AND RIGIDITY, FOR
BRAIN DAMAGED AND CONTROL GROUPS OF THE PRESENT STUDY AS WELL
AS ORIGINAL AND CROSS-VALIDATION SAMPLES (a)

(The units used in these tables are percentages of the radius of the coaction compass.)

Group                     Statistic     Control    Affect   Integration   Rigidity

Brain damaged (N = 20)    Mean             8.05     −6.14        1.54      −10.00
                          Sigma Dis.      21.28     36.14       27.94       31.43
                          Sigma Mn.        4.88      8.29        6.41        7.21
Control group (N = 20)    Mean            35.84      6.65       29.14      −21.51
                          Sigma Dis.       4.12     20.28       18.92       15.89
                          Sigma Mn.         .94      4.65        4.34        3.64

Original data
Psychotics (N = 25)       Mean             2.54    −25.21       16.10       19.66
                          Sigma Dis.      26.03     28.54       24.27       30.05
                          Sigma Mn.        5.31      5.82        4.95        6.13
Neurotics (N = 27)        Mean            32.59    −20.81        8.33       36.61
                          Sigma Dis.      10.68     16.07       12.25       15.69
                          Sigma Mn.        2.10      3.15        2.40        3.08
Scientists (N = 19)       Mean            42.65     15.84       41.35       18.95
                          Sigma Dis.       5.73     20.03       14.98       14.50
                          Sigma Mn.        1.39      4.86        3.64        3.52

Cross-validation data
Psychotics (N = 30)       Mean             9.22    −13.88       −3.29       16.33
                          Sigma Dis.      15.17     26.43       22.03       21.05
                          Sigma Mn.        2.77      4.83        4.02    [illegible]
Neurotics (N = 30)        Mean            48.56    −18.95       20.93       47.15
                          Sigma Dis.      12.63     22.82       17.41       19.44
                          Sigma Mn.        2.31      4.17        3.18        3.55
Normals (N = 30)          Mean            46.39     −2.83       30.80       34.82
                          Sigma Dis.      13.16     19.86       18.51       15.01
                          Sigma Mn.        2.40      3.63        3.38        2.74

(a) Clarke and Lodge, 1955.

The mean projection on the X axis for the 
BD group was 8.05 with a sigma of 21.28. 
Twelve BD Ss scored below the lowest Con- 
trol S’s score of 12.5, and only one BD S 
scored above the Control group mean. The 
Mann-Whitney U test was utilized to test the 
hypothesis that the two groups represented 
different populations with respect to this di- 
mension. The results supported the initial pre- 
diction of a lower score on the Control axis 
for the BD group with p < .001. In compari- 
son with results of earlier studies, the BD are 
most similar to psychotics (functional) on 
this dimension (Table 2). 
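
For readers unfamiliar with the Mann-Whitney U test used here, a minimal sketch of the statistic follows. The scores in the example are hypothetical and are not the study's records; the function name is ours, and the sketch computes only U (with midranks for ties), not the significance level.

```python
def mann_whitney_u(sample_a, sample_b):
    """Return (U_a, U_b) for two lists of scores, using midranks for ties."""
    pooled = sorted(sample_a + sample_b)
    ranks = {}
    i = 0
    while i < len(pooled):
        j = i
        while j < len(pooled) and pooled[j] == pooled[i]:
            j += 1
        # Positions i..j-1 hold tied values; their 1-based ranks are i+1..j.
        ranks[pooled[i]] = (i + 1 + j) / 2.0
        i = j
    n_a, n_b = len(sample_a), len(sample_b)
    rank_sum_a = sum(ranks[x] for x in sample_a)
    u_a = rank_sum_a - n_a * (n_a + 1) / 2.0
    u_b = n_a * n_b - u_a
    return u_a, u_b

# Hypothetical Control-axis projections (not taken from the study).
bd_scores = [2.0, 5.5, 8.0, 12.0]
control_scores = [12.5, 30.0, 35.0, 40.0]
u_bd, u_control = mann_whitney_u(bd_scores, control_scores)
print(u_bd, u_control)
```

A U of zero for one group means that every score in that group falls below every score in the other, which is the pattern of complete separation.
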

The second prediction, that the BD group
would show a shorter projection on the Inte-
gration axis, was also supported with a p <
.001 according to the results of the Mann-
Whitney U test. Table 2 indicates a mean of
1.54 for the BD and 29.14 for the Controls on
this axis. The distribution was such that only
five Control Ss scored below the median pro-
jection of 19.5 on the axis of integration, while
only five BD Ss scored above this median.

With regard to a breakdown of the response
types that contributed to these two results, it
is interesting to note that there were no sig-
nificant differences between the groups in the
number of vague responses. The prevalence
of minus form level in the BD records,
both in responses determined by form alone
and combined with other determinants, con-
tributed most to the present findings. It
should be noted, however, that the final pro-
jection on the Control axis represents a dif-
ferent kind of information than that given by
the F+% conventionally used in Rorschach
scoring, since every response is weighted either
+, −, or 0, for both form and affect. Of the
BD records, 70% contained at least one W
minus response, whereas this sign was present
in only 20% of Control group records, a dif-
ference significant beyond the .01 level. Inci-
dentally, the total number of responses given
by both BD and Control Ss was unexpectedly
similar, with both groups averaging 14.0 re-
sponses per protocol.

With respect to affect determinants, the 
only difference between the groups was the 
anticipated greater variability among the BD 
scores, which was significant at the .02 level 
of confidence according to an F test applied 
to the difference between the variances. Previ- 
ously tested psychotic groups also showed 
wide variability on the affect dimension. The 
mean projection on the Y axis was +6.65 for 
the Control group and —6.14 for the BD. The 
tendency of the Phillips system to recognize 
a greater number of determinants may con- 
tribute to the higher score of this Control 
group in comparison with previous norms for 
neurotic groups (Table 2). 
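
The variance comparison on the affect dimension can be retraced from the sigmas in Table 2 (36.14 for the BD group and 20.28 for the Control group). The sketch below assumes the usual form of the F test for two variances, with the larger variance in the numerator and N − 1 degrees of freedom per group; the source does not spell out these details, so they are assumptions, and the function name is ours.

```python
def variance_ratio(sd_larger, sd_smaller):
    """F ratio for two independent samples, larger variance over smaller."""
    return (sd_larger / sd_smaller) ** 2

# Affect-axis sigmas from Table 2; N = 20 in each group.
f_affect = variance_ratio(36.14, 20.28)
degrees_of_freedom = (20 - 1, 20 - 1)  # assumed N - 1 per group
print(f"F = {f_affect:.2f} with df = {degrees_of_freedom}")
```

The resulting ratio would then be compared with a tabled F value for the stated degrees of freedom.
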


DISCUSSION 


The finding of significantly lower scores on 
the Control or X axis for the BD group con- 
firms the first hypothesis and suggests that 
cortical damage is associated with relatively 
inadequate form perception. This kind of im- 
pairment may be interpreted as symptomatic 
of a more general deficit in ego strength 
(Phillips & Smith, 1953) or, in coaction terms,
control, and related to inadequacies in re-
ality testing.

The second hypothesis concerned the Inte- 
gration of what has been defined as control 
with factors considered to reflect emotional 
responsivity. Its confirmation by a signifi- 
cantly shorter projection on the Integration 
axis for the BD group is largely due to the 
poor form level. This leads to the generaliza- 
tion that weakened control at a behavioral 
level is the primary factor leading to a dis- 
turbed balance of control and affect compo- 
nents of personality and their resultant inade- 
quate integration. The wide range of scores 
along the affect dimension for the BD group 
suggests that this leads to a variety of emo- 
tional adaptations. On the whole, the data 
obtained indicate that the BD group is more 
similar to psychotic groups than to either 
normal or neurotic populations. This similar- 
ity is interesting in light of the fact that pa- 
tients with functional psychoses frequently 
manifest organiclike symptoms. 

In view of the fact that coaction studies 
thus far have focused largely on comparing
the Rorschach performance of different diag- 
nostic groups, further implications of these 
findings would require research correlated 
with more specific behavioral assessment. For 
example, a promising area for future research 
suggested by the results of the present study
might involve investigation of the correspond- 
ence between scores on the integration-dis- 
integration and fluidity-rigidity continua 
with actual behavioral manifestations of in- 
adequate integration of control and affect in 
brain damaged individuals. 

At present, coaction methodology represents
primarily a systematization of Rorschach de-
terminant scoring. This leads to the general
question of why, in the absence of clearly
specified behavioral correlates related to con-
trol and affect, coaction studies have con-
sistently yielded results which significantly
differentiate among healthy groups as well as
among various categories of psychopathology.
One hypothesis is that coaction analysis rep-
resents a geometric encoding of certain im-
plicit inferential processes involved in the
diagnostic behavior of clinicians. Lodge and
Steenbarger (see fn. 2) have implied this in
their suggestion of the fundamental equiva-
lence of major personality theory concepts
which tend to rest on the basic assumption
of a two-variable coacting intrapsychic power
system. It may be that Kraepelinian and psy-
choanalytic models, despite differences in ter-
minology, tend implicitly to utilize a common
logic to analyze, differentially weight, and
finally integrate information into diagnostic
statements. Disregarding, for the moment, the
problem of the validity of commonly used
categories (for example, the distinction be-
tween neurotic and psychotic), there is the
possibility that the formal geometric system
imposed upon Rorschach data by coaction
scoring represents a significant [illegible]
to the principal common elements in the logic
of diagnosis.

If there is, then, some theoretical justifica-
tion for exploring possible interactions be-
tween two such variables, for which control
and affect are convenient approximations, the
general framework of a cartesian coordinate
system permits representation of the possible
interactions between them according to a con-
sistent logical system. As a geometric model,
the two-dimensional coordinate system has the
advantage over traditional verbal conceptual
frameworks of avoiding the possible imposi-
tion of unidimensionality upon the interac-
tions of a two-variable system. It also con-
veniently permits visual comprehension of
these numerous relationships simultaneously.
At the same time, of course, it involves a net- 
work of geometric conventions. These, how- 
ever, lead to explicit hypotheses which are 
subject to test. The relation of this geometry 
to the problem of diagnosis, its language sys- 
tem and empirical referents, then, are areas 
of research suggested by the results of this 
and other coaction studies. In view of the 
present lack of unification in the language 
system of psychopathology, especially in its 
relationship to concepts of mental health, it 
would seem crucial to clarify whatever under- 
lying logical framework may exist. 


SUMMARY 


Coaction analysis of Rorschach divides de- 
terminants into two variables: Control (form 
level) and Affect (color, shading, and move- 
ment). Interactions between these determi- 
nants are geometrically plotted to yield a re- 
sultant vector on a cartesian coordinate sys- 
tem. On the basis of symptoms described in 
the brain damaged, it is predicted that they 
will manifest a breakdown of form level in 
combination with other determinants to re- 
sult in (a) a lower score on the Control axis 
and (b) a shorter projection on the Integra- 
tion axis. 

From a pool of 80 patients referred for psy- 
chological testing with questionable organic 
brain disease, 20 patients with unequivocal 
brain damage were matched on age, sex, 
color, and education with 20 patients whose 
ultimate medical diagnosis ruled out brain 
damage. The Rorschach records of these pa- 
tients were scored according to the Phillips 
Revised Rorschach Developmental Scoring 
System and then plotted on the coaction 
compass. 

The results confirm the predictions and in- 
dicate that the BD population is similar to 
previously tested psychotic groups in
terms of weakened control, poor integration
of control and affect, and considerable vari-
ability along the affect dimension.


INTRA-Q DECK RELATIONSHIPS AS INFLUENCES
AND REALITIES IN PERSONALITY ASSESSMENT *


PETER F. BRIGGS AND ROBERT D. WIRT


Statistical controls are thought to provide a
means of convergence between the ambiguous
and the known. In the analysis of data drawn
from Q sorts it has been found that stereo-
types such as sick-well (Kogan, Quinn, Ax, &
Ripley, 1956) and social desirability-undesir-
ability (Edwards, 1957) can be identified.
The suggestion is then made that the effects
of these stereotypes be controlled either by
item selection or by correlational techniques.
These recommendations rest upon work aimed
at identifying stereotypes that parallel a
good-bad dimension. Much has been made of
the ease with which personality descriptive
items can be ordered on such dimensions. The
actual validity of such descriptions has not
been examined extensively.

The Q sort technique produces an amal-
gamation of sorter, subject, and Q deck char-
acteristics which are difficult to refract. The
group or case descriptions resulting from these
multiple influences are usually considered pri-
mary data; that is, they are rarely checked
against other descriptions of the same cases.
This report includes an examination of Q
deck stereotype descriptions as they relate to
(a) known characteristics of samples, (b)
each other, (c) item variance, and (d) reli-
able group differences as reflected by the
items in the deck.


* ... supported by research grants ...


PROCEDURE


The cases. Four groups were examined. Each had
approximately 70 cases who were selected because of
their Minnesota Multiphasic Personality Inventory
(MMPI) patterns and their delinquency status. Two
samples had MMPI codes that contained a peak on
D, or Mf, or Si, the scales which other research has 
shown to be associated with a low rate of delin- 
quency (Hathaway & Monachesi, 1953; Wirt & 
Briggs, 1959). One of these groups had no record 
of delinquency, and one was composed of delin-
quents. Another pair of delinquent and nondelin-
quent samples had MMPI high point codes con-
taining a two-scale combination of Pd, or Sc, or Ma.
These are scale combinations which have been shown 
to be associated with delinquency proneness (Hath- 
away & Monachesi, 1953; Wirt & Briggs, 1959). The 
two obvious characteristics of the groups were the 
delinquency versus nondelinquency dichotomy, and 
the personality test differences, which suggest a 
neurotic versus character disorder dichotomy. The 
groups also differed greatly in achievement (school 
and job success) as well as other forms of construc- 
tive community activity. The neurotic nondelinquent 
sample was the most successful group on these cri- 
teria of success. The two delinquent samples were in- 
distinguishable in achievement and clearly below 
both nondelinquent samples. 

The rating of each sample as well as the collec-
tion of additional information has been reported
elsewhere (Wirt & Briggs, 1959). Cases from each
sample were seen for intensive interviews. On the
basis of these, the interviewers sorted a Q deck for 
each S. 

The Q deck. The deck used in this study was de- 
veloped by Block (1954). It contained 115 items to 
be distributed over a forced nine-point distribution. 
Two aspects of item variability were examined. These 
were the size of the variance when an item was used 
as descriptive of a group, and the relation of items 
showing a significant difference among group means. 

The interviewers. The interviewers were five gradu- 
ate students in psychology or social work, all of 
whom had had at least three years of clinical ex-
perience.2

Stereotype sorts. Following the final interview and 
Q sort, the interviewers were instructed as follows: 


Using your experience with the deck for this sam-
ple of cases, sort the Q deck for the five concepts
listed below: (a) social desirability of the traits in
the deck; (b) difficulty of sorting each item in the
deck; (c) impression you formed of the typical
delinquent; (d) impression you formed of the
typical nondelinquent; (e) impression you formed
of the typical neurotic.


2 The interviewers in this investigation were Doug-
las Freed, Arthur Gallese, Ronald Johnson, A. M.
Marchionne, and Jay Willet.


TABLE 1

MEAN INTERCORRELATIONS FOR EACH STEREOTYPE
AMONG FIVE SORTERS

Stereotype             N    Mean r
Delinquent             3    .32
Nondelinquent          3    [illegible]
Neurotic               3    [illegible]
Social desirability    5    [illegible]
Item difficulty        5    .41**

** P [illegible]


RESULTS 


The mean Pearsonian correlation coefficients 
among sorters for each of the five sorts were 
calculated using Fisher’s z transformation.
They are given in Table 1. The concept of de- 
linquency was poorly agreed upon (r = .32). 
For this reason it was dropped from subse- 
quent analyses even though there was some 
reliable covariance. The agreement on diffi- 
culty in sorting was not encouragingly high. 
The remaining values appeared to be re- 
spectable interjudge correlations. 
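
Averaging correlations through Fisher's z transformation, as described above, can be sketched as follows. The three coefficients in the example are hypothetical and are not the sorters' actual intercorrelations; the function name is ours.

```python
import math

def mean_correlation(rs):
    """Average correlations by way of Fisher's z: r -> atanh(r), mean, tanh."""
    zs = [math.atanh(r) for r in rs]
    return math.tanh(sum(zs) / len(zs))

# Hypothetical pairwise correlations among sorters (not the study's values).
pairwise_rs = [0.30, 0.50, 0.70]
r_bar = mean_correlation(pairwise_rs)
print(f"mean r via Fisher's z = {r_bar:.3f}")
```

Because the transformation stretches values near the ends of the scale, the mean obtained this way lies slightly above the simple arithmetic mean when the correlations are positive and unequal.
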

Mean item ratings for the four samples and 
the stereotypes were intercorrelated. These re- 
sults are given in Table 2. The matrix of in- 
tercorrelations in Table 2 has three subdi- 
visions: the interstereotype correlations, the 
intersample correlations, and correlations be- 
tween the stereotype sorts versus the mean 
sample sorts. 

The four samples were compared with each
other in their relationship to the three stereo-
types having the highest interrater agreement:
nondelinquency, neuroticism, and social desir-
ability. These results are shown in Table 3.
The validity of these stereotypes was then as-
sessed in terms of known characteristics of
the samples.


TABLE 2

CORRELATION COEFFICIENTS

[Correlation matrix; cell values not recoverable. Rows and columns:
neurotic nondelinquent sample, neurotic delinquent sample, character
disordered nondelinquent sample, character disordered delinquent
sample, nondelinquent stereotype, neurotic stereotype, social
desirability stereotype, item difficulty stereotype.]


TABLE 3

SIGNIFICANT SIMILARITIES BETWEEN STEREOTYPES AND SAMPLES

                                                          Stereotype
                                              Nondelinquent   Neurotic      Social Desirability
Samples Compared                              Like Sample     Like Sample   Like Sample

Neurotic nondelinquent (I) vs.
  neurotic delinquent (II)                    [illegible]     II**          -
Neurotic nondelinquent (I) vs.
  character-disordered nondelinquent (III)    [illegible]     I**           [illegible]
Neurotic nondelinquent (I) vs.
  character-disordered delinquent (IV)        -               [illegible]   -
Neurotic delinquent (II) vs.
  character-disordered nondelinquent (III)    [illegible]     [illegible]   [illegible]
Neurotic delinquent (II) vs.
  character-disordered delinquent (IV)        IV**            [illegible]   -
Character-disordered nondelinquent (III) vs.
  character-disordered delinquent (IV)        [illegible]     [illegible]   [illegible]

* P < .01.
** P < .001.

Nondelinquent stereotype and known non-
delinquency. The samples of nondelinquents
were accurately described as more nondelin-
quent than the delinquent samples with simi-
lar personality test patterns. The nondelin-
quent neurotic sample appeared about the
same as the character disordered delinquent
sample. In this case the stereotype did not
distinguish the known group behaviors. How-
ever, on the other cross-personality pattern
comparison a significant difference was found:
the nondelinquent character disordered group
was similar to the nondelinquent stereotype,
and the delinquent neurotic code group was,
correctly, not described as nondelinquent.
The surprising thing in these findings is the
degree to which the character-disordered
samples appeared more nondelinquent
than the neurotic samples. Other data show
that, in fact, the character-disordered sam-
ples consisted of individuals with a lower re-
gard for the mores than most individuals with
test findings indicating neurotic disturbance.
The nondelinquent stereotype apparently was
complicated by other influences.

Neurotic stereotype and personality test 
findings. The results for the neurotic stereo- 
type group comparisons also appear in Table 3. 
Here the neurotic and the character disor- 
dered groups were distinguished on all occa- 
sions. It is interesting to note that the delin- 
quent neurotic sample was described as more 
neurotic than the nondelinquent sample with 
similar MMPI profiles. This suggests a sur- 
plus meaning in the stereotype. It should be 
noted that, in fact, the delinquent samples 
were actually more psychopathic on the 
MMPI and in actual behavior than their 
nondelinquent test pairs (Wirt & Briggs, 
1959). 

Social desirability stereotype and other cri-
teria of social desirability. As indicated previ-
ously, on the criteria of social desirability,
namely school and job achievement and
constructive community enterprise, the
neurotic nondelinquent sample was superior
to all other samples. According to the stereo-
type developed by the raters, however, it was
the character-disordered nondelinquent sample
that seemed most desirable. Further, known
delinquency versus nondelinquency was not
reflected in social desirability as sorted by the
raters, although certainly delinquency is less
socially desirable than nondelinquency. The
relationship between the nondelinquent and
social desirability stereotypes was the highest
in the matrix (r = .86).


TABLE 4

THE NUMBER OF ITEMS JUDGED CHARACTERISTIC OF THE
SAMPLES AND STEREOTYPES WHICH OVERLAP
THE EXTREMES OF SOCIAL DESIRABILITY

                                                  Items
                                      Socially Desirable   Socially Undesirable
                                           (N = 19)              (N = 20)
Sample or Stereotype                    Like    Unlike        Like    Unlike

Neurotic nondelinquent sample             6        0            0       11
Neurotic delinquent sample                9        0            0        6
Character disordered
  nondelinquent sample                   10        0            0       11
Character disordered
  delinquent sample                       9        0            0        7
Nondelinquent stereotype                 12        0            0       13
Neurotic stereotype                       0        5            4        4
Item difficulty stereotype                0        4            3        4

The MMPI and delinquency status were 
effective variables, as were the nondelinquent 
and the social desirability stereotypes. All of 
these correlated with the other samples and 
stereotypes in ways which tend to reflect the 
known group characteristics.

The high correlations associated with social
desirability as an a priori stereotype account
for a quite uniform amount of variance in all
samples except the sample of nondelinquent
character disorder individuals. This sample
approached the stereotype of “good” person-
ality to a striking degree. It is important to
note that this finding is not paralleled in 
other, less subjective, data where it was found 
that these individuals would not be consid- 
ered socially desirable by conventional stand- 
ards (they are unachieving, promiscuous, un- 
reliable, etc.). 

Removal of the variance attributable to so- 
cial desirability would not appreciably alter 
the relationships in the sample matrix. An 
illustration of the overlap among all of the 
distributions underlying the matrix and the 
social desirability stereotype is provided in 
Table 4. Here the top and bottom 20 items 
in each sort were examined for overlap with 
the top 19 and bottom 20 of the social de- 
sirability items. It will be noted that in the 
nondelinquent stereotype and in the clinical 
groups there were no misplaced items. The 
correlations shown in Table 2 are, of course, 
a function of the frequency of these over- 
lapping items. However, the concreteness of 
the examples in Table 4 provides a convincing 
illustration of the strength of the correlation. 
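
The overlap count behind Table 4 can be sketched as below. This is an illustration only: the dictionaries of mean item placements are toy values, and the function and key names are ours. The defaults follow the text (top and bottom 20 items of a sort checked against the top 19 and bottom 20 social desirability items), and higher means are assumed to indicate "more characteristic" or "more desirable."

```python
def extreme_overlap(sort_means, desirability_means,
                    n_sort=20, n_desirable=19, n_undesirable=20):
    """Count how the extreme items of one sort fall among the desirability extremes."""
    def ranked(means):
        # Item ids ordered from highest to lowest mean placement.
        return sorted(means, key=means.get, reverse=True)

    sort_order = ranked(sort_means)
    desirability_order = ranked(desirability_means)
    like, unlike = set(sort_order[:n_sort]), set(sort_order[-n_sort:])
    desirable = set(desirability_order[:n_desirable])
    undesirable = set(desirability_order[-n_undesirable:])
    return {
        "desirable_like": len(desirable & like),
        "desirable_unlike": len(desirable & unlike),
        "undesirable_like": len(undesirable & like),
        "undesirable_unlike": len(undesirable & unlike),
    }

# Toy eight-item deck (hypothetical means), with extremes of two items each.
toy_desirability = {item: float(item) for item in range(1, 9)}
toy_sample_sort = dict(toy_desirability)  # a sort identical to desirability
counts = extreme_overlap(toy_sample_sort, toy_desirability,
                         n_sort=2, n_desirable=2, n_undesirable=2)
print(counts)
```

A "misplaced" item in the sense of the text would appear as a nonzero count in the desirable-unlike or undesirable-like cells.
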

The content of the items which constituted the
more desirable traits was intellectuality, emo-
tionality, independence, and perspective. The
undesirable traits seemed much more compli- 
cated and contained longer, more involved de- 
scriptions. The traits rated undesirable were 
highly interpretive and referred to intrapsy- 
chic phenomena; e.g., hostility, absence of 
warmth, and poor ego development. The ends 
of the distribution for the nondelinquent 
stereotype sorts were much the same as those 
for the social desirability items. 

The items from the extremes of the neurotic 
stereotype described insecurity and preoccu- 
pation with neurotic problems at the charac- 
teristic end, while at the nonneurotic extreme 
the items reflected an absence of concern with 
such thoughts and a happy aggressive mastery 
of situations. Again, the content of the nega- 
tive (more neurotic) items was more compli- 
cated and the descriptions were longer than 
the items sorted as more desirable. 


Content of items in relation to item diffi-
culty. An examination of the traits that were
regarded as difficult or easy to sort showed that
the more difficult items seemed to refer to
quite obscure facts, usually relating to covert
data or complex interpersonal relations. The
items rated as easily sorted, by contrast, re-
ferred to more observable ways of acting or
simply to expressed verbal opinions. The fre-
quencies of overlap between the extremes of
this distribution and the social desirability
dimension extremes are presented in Table 4.
The correlation between these two mean sorts 
was low (r = .33) but significant (P < .01). 
The small number of overlapping items shown 
in Table 4 reflects the weak relationship. 

Item variance and mean differences. A final
pair of measures of item performance studied
were the item variances within the samples
and the occurrence of significant mean differ-
ences among the sample means. From the 115
items, 24 with low variance were selected
(items with two sample variances out of
four below 3.00). Seventeen high variance
items were selected (items with two vari-
ances greater than 5.50). The proportion of
discriminating items in these two sets was
almost identical. There were six items mak-
ing 10 discriminations in the low variance set
and five items making 9 discriminations in the
large variance set. The items for the extremes of
social desirability were not evenly represented
among the items at the extremes of variabil-
ity. There were four such items in the 24-
item set with low item variability and 10 in
the 17-item set with high variability (P <
.01). Of these latter 10 items, five were from
each end of the social desirability continuum.
It was clear that the items with large vari-
ances were more often from the extreme of
social desirability than were items with small
variance. As can be seen in Table 4, the two
extremes of the social desirability stereotype
behaved consistently (although the items
were different).
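
The comparison of 4 of 24 low variance items with 10 of 17 high variance items can be laid out as a 2 x 2 table. The source does not say which test produced P < .01; the sketch below assumes an uncorrected chi-square with one degree of freedom, and the function name is ours.

```python
def chi_square_2x2(a, b, c, d):
    """Uncorrected chi-square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

# Rows: low variance set (24 items), high variance set (17 items).
# Columns: items at a social desirability extreme, all other items.
chi2 = chi_square_2x2(4, 20, 10, 7)
CRITICAL_01_DF1 = 6.635  # tabled chi-square value for P = .01 with 1 df
print(f"chi-square = {chi2:.2f}; exceeds .01 critical value: {chi2 > CRITICAL_01_DF1}")
```
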
The ease-difficulty of sorting was examined 
for the items in the two extremes of variation. 
There were 6 items of high variance and 11 
with low variance. The easy to sort items 
were all high variance items (5 high and 0 
low), while the hard to sort items were almost 
all low variance items (1 high to 11 low; 
P < .01). Larger variances were associated
with easier sorting (and with social desir-
ability). Items in the extremes of sorting
ease were, however, almost equally discrimi-
nating in group comparisons. The number of
items and discriminations made by the items
in each group mean comparison for the ex-
treme items showed no differences. Neither
the nondelinquent nor the neurotic stereotype
extreme items behaved differently with respect
to variance size. However, the high variance
items were more frequent in the nondelinquent 
stereotype extremes than were the low vari- 
ance items. This would be expected from the 
relationship between the nondelinquent and 
social desirability stereotypes. Neither set of 
extreme items was differently involved in 
group differences. 
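
The association between sorting ease and item variance reported above (easy items: 5 high and 0 low variance; hard items: 1 high and 11 low) involves cell counts small enough for an exact test. The source does not name the test used, so the following is a sketch assuming a one-sided Fisher exact test; the function name is ours.

```python
from math import comb

def fisher_exact_one_sided(a, b, c, d):
    """One-sided exact P for the 2 x 2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of all tables with the same
    margins whose upper-left cell is at least as large as the observed a.
    """
    row1, row2, col1 = a + b, c + d, a + c
    total = comb(row1 + row2, col1)
    p = 0.0
    for k in range(a, min(row1, col1) + 1):
        p += comb(row1, k) * comb(row2, col1 - k) / total
    return p

# Rows: easy to sort, hard to sort. Columns: high variance, low variance.
p_value = fisher_exact_one_sided(5, 0, 1, 11)
print(f"one-sided exact P = {p_value:.5f}")
```
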


DISCUSSION 


These data support the argument that 
stereotypes can be formed from Q deck items. 
This evidence does not, however, substantiate 
the idea that the stereotypes derived from a 
priori sorts are especially meaningful. This 
is particularly true of the social desirability 
concept. In the forced sort the content avail- 
able in the items as subtopics determines the 
form a stereotype may take. 

With this particular set of items a type of 
glib, almost psychopathic caricature emerges 
as the stereotype of the socially desirable per- 
sonality. It is quite conceivable that with 
other item sets quite a different stereotype 
might have emerged. To some degree, the 
items in this stereotype showed a high over- 
lap with all the sample descriptions. It is clear 
that the desirability continuum describes the 
nature of any group to the extent that no 
items from the undesirable extreme are re- 
versed in group description. Thus the ex- 
tremes of such a stereotype are apparently 
forced by the sorters’ knowledge of people. 
This is certainly not the typical meaning of 
a stereotype in psychological research. It more 
closely parallels the meaning of a cliché. This 
is not to imply that the items at the extremes 
of the stereotypes do not discriminate be- 
tween cases, for they manifest more than av- 
erage variability in the present study. 

Among the items in the deck some connec- 
tion between sorting ease and the stereotype 
extremes appeared clearly. Here it is sug- 
gested that stereotyping hinges upon base-
belief relationships. The easily sorted items
were useful in stereotyping while more ob- 
scure factors simply did not enter. Similarly 
the easier to sort items were more variable. 

As to the intradeck relationships and the 
items describing groups, there were no obvious 
limitations or correlations. The distinguishing 
characteristics of the groups seemed to emerge 
from all types of items. 

A final point is in order. Edwards (1957) 
and others have proposed that items reflect- 
ing stereotypes be removed or controlled. This 
is a difficult procedure, since to identify the 
particular influences to be removed requires 
a very extensive analysis of the deck. But 
more importantly, it appears that valuable 
items will certainly be deleted under such a 
system of selection. 


SUMMARY 


Q sort descriptions of several samples and 
stereotypes were made by five judges follow- 
ing intensive interviews. These descriptions 
were compared with known characteristics of 
the Ss. The mean correlation among sorters 
for each of the five sorts of delinquency, non- 
delinquency, neuroticism, social desirability, 
and item difficulty was significant. The sorters 
accurately described the nondelinquents and 
the neurotic samples. They described the de- 
linquents as more neurotic than the nonde- 
linquents and the character disorder samples 
as more nondelinquent than the neurotic sam- 
ples. In fact other data showed the character 
disorder samples to be more psychopathic on
the MMPI and in actual behavior than other 
groups. The raters described the character- 
disordered nondelinquents as the most socially 
desirable, while in fact this group was known 


to have a generally poor regard for social con- 
ventions. Known delinquency versus nonde- 
linquency was not reflected in social desir- 
ability as sorted by the raters. 

Items which were difficult to sort included 
content requiring interpretation of covert be- 
havior, while items rated as easily sorted re- 
ferred to more observable ways of acting. 
Items with large variance within samples were 
more often from the extremes of social desir- 
ability than were items with small variance. 

The data were interpreted to show that 
while the Q sort technique may be useful in 
developing definitions for stereotypes, the re- 
sulting concepts must be interpreted with 
caution. This seemed especially true of the 
social desirability stereotype. It was suggested 
that the removal or statistical control of items 
reflecting stereotypes requires extensive analy- 
sis and can result in deletion of valuable in- 
formation. 
THE LONG TERM STABILITY OF SELECTED
RORSCHACH RESPONSES ¹


JEROME KAGAN 


Fels Research Institute, Yellow Springs, Ohio 


Of the various instruments employed to 
elicit fantasy responses, the Rorschach has 
been both the most frequently used technique 
and the one subjected to the most empirical 
research. However, despite many years of re-
search effort, a large body of social scientists
still question Rorschach’s basic hypotheses 
and the reliability and validity of Rorschach 
derived variables. The purpose of this paper 
is to demonstrate that selected Rorschach 
variables show low, but statistically signifi- 
cant, degrees of intra-individual stability over 
periods of three and six years during adoles- 
cence and early adulthood. Since important 
personality changes often occur during these 
years one would not expect extremely high re-
liability coefficients over such a long period
of time. Thus, absence of stability for a spe-
cific variable would not mean that the vari-
able might not be reliable for shorter test- 
retest intervals. However, demonstration of 
a statistically significant degree of stability, 
albeit low to moderate, would suggest that the 
variable was reflecting an enduring aspect of 
human functioning and deserved more inten-
sive study. There is some indication that determinant
variables show low to moderate degrees of 
test-retest reliability when the interval be- 
tween tests is less than a week (Kaplan & 
Berger, 1956; Epstein, Nelson, & Tanofsky, 
1957). In the Epstein et al. (1957) research 
16 college students were presented with a dif- 
ferent series of specially designed ink blots 
twice weekly for five weeks. Kaplan and 
Berger administered the standard Rorschach 
stimuli four times to 28 subjects with an in-
terval of four days between each administra-
tion. In both studies movement responses
showed a moderate degree of reliability with
correlations in the forties and fifties. How-
ever, except for a longitudinal study of the
consistency with which the 10 standard blots
elicit specific responses (Ranzoni, Grant, &
Ives, 1950), there is no evidence on the long
term intra-individual stability of content or
determinant variables.

¹ This research was supported, in part, by a re-
search grant (M-1260) from the National Institute
of Mental Health, United States Public Health
Service.

Historically, initial interest in Rorschach 
responses was centered on the formal, percep- 
tual dimensions of location and determinant. 
However, during the last decade there has 
been a rapidly growing interest in the moti- 
vational content of Rorschach responses, with 
particular emphasis on aggressive content 
(DeVos, 1952; Elizur, 1949; Finney, 1955; 
Kagan, Sontag, Baker, & Nelson, 1958; 
Rader, 1957; Schafer, 1954; Towbin, 1955; 
Wolf, 1957; Zubin, Eron, & Sultan, 1956). 
The present study, therefore, was particu- 
larly interested in the stability of aggressive 
content. The intense interest in aggressive 
content seems due, in part, to the fact that 
Rorschach stimuli have a strong tendency to 
elicit content which appears aggressive in 
meaning, e.g., dead bug, animals fighting, 
people arguing, knives, guns, bombs explod- 
ing, etc. Undisguised evidence of other dy- 
namic motivational percepts (sex or depend- 
ency) appears with less frequency. That is, 
perceptions of objects or actions symbolic of 
sexual or dependent motives (genitals, breasts, 
dogs nursing, people kissing) are less fre- 
quent than imagery normally regarded as sym- 
bolic of aggression. Second, clinicians often 
must make important predictions about a pa- 
tient’s aggressive motives and the probability 
of occurrence of overt aggressive behavior. 
This practical consideration would direct the 
clinician’s attention to imagery which ap-
peared to be symbolic of aggressive thoughts.

In addition to aggressive content, the sta-
bility of seven additional variables was ex-
plored. The complete list of variables in- 
cluded: (a) number of responses (R), (b) 
dynamic aggression (Dyn. Agg.), (c) static 
aggression (Stat. Agg.), (d) anatomy (Anat.), 
(e) dependent-oral imagery (Dep.-Oral), (f) 
human and human detail responses (H), (g) 
human movement (M), (h) animal move- 
ment (FM), and (i) sum movement (sum M, 
FM, m). 

The stability of movement responses was 
investigated because aggressive content often 
involves a movement dimension and because 
many psychologists regard the movement re- 
sponse as one of the most important Ror- 
schach variables (Klopfer, Ainsworth, Klop- 
fer, & Holt, 1954; Phillips & Smith, 1953; 
Piotrowski, 1958). Human percepts (H) were 
analyzed in order to determine the differential 
stability of human percepts with movement 
versus human percepts with or without the 
presence of movement. Anatomy was scored 
because of its presumed diagnostic value for 
pathological conditions. Finally, dependent- 
oral imagery was scored in order to compare 
the stability of two motive categories (aggres- 
sion and dependency) that play an important 
role in the functioning of both normal and 
patient populations. Although we would have 
liked to score for location, form quality, color, 
shading, and vista, the nature of the avail- 
able data made this analysis impossible. The 
protocols were gathered over a long period of 
time by different examiners with varied in- 
quiry procedures and it was impossible to 
score for location and most determinants. 


METHOD 
Subjects and Procedure 


The subjects (Ss) were drawn from the Fels re- 
search population, which is a predominantly middle- 
class sample residing in southwestern Ohio. The fa- 
thers of these Ss were equally distributed among the 
following vocational classes: agriculture, skilled la- 
bor, tradesmen, and professional. The IQ scores of 
the Ss were all average or above with a mean Stan- 
ford-Binet IQ of 120 during the childhood years, 
and a mean Wechsler-Bellevue IQ of 123 during 
early adult life. For this study, 37 male and 38 fe- 
male Ss were selected for whom two or three stand- 
ard Rorschach protocols were available between 10 


and 20 years of age. In addition, 52 of these Ss (30 
males and 22 females) were adult members of a cur- 
rent assessment program which has been in progress 
for several years. The ages of the 52 Ss at the time 
of the adult assessment ranged from 19 to 29 with 
a median age of 25 years. 

The longitudinal Rorschach protocols were based 
on the 10 standard Rorschach plates which were ad- 
ministered at median ages of 10½, 13½, and 16½, with
30 of the males and 33 of the females having all
three administrations. As mentioned above, the ex-
aminers differed over the years, and the comparabil- 
ity with respect to the inquiry could not be guaran- 
teed. For this reason, only the initial, spontaneous 
verbalization of the S (the free association) was 
scored. In the adult assessment program a modifica- 
tion of the Rorschach stimuli was used which will 
be called the modified Rorschach. In this modified 
series, parts of the standard stimuli were covered 
with a template of white cardboard. Thus the S saw 
only a part of the original stimulus and gave only 
one response to each card. The 32 stimuli were com- 
mon D or d areas, each tended to be perceived as
a unitary Gestalt, and, with one exception, all the 
stimuli were either completely achromatic or chro- 
matic. This special instrument was designed for two 
reasons. First, it guaranteed an equal and fairly 
large response pool for all Ss. The statistical prob- 
lems associated with unequal protocol length have 
long plagued Rorschach investigators, and this in- 
strument was a beginning attempt to solve this 
thorny problem. Second, this instrument gives the 
investigator considerable control over the stimulus 
input to the S and allows the investigator to specify 
those aspects of the stimulus upon which the re- 
sponse was based. It is acknowledged that this al- 
teration of the Rorschach stimuli produces a qualita- 
tively different stimulus pattern to the S. However, 
the author was interested in ascertaining if specific 
types of response would show intra-individual sta- 
bility despite alteration in the external stimulus. The 
32 stimuli in this series were congruent with the fol- 
lowing Klopfer areas (Klopfer et al., 1954, pp. 70-
79):²


Rorschach
Plate      Klopfer Areas
I          (D 1) (D 2) (d 3 symmetrical)
II         (D 1) (D 2 symmetrical) (D 3 symmetrical) (d 1)
III        (D 2) (D 1) (D 3) (Dg symmetrical plus D 3)
IV         (D 1) (d 2) (whole blot minus D 1)
V          (2 2) (whole blot minus d 3 and d 1)
VI         (2 2) (D 4) (d 4)
VII        (D 4 symmetrical) (d 1)
VIII       (D 2) (D 1 plus D 3 plus D 4) (D 3 the center) (D 4 the bottom)
IX         (D 2 symmetrical) (D 5) (D 7)
X          (D 8) (D 17) (D 3) (D 1)


² The author has available for dissemination photo-
static reproductions of the 32 stimuli.
Scoring


The standard and modified Rorschach responses 
were scored in the same manner for the following 
content categories. 

Dynamic aggression. This category included re- 
sponses which involved the ascription of aggressive- 
destructive activity or a state of death or injury to 
animals, objects, or people. The category consisted 
of three subcategories: (a) aggression—animals or 
people fighting, arguing, angry, or in a belligerent or
aggressive posture, e.g., lions fighting, two ladies
arguing, an octopus about to snatch a fish, a panther 
about to leap on its prey; (b) destruction—explo- 
sions or destructive action involving inanimate ob-
jects, e.g., bombs exploding, volcano erupting, A-
bomb cloud, thrust of a rocket, landslide; the re-
sponses in this destruction category all involved
destructive action (inanimate movement) and the re- 
sponse “rocket” or “bomb” alone was not included 
in this category; (c) injury—any animal, person, or 
object seen as dead, wounded, mutilated, or dam- 
aged, e.g., dead dog, squashed bug. This latter cate- 
gory did not occur too frequently and the first two 
categories (aggression and destruction) made up 75% 
of the dynamic aggression score.

Static aggression. This category included percepts 
of objects, animals, or animal parts which are nor- 
mally regarded as aggressive but which did not in- 
volve the ascription of aggressive or destructive mo- 
tivation or activity to the perceived object. This 
category included the following subcategories: (a) 
aggressive objects—objects or animal parts normally 
regarded as aggressive, e.g., bullet, bomb, volcano, 
rocket, knife, teeth, claws, horns; (b) aggressive ani- 
mals—untamed animals normally regarded as aggres- 
sive, e.g., lions, tigers, panthers, bears, vampires; (c)
monsters—creatures of myth or fantasy commonly 
regarded as aggressive, e.g., monster, giant, witch. 
Thus, the response “bomb” or “monster” was scored 
as static aggression, while the response “bomb ex- 
ploding” or “monsters fighting” was scored dynamic 
aggression. Responses in which aggressive animals 
were engaged in nonaggressive activity were also 
scored static aggression, e.g., tigers walking, lions
climbing, etc.

Dynamic and static aggression differ from each
other in the degree to which the aggressive sym-
bolism is disguised or attenuated. The dynamic ag-
gressive response always involves undisguised aggres-
sive action or the end result of aggressive-destruc-
tive activity. The aggressive implications of the
static aggressive response are more disguised and
symbolic. That is, the responses “knife” or “tiger”
are less obviously aggressive in meaning than the
responses “knife tearing flesh” or “tiger devouring
a bird.” This variable of “degree of disguise” was
the primary basis upon which these two aggressive
categories were separated.

Anatomy. Any of the internal body parts of
animals or people or X rays of these internal body
parts were scored anatomy, e.g., stomach, spinal
column, X ray of the chest, lungs, kidney.


Dependence-orality. This category included im- 
agery which many clinicians normally regard as in- 
dicative of dependency motivation. This variable in- 
cluded the following two subcategories: (a) passive- 
dependent activity—animals or people in a passive 
posture or in a dependent activity; e.g., people lying 
down, people resting or sleeping, pigs suckling, dogs 
nursing, people supporting each other; (b) oral con- 
tent—responses in which people or animals were eat- 
ing or drinking and responses describing food or ob- 
jects directly associated with food or eating, e.g.,
people eating, people drinking, ice cream, fried 
chicken, people sitting in a restaurant, side of beef, 
pitcher of milk. 

Human or human detail. This category included
all images of humans or parts of humans, e.g., per-
son, man sitting, face of someone, child, hands, etc. 

Movement. All the Rorschach responses were coded 
separately for the standard categories of M, FM, 
and m as described by Klopfer et al. (1954). The 
present scoring of m includes Klopfer’s Fm, mF, 
and m. A total score (sum M, FM, m) was obtained 
by summing the three movement scores. 

Response length. The variable R referred to the 
number of spontaneous responses in each protocol. 

The interrater reliability for the scoring of the first
six categories was satisfactory; the over-all per-
centage of agreement between the author and an in-
dependent scorer was 92%.
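
The over-all percentage of agreement reported above can be illustrated with a minimal sketch. It assumes that each response receives a single category code from each scorer and that agreement is counted response by response; the function name and the example codes are hypothetical and are not taken from the study.

```python
def percent_agreement(scorer_a, scorer_b):
    """Percentage of responses to which two scorers assigned the same code.

    Assumption: one category code per response per scorer, compared pairwise.
    """
    if len(scorer_a) != len(scorer_b):
        raise ValueError("both scorers must code the same responses")
    if not scorer_a:
        raise ValueError("at least one response is required")
    matches = sum(a == b for a, b in zip(scorer_a, scorer_b))
    return 100.0 * matches / len(scorer_a)


if __name__ == "__main__":
    # Hypothetical codes for five responses (not data from the study).
    author = ["Dyn. Agg.", "H", "Anat.", "Stat. Agg.", "Dep.-Oral"]
    independent = ["Dyn. Agg.", "H", "Anat.", "Dyn. Agg.", "Dep.-Oral"]
    print(percent_agreement(author, independent))
```

Simple percentage agreement does not correct for agreement expected by chance, which is one reason it is usually read alongside the base rates of the categories.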


RESULTS 
Age and Sex Differences 


Table 1 shows the frequency of occurrence 
of the variables for the three standard Ror- 
schach protocols as well as the modified Ror- 
schach for males and females separately. 

There were no statistically significant age 
differences for the movement or content vari-
ables for the three standard Rorschach proto-
cols. Occurrence of sum M, FM, m showed a 
moderate but nonsignificant increase with age 
for both sexes. The increase in response length 
from Protocol 1 to Protocol 3 was significant 
for males only (p < .01; two tails). Since 
the modified Rorschach elicited more re-
sponses than the standard plates and involved 
different stimuli, the differences in response 
frequencies between Protocol 3 and the modi- 
fied Rorschach cannot be attributed solely to 
age changes. 

None of the variables produced significant 
sex differences for all administrations. On 
Protocol 3, the males had a larger sum M, 
FM, m score (p < .05; two tails) but this 
was due, in part, to the fact that the males 
produced longer protocols than the females. 




research had no a priori hypotheses about the 
outcome of the analysis, the data suggest 
several tentative conclusions. One of the pur-
poses of the study was to investigate the long
term stability of aggressive content. The dif-
ferential stability and statistical independ-
ence of dynamic


As mentioned earlier, except for protocol 
length 


Despite the fact that changes in values and
motives occur during these years, it is gen-
erally assumed that some ego processes do re-

Wilensky, and Mc-
and King (1958) indicate
that movement responses are produced by in-
aware of their mo-
tives, conflicts, and sources of


about these intrapsychic


rather than a motivational predisposition like


dependency or aggression. Since human move-


Most of the present content categories, it is


modes of thinking


anxiety and

ment, sum of all movement responses, dy-
namic aggression, static aggression, anatomy, 
dependent-oral imagery, and human plus hu- 
man detail responses. 

The results revealed that: 

(a) For the standard Rorschach protocols, 
number of responses, dynamic aggression, 
anatomy, dependence-orality, human move- 
ment, and sum movement showed low but 
statistically significant degrees of stability 
during the ages 13½ and 16½ (phi coefficients
ranged from .39 to .59). Static aggression and 
human percepts showed no evidence of sta- 
bility. 

(b) In comparing the last standard Ror-
schach with the modified Rorschach stimuli,
dynamic aggression and animal movement 
(for men) and human movement (for women) 
showed a significant degree of stability (phi 
coefficients of .60, .42, and .47 respectively). 

(c) Dynamic aggression, which usually in-
volved aggressive movement and contained 
undisguised aggressive content, was statisti- 
cally independent of static aggression, which 
did not involve aggressive action and con- 
tained more disguised aggressive symbolism. 
In addition, dynamic aggression occurred 
more frequently to chromatic cards while 
static aggression was elicited more often by 
achromatic stimuli. It was suggested that fu- 
ture scoring codes for aggressive content dif-
ferentiate between dynamic and static aggres-
sive responses.

(d) Although the stability coefficients for
human movement responses were low, this
variable showed the most consistent stability
during adolescence for the males. It was sug-
gested that human movement responses re-
flected an “awareness of motives and con-
flicts” and that this construct appeared to be
a relatively stable personality variable.
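
The phi coefficients reported in this summary can be illustrated with a minimal sketch. It assumes that each variable was dichotomized at each testing and that the two dichotomies were cross-tabulated in a fourfold table; the median cutting point, the function names, and the example scores are assumptions made for illustration and are not taken from the study.

```python
import math
from statistics import median


def phi_coefficient(a, b, c, d):
    """Phi for a fourfold table.

                    later high   later low
    earlier high        a            b
    earlier low         c            d
    """
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denom == 0:
        raise ValueError("phi is undefined when a row or column total is zero")
    return (a * d - b * c) / denom


def stability_phi(earlier, later):
    """Dichotomize two sets of scores at their medians and return phi.

    Assumption: scores above the median count as 'high'; the cutting
    point actually used in the study is not stated in this section.
    """
    if len(earlier) != len(later):
        raise ValueError("each subject needs a score at both testings")
    cut_early, cut_late = median(earlier), median(later)
    a = b = c = d = 0
    for x, y in zip(earlier, later):
        high_early, high_late = x > cut_early, y > cut_late
        if high_early and high_late:
            a += 1
        elif high_early:
            b += 1
        elif high_late:
            c += 1
        else:
            d += 1
    return phi_coefficient(a, b, c, d)


if __name__ == "__main__":
    # Hypothetical movement scores for eight subjects at two testings.
    first = [0, 1, 1, 2, 3, 4, 5, 6]
    second = [1, 0, 2, 1, 4, 3, 6, 5]
    print(round(stability_phi(first, second), 2))
```

Because phi depends only on the four cell counts, it is insensitive to the size of score changes within the high and low groups, which suits protocols of unequal length.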
MMPI FACTORS OF THE HARRIS AND THE
WIENER SUBSCALES


JAMES C. LINGOES 
There are essentially two main points of
view regarding the factor struct


permits diagnosis mainly in terms
of ‘neurotic’ or ‘psychotic,’ but not in terms
of type of neurosis or psychosis or other more


140). O


nal publication of the MMPI
number of special,


tionally derived scales have been p


Some of these new scales have proved helpful
in both clinical and nonclinical settings, e.g.,
Drake’s Social I-E scale (1946), Feldman’s 
Prognosis for Shock scale (1951), Barron’s 
Ego Strength scale (1953), Gough’s Dissimu- 
lation scale (1954), and Harris’s Subscales 
(1955), among others. Yet, however useful 
some of these scales may be, it is improbable 
that there are as many dimensions of per- 
sonality or functioning as there are existing 
scales. It seems equally unlikely, on the other 
hand, that as many scales as this could have 
been developed in such a variety of settings, 
on quite heterogeneous groups, for ostensibly 
different purposes, and using such seemingly
different criteria, and yet hold the view that
the MMPI is capable of measuring only two
basic factors. (b) Clinicians in practical set-
tings seem to operate under the assumption, 
in writing their MMPI interpretations, that 
the test is useful in tapping attitudes and 
traits as well as in separating patients from
normals and they attempt to rely upon a con- 
figural approach in doing this. (c) Previous 
factor analytic results may have been due in 
part to the selection of factorially complex, 
composite scales which may have tended to 
obscure rather than clarify the factor struc- 
ture of this test. A hypothetical example of 
how such a situation might arise is the fol- 
lowing. Let us suppose that di 


A depending upon the tests in- 
cluded for analysis, that a different factor 
structure would result according to whether 
one used a total score based on these four
tests or used subtest scores. One might in the 
me out with a general 
and in the latter instance 
with both a quantitative factor and common
factors for each of the skills involved, since 
the variance of the separate tests would be al- 
lowed to vary independently from the total 
test score. 

To test the multidimensional hypothesis, 
the investigator felt that the independently 
developed rational subscales of Harris (1955) 
and of Wiener (1948), to be described below, 
would provide an excellent opportunity, since 
these subscales represent two different hy- 
potheses about the MMPI structure. Both in- 
vestigators started with the standard MMPI 
clinical scales but classified the items in these 
scales differently. 


DESCRIPTION OF MMPI SUBSCALES 
FOR ANALYSIS 


The Wiener subscales were derived by sort- 
ing items from each clinical scale into two 
groups. Those items which were relatively 
easy to detect as indicating emotional dis- 
turbance were called the Obvious items, while 
the remaining items were called Subtle. There 
were two main criteria: (a) how frequently 
the item was scored in the given direction by 
the normative samples and (b) subjective 
estimation of the implied emotional disturb- 
ance of the item. Items which were infre- 
quently answered by normals were automati- 
cally considered to be obvious, e.g., all items 
appearing on the F scale. 

Since the Harris subscales have not yet 
been published, the rationale underlying their 
construction will be given in more detail. 
These subscales were derived in conjunction 
with a study of prediction of response to psy- 
chotherapy (Harris & Christiansen, 1946). 
Differences on the Pd, Pa, Sc, and Ma stand- 
ard MMPI scales were obtained between 
groups responding well and poorly to therapy. 
Neither group, however, was very high on 
any of these scales. Although one might have 
inferred something like “subclinical psychotic 
trends” in those responding poorly, there re- 
mained the possibility of achieving a more 
psychological definition of the differences by 
an inspection of differences on the items within 
these scales. Subscales were formed by group-
ing together the items within each scale which 
seemed similar in content or which seemed to 


reflect a single attitude or trait, i.e., item 
clusters were subjectively estimated. Upon 
scoring the two therapy groups on the vari- 
ous subscales, a number of differences emerged 
which led to an interpretation in terms of 
ego strength. Later research by Feldman 
(1951) and Barron (1953) on patients re- 
sponding well and poorly to various kinds 
of somatic and psychotherapy suggested that 
this interpretation was a sensible one. Fur- 
thermore, the clusterings that Harris made 
reveal a coincidence of overlap with Comrey’s 
analyses of the MMPI scales at the item 
level, notably on Hy (Comrey, 1957b) and 
on Pa (Comrey, 1958b), thus substantiating 
to a considerable extent the adequacy of 
Harris’s estimates. 

Eight of Wiener’s scales (HyO and HyS 
being omitted because of their near perfect 
correlations with the Admittance and Denial 
cluster scales of Little and Fisher (1952), 
which are to be subsequently analyzed), and 
28 of Harris’s subscales (1955) constituted 
the 36 variables of this study. The capacity 
of the IBM 701 program would allow for no 
more than this number; hence, some selection 
was necessary. This selection, however, in no 
way restricts testing the multidimensional hy- 
pothesis. Table 1, containing the various sub- 
scale names and descriptions, gives some idea 
of the content of these scales. 


HYPOTHESES 


There would appear to be only four pos- 
sible outcomes to a factor analysis of the 
Harris and the Wiener subscales (if one ex- 
cludes the unifactorial position), i.e., (a) two 
major factors corresponding to the psychotic 
and neurotic factors of Wheeler et al. (1951); 
(b) one large factor with high loadings on the 
Obvious scales and another large factor with 
high loadings on the Subtle scales, which 
would support the two-factor view of Wiener 
(1948); (c) six factors, one for each of the 
six standard scales, with subscales derived 
from the same standard scale appearing on 
the same factor, e.g., Paranoia-Obvious, Para- 
noia-Subtle, and Harris’s Paranoia subscales 
of Pa1, Pa2, and Pa3, which would imply a
scale homogeneity which the authors of the 
MMPI did not purposefully attempt to 
achieve; and (d) a classification cutting
across both the standard scales and the 
Wiener subscales, in conformity with the 
viewpoint of Harris and Lingoes (1955). Hy- 
pothesis 3 would not appear to be a serious 
contender, since various investigators have 
demonstrated the factorial complexity of the
standard scales, e.g., it is known that there
are two quite different kinds of items in the 
Hysteria scale and that they are negatively 
correlated (Little & Fisher, 1952). The fourth 
hypothesis is favored by this investigator be- 
cause of the considerations advanced at the 
beginning of this paper. 


TABLE 1

SUBSCALE DESCRIPTIONS

Scale   Description

DO      Depression-Obvious
DS      Depression-Subtle
D1      Subjective Depression: a negation of joy in doing
        things; pessimism, poor morale, and low self-
        esteem; complaints about psychological inertia
        and lack of energy for coping with problems
D2      Psychomotor Retardation: nonparticipation in
        social relations; immobilization
D3      Complaints about Physical Malfunctioning: pre-
        occupation with oneself
D4      Mental Dullness: unresponsiveness; distrust of
        one’s own psychological functioning
D5      Brooding: ruminativeness; irritability
Hy1     Denial of Social Anxiety: social introversion
Hy2     Need for Affection and Reinforcement from
        Others: implied in an (obtuse) denial of a
        critical or resentful attitude toward other
        people; impunitiveness; overly protested faith
        and optimism in other people
Hy3     Lassitude-Malaise: complaints about function-
        ing below par physically and mentally; effort-
        ful keeping up of a good front; need for atten-
        tion and reassurance
Hy4     Somatic Complaints: of a kind that suggest re-
        pression and conversion of affect
Hy5     Inhibition of Aggression: expressed by concur-
        rence with others; disavowal of violence
PdO     Psychopathic Deviate-Obvious
PdS     Psychopathic Deviate-Subtle
Pd1     Familial Discord: struggle against familial con-
        trol
Pd2     Authority Conflict: resentment of societal de-
        mands and conventions and parental standards
Pd3     Social Imperturbability: denial of social anxiety;
        [...]; denial of dependency needs
Pd4A    Social Alienation: feelings of isolation from other
        people; lack of belongingness; externalization
        of blame for difficulties; lack of gratification in
        [...]
Pd4B    Self-Alienation: lack of self-integration; avowal
        [...] stated; despondency
PaO     Paranoia-Obvious
PaS     Paranoia-Subtle
Pa1     Ideas of External Influence: externalization of
        blame for one’s problems, frustrations, fail-
        ures; in the extreme degree, persecutory ideas;
        projection of responsibility for negative feelings
Pa2     Poignancy: thinking of oneself as something
        special and different from other people; high-
        strung; cherishing of sensitive feelings; overly
        subjective; thin-skinned
Pa3     Affirmation of Moral Virtue: excessive gener-
        osity about the motives of others; righteous-
        ness about ethical matters; obtuse naïveté;
        denial of distrust and hostility
Sc1A    Social Alienation: a feeling of lack of rapport
        with other people; withdrawal from meaning-
        ful relationships
Sc1B    Emotional Alienation: a feeling of lack of rap-
        port with oneself; experiencing the self as
        strange; flattening or distortion of affect;
        apathy
Sc2A    Lack of Ego Mastery, Cognitive: the admission
        of autonomous thought processes, strange and
        puzzling ideas
Sc2B    Lack of Ego Mastery, Conative: feelings of
        “psychological weakness”; abulia, inertia,
        massive inhibition; regression
Sc2C    Lack of Ego Mastery, Defect of Inhibition and
        Control: a feeling of not being in control of
        one’s impulses, which may be experienced as
        strange and alien; at the mercy of impulse and
        feelings; dissociation of affect
Sc3     Sensorimotor Dissociation: a feeling of change
        in the perception of the self and the body
        image; feelings of depersonalization and es-
        trangement
MaO     Hypomania-Obvious
MaS     Hypomania-Subtle
Ma1     Amorality: a callousness about one’s own mo-
        tives and ends and those of other people;
        disarming frankness; denial of guilt
Ma2     Psychomotor Acceleration: hyperactivity, [...];
        flight from “inner life” and anxiety;
        pressure for action
Ma3     Imperturbability: affirmations of confidence in
        social situations; denial of sensitivity; procla-
        mation of independence from the opinions of
        other people
Ma4     Ego Inflation: feelings of self-importance to the
        point of unrealistic grandiosity

METHOD
Subjects 


The four samples of Ss used in this study con- 
sisted of 150 male and 100 female neuropsychiatric 
patients (58% were outpatients and 42% inpatients), 
and 100 male and 100 female community Ss. These 
four groups will be hereinafter referred to by the 
abbreviations: Pm, Pf, Cm, and Cf, respectively. 
Median ages and age ranges of these four groups 
are: 32, 9-65; 33, 14-63; 25, 19-47; and 29, 19-62, 
years, respectively. Patient Ss were randomly chosen 
from the Langley Porter Neuropsychiatric Institute 
MMPI files, the only criteria for selection being that 
not more than 30 items were left unanswered on the 
test and that the patient had been tested on the 
group form. Community Ss were for the most part 
drawn from the staff at the same institution on a 
voluntary basis, while the remainder were drawn 
from cases used in another study (Lingoes, 1957) 
and from college students at the University of Cali- 
fornia at Berkeley,² using the same criteria for in-
clusion. No attempt was made to match these groups 
on various variables nor was there any attempt to 
make the groups homogeneous. The implications of
this lack of matching and the heterogeneous nature 
of the samples will be discussed later. Suffice it to 
say here that the patient samples did not differ ap- 
preciably from each other in age or education. Both 
patient samples were in general older and of lower 
socioeconomic status than the community Ss. 


Procedure 


An independent factor analysis, employing Hotel- 
ling’s (1933) principal components method, was ap- 
plied to each set of correlations for the four groups 
of Ss. The squared multiple correlation (SMC) was 
used in the diagonal. This index is known to be 
the best universal lower bound for the communality 
(Guttman, 1940) and is calculated by taking each 
variable in turn as the dependent variable, com-
puting the multiple correlation of that variable with
the p-1 remaining variables, and squaring it.
The SMC is thus seen to be a measure of pre-
dictable or observed common test variance, i.e.,
variance common to the test itself and the remain- 
ing p-1 tests. In an empirical investigation, using 15 
communality estimates, Wrigley (in press) has found 
SMCs to be one of the best estimates of communali- 
ties in terms of stability, rate of convergence to, and 
discrepancy from, the true communalities. In dis- 
cussing the logical and practical issues of the com- 
mon-specific variance distinction in factor analysis 
and the problem of communalities, Wrigley (1957, p. 
94) has offered a number of cogent arguments favor-
ing the use of SMCs over that of communalities, e.g.,
“The problem of alternative sets of diagonal values
satisfying the theory then disappears. SMC’s are
probably less influenced by increasing the sample of


2 Gratitude is expressed to Alex Sheriffs for the use 
of his MMPI files. 


persons, since the revised values, though more ac-
curate, may be either higher or lower. Determining
the diagonal entries becomes separate from deter-
mining the number of significant factors. One disad-
vantage is the lack of a significance test . . . ,” which
has subsequently been provided by Tryon 3 and used
in this study to determine whether the factors ro-
tated were significant.
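
As a rough illustration of the SMC estimate discussed above, the following Python sketch computes the squared multiple correlation of each variable with the remaining p-1 variables and places these values in the leading diagonal of the correlation matrix. It relies on the standard identity SMC_j = 1 - 1/r^jj, where r^jj is the j-th diagonal element of the inverse correlation matrix. NumPy, the function names, and the three-variable matrix are assumptions made for illustration only; none of them comes from the study.

```python
import numpy as np


def squared_multiple_correlations(R):
    """SMC of each variable with the remaining p-1 variables.

    Uses the identity SMC_j = 1 - 1 / (R^-1)_jj, which requires a
    nonsingular correlation matrix R.
    """
    R = np.asarray(R, dtype=float)
    inverse_diagonal = np.diag(np.linalg.inv(R))
    return 1.0 - 1.0 / inverse_diagonal


def reduced_correlation_matrix(R):
    """Copy of R with SMCs substituted for the unities in the diagonal."""
    R = np.asarray(R, dtype=float)
    reduced = R.copy()
    np.fill_diagonal(reduced, squared_multiple_correlations(R))
    return reduced


# Hypothetical correlations among three tests (illustration only).
R_example = np.array([
    [1.0, 0.6, 0.5],
    [0.6, 1.0, 0.4],
    [0.5, 0.4, 1.0],
])

smc_example = squared_multiple_correlations(R_example)

# Latent roots of the reduced matrix; their sum equals the sum of the SMCs.
latent_roots = np.linalg.eigvalsh(reduced_correlation_matrix(R_example))
```

With only two variables the SMC reduces to the squared correlation between them, which gives a quick check on the function.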

A principal axes solution yields as many factors as
there are tests. Using SMCs in the leading diagonal 
instead of unities, however, gives rise to factors hav- 
ing negative latent roots. Only those factors with 
positive latent roots and with at least one signifi- 
cant loading, i.e. P less than .05, as determined by 
Tryon’s (1956) method were rotated. The method of 
rotation adopted was that of Kaiser (1958), called
the Varimax method, an analytic solution approach- 
ing orthogonal simple structure; i.e. the number of 
zero loadings for any particular factor are maxi- 
mized so that the factor variance is concentrated so 
far as possible in a few tests. The method operates 
on all possible pairs of factors and finds that angular 
transformation which maximizes the variance of the 
squared loadings, summed over the entire matrix, 
proceeding iteratively until convergence occurs. In 
order to preserve objectivity and make for com- 
parability, no attempt was made to improve upon 
the rotations by hand methods, although it was pos- 
sible that a better approximation to simple structure 
(using other criteria of Thurstone) might thereby 
have been achieved. 
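
The pairwise procedure described above can be sketched in Python as follows. This is a minimal illustration of the raw varimax criterion only: it omits the row normalization that Kaiser also proposed, and it is not the program used in the study. The function names, the convergence tolerance, and the small loading matrix are hypothetical.

```python
import numpy as np


def varimax_criterion(loadings):
    """Variance of the squared loadings, summed over all factors."""
    squared = np.asarray(loadings, dtype=float) ** 2
    return float(np.sum(np.var(squared, axis=0)))


def varimax_pairwise(loadings, max_sweeps=50, tol=1e-10):
    """Rotate every pair of factors in turn until the angles vanish."""
    L = np.array(loadings, dtype=float)
    p, k = L.shape
    for _ in range(max_sweeps):
        largest_angle = 0.0
        for a in range(k - 1):
            for b in range(a + 1, k):
                x, y = L[:, a].copy(), L[:, b].copy()
                u = x ** 2 - y ** 2
                v = 2.0 * x * y
                A, B = u.sum(), v.sum()
                C = np.sum(u ** 2 - v ** 2)
                D = 2.0 * np.sum(u * v)
                # Angle that maximizes the criterion for this pair.
                phi = 0.25 * np.arctan2(D - 2.0 * A * B / p,
                                        C - (A ** 2 - B ** 2) / p)
                c, s = np.cos(phi), np.sin(phi)
                L[:, a] = x * c + y * s
                L[:, b] = -x * s + y * c
                largest_angle = max(largest_angle, abs(phi))
        if largest_angle < tol:
            break
    return L


# Hypothetical unrotated loadings of six tests on two factors.
unrotated_example = np.array([
    [0.7, 0.5], [0.6, 0.6], [0.8, 0.4],
    [0.5, -0.6], [0.6, -0.5], [0.4, -0.7],
])
rotated_example = varimax_pairwise(unrotated_example)
```

Because each planar rotation is orthogonal, the communalities (row sums of squared loadings) are unchanged, and the criterion cannot decrease from one pair to the next.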

Reproducibility of the factors was investigated by 
using the Burt-Tucker 4 (Burt, 1948, p. 185; Tucker, 
1951) index of factor similarity. To provide some 
psychological content for the factors, items from the 
scales defining the factors were submitted to 10 psy- 
chologists with MMPI test experience for inspection 
and written comments as to what they thought the 
factors might be measuring. 
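
An index of proportionality of the kind attributed to Burt and Tucker is commonly computed as the sum of cross products of two loading profiles divided by the geometric mean of their sums of squares. The Python sketch below assumes that form; the function name and the loading profiles are hypothetical and serve only to show the range of the index.

```python
import numpy as np


def congruence(profile_a, profile_b):
    """Index of proportionality between two factor loading profiles."""
    a = np.asarray(profile_a, dtype=float)
    b = np.asarray(profile_b, dtype=float)
    return float(a @ b / np.sqrt((a @ a) * (b @ b)))


# Hypothetical loadings of the same four subscales in two samples.
sample_one = [0.88, 0.92, 0.90, 0.85]
sample_two = [0.83, 0.84, 0.84, 0.73]
similarity = congruence(sample_one, sample_two)
```

Profiles that are exactly proportional give an index of +1.00, and reversing the sign of one profile gives -1.00. Squaring an index of .707 gives approximately .50, which is the reasoning behind the cutoff mentioned in the footnote.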


3 One can determine the significance of any factor
loading $l_{ji}$ on any dimension or Factor $C_i$ for the $k$
dimensions retained, by computing the standard error
of $l_{ji}$ using the following formula:


$\sigma_{l_{ji}} =$


where $h_j^2$ is the communality of the variable in question
and $N$ is the sample size. Thus if the factor loading
lies outside the range $\pm 1.96\,\sigma_{l_{ji}}$, then the population
value could not be zero at the 95% interval of con-
fidence.

4 Burt and Tucker independently conceptualized
one possible solution to the problem of factor simi-
larity as involving an index of proportionality among
the factor profiles. These indices range from +1.00
to -1.00 and can be interpreted much as r in ac-
counting for percentage of variance. Accordingly, one
must obtain an index greater than .707 to insure 
that one factor is more like another than it is un- 
like that factor.


RESULTS
Number of Factors 


Fifteen factors accounted for 98% of the 
common factor variance in the two patient 
groups and for 96% of the variance in the 
two community groups.5 The number of fac-
tors extracted and the manner in which they
cut across the classifications of Wiener, Har- 
ris, and Hathaway and McKinley would 
strongly suggest that the MMPI is indeed 
factorially complex. It took five factors in 
each of the patient groups and seven in each 
of the community groups to account for only 
80% of the variance (based on the unrotated 
factor matrices). The first factor in each of 
the analyses accounted for approximately 51, 
46, 37, and 39% of the observed variance in 
the respective unrotated matrices. These re- 
sults would seem to suggest the untenability 
of the two-factor position. 


Factor Replication 


Seven factors were reproduced, i.e., had in-
dices of factor similarity of at least .70, in
all four analyses, four factors were replicated 
in three, and three factors appeared in two. 
Patients were more like one another than 
they were like community Ss (12 factors in 
common), and likewise there was greater con- 
gruence among the community Ss (10 factors 
in common); the same thing might be said 
for the intrasex comparisons (there were 10 
factors in common within each sex group). 
Agreement in factors is greater in this study 
than heretofore reported in the MMPI litera-
ture. This agreement was probably to a con-
siderable extent a function of objectivity in
computation, but to some degree it might also 
be partly spurious, since there exists the pos- 
sibility that the amount of item overlap not 
only increased the size of the correlations, 


but may have also introduced correlation 
among factors across groups.


5 Tables 2-7, containing indices (Table 2), factor
names (Table 3), means and standard deviations
(Table 4), scale intercorrelations (Table 5), unrotated
factor loadings (Table 6), and Varimax rotated factor
loadings (Table 7) for each of the four groups of Ss,
have been deposited with the American Documentation
Institute. Order Document No. 6119, remitting $1 7
for microfilm or $2.50 for photocopies.


Order of Factors 


To facilitate presentation, the order of fac- 
tors was determined first by consideration of 
factor stability and second by the relative 
sizes of the factors. In accord with this or- 
dering, new factor numbers were assigned, 
e.g., Factor I (which differs from the first 
factor of any particular study by this no- 
tation). The size of each factor was deter- 
mined as follows: based on the sum of the 
squared factor loadings summed across groups 
(when replicated), the percentage of the total 
observed common variance of the four groups 
was obtained, e.g., the variance of the four 
groups (based on the SMCs) was 120.26, 
while the total variance of the four groups 
on the first factor in each analysis (based on 
the sum of the squared factor loadings) was 
30.51, and therefore Factor I accounted for 
approximately 25% of the variance.
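
The arithmetic behind the 25% figure can be checked directly from the two sums quoted in the text. The variable names in this short Python sketch are illustrative only.

```python
# Sums quoted in the text for the four groups combined.
total_common_variance = 120.26   # based on the SMCs
factor_one_variance = 30.51      # sum of squared loadings on the first factor

factor_one_share = factor_one_variance / total_common_variance
print(f"Factor I share of common variance: {factor_one_share:.1%}")
```

The ratio is a little over one quarter, in line with the approximate value reported.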

Although there were both interdiagnostic 
and intersex differences, these were found to 
be for the most part a function of the size of 
the various factor loadings rather than major 
differences in the relative magnitudes or pat- 
terns of the loadings. Consistent differences 
across diagnostic groups and sexes will be 
mentioned when they occur, but only those 
factors will be presented which were repli- 
cated, based on the logic that greater confi- 
dence can be placed in them. Each of the 
factors will be related to the series of factor 
studies carried out by Comrey (1957a, 1957b, 
1958a, 1958b, 1958c) and Comrey and Marg- 
graff (1958) on the items of the MMPI clini- 
cal scales. Since there is an overlap between 
the items for Comrey’s factors and the items 
constituting the Harris subscales, the oppor- 
tunity for objectively but indirectly assessing 
similarity across factors and studies becomes
possible.


Manner of Presentation of Results


Results are presented in the following or- 
der for factors obtained in all four analyses: 
(a) the factor’s number and name; (b) the 
most highly loaded subscales on the factor 
with their loadings for the four groups, i.e., 
Pm, Pf, Cm, and Cf; (c) any additional sub- 
scales which have loadings greater than .39 
for at least two analyses; (d) the indices of 
factor similarity; and (e) judges’ descriptions
of the factor based on (b) above. For factors
which were replicated in at least three analy-
ses, Items (a) through (d) above are given with an
indication of the groups included appearing 
alongside of the factor name. 


Factors Found in All Four Analyses 


Factor I: General Maladjustment. There 
were seven scales with high loadings in all 
four groups: 

DO Depression-Obvious (.88, .89, .83, .83)
D1 Subjective Depression (.92, .92, .84, .85)
D4 Mental Dullness (.90, .90, .84, .82)
D5 Brooding (.85, .83, .73, .74)
Hy3 Lassitude-Malaise (.85, .90, .61, .57)
Pd4B Self-Alienation (.83, .68, .69, .49)
Sc2B Lack of Ego Mastery, Conative (.89, .85, .71, .58)


Other subscales with loadings greater than 
.39 for at least two samples were: 


D2 Psychomotor Retardation (.63, .62, .32, .31)
D3 Physical Malfunctioning (.57, .61, .24, .26)
Hy4 Somatic Complaints (.46, .43, .27, .26)
Social Alienation (.57, .41, .45, .31)
Pa2 Poignancy (.48, .46, .22, .36)
Sc1B Emotional Alienation (.75, .72, .09, .23)
Sc2A Lack of Ego Mastery, Cognitive (.78, .71, .43, .37)


The Pm factor best exemplified Factor I. 
Indices of agreement with the remaining three 
groups were: .99, .95, .94. 

The 10 judges reviewing the 69 items from 
the seven scales with the highest loadings 
(overlapping items were offered only once) 
agreed in describing this factor as one of gen- 
eralized anxiety, depression, guilt, pessimism, 
and anxious worrying. It is much like the first 
factor found in a number of other investiga- 
tions and may appropriately be called the 
General Maladjustment Factor or as Kasse- 
baum, Couch, and Slater (1959) prefer to 
call it, a factor of Ego Weakness vs. Ego 
Strength. This factor includes the majority 
of the items from Comrey’s (1957a, 1957b, 
1958c) first factors for the D, Hy, and Pd 
scales and his second factor of the Sc scale 
(Comrey & Marggraff, 1958).

Factor II: Denial of Social Anxiety. Five 
scales had high loadings in all four groups: 


Hy1 Denial of Social Anxiety (.82, .87, .81, .87)
PdS Psychopathic Deviate-Subtle (.69, .71, .70, .69)
Pd3 Social Imperturbability (.73, .83, .83, .87)
MaS Hypomania-Subtle (.39, .57, .55, .38)
Ma3 Imperturbability (.62, .80, .63, .77)


No other subscales had loadings greater 
than .39 in any sample. This factor was also 
best represented by the Pm group. Indices of 
agreement with the other three groups were: 
.94, .95, and .93.

The 47 items comprising the above five 
scales elicited such descriptions as: social 
ease, interpersonal dominance, extraversion, 
and social adaptability (middle scorers) and 
veiled aggression, social obtuseness, denial of 
social anxiety, and naiveté (high scorers). 
Both Factors I and II had items referring to 
anxiety, but in contrast to General Maladjust- 
ment, the items in the present factor bear 
upon inter- rather than intrapersonal vari- 
ables. This factor includes the majority of 
items from Comrey’s (1957b, 1958a, 1958c) 
second factor of the Hy scale, fourth factor 
of the Pd scale, and the first factor of the 
Ma scale. It is also of interest to note that 
this factor bears some resemblance to Kasse- 
baum et al.’s (1959) second factor of Intro- 
version-Extraversion. 

Factor III: Loss of Control. Four scales 
had high loadings: 


Sc2C Lack of Ego Mastery, Defect of Inhibition and
Control (.55, .71, .52, .73)
Sc3 Sensorimotor Dissociation (.37, .50, .11, .75)
MaO Hypomania-Obvious (.63, .73, .45, .65)
Ma2 Psychomotor Acceleration (.56, .85, .66, .55)


Only one other subscale had loadings greater 
than .39 in at least two samples, i.e., DS De-
pression-Subtle (-.20, -.57, -.18, -.40).
The Pm factor showed the highest indices of 
agreement with the other groups, i.e., .92, .88, 
and .93.

The consensus classified the 45 items from 
the first four scales into the areas of: loss of 
emotional and impulse control, restlessness, 
incipient ego disintegration, and possible body 
image distortion. This factor would appear to 
be measuring at the high end, the failure of 
neurotic defenses, and in the middle ranges, 
inadequate control mechanisms. The highest
factor loadings tended to appear in the two 
female groups, which might indicate a higher 
saturation of this factor among females. There 
was little congruence between these subscales 
and Comrey’s (1958c) and Comrey and 
Marggraff’s (1958) factors of the Ma and 
Sc scales.

Factor IV: Denial of Distrust and Hostil-
ity. High loadings (greater than .39) were
obtained on three variables: 


Hy2 Need for Affection and Reinforcement from Others 
(.63, .59, .78, .58) 

PaS Paranoia-Subtle (.90, .87, .93, .91) 

Pa3 Moral Virtue (.84, .92, .93, .89)


Indices of .96, .95, and .95 were obtained 
between the Cf factor and factors in the re- 
maining groups. Some indirect support for 
this factor can be adduced from Comrey’s 
(1957b, 1958b) analyses of the Hy and Pa 
scales, inasmuch as his third factors from both 
of these scales overlap in items with the Hy2
and Pa3 subscales.

Descriptions such as: naïveté, denial of dis- 
trust and hostility, dependence upon the good 
will of others, impunitiveness, tolerance, re- 
pressive and submissive behavior, were ob- 
tained from the 25 items constituting the 
above three scales. Both the present factor 
and Factor II involve repression, denial, and 
naïveté, but whereas the latter deals with in- 
teraction ease, the present factor was more 
specifically concerned with defenses against 
both hostile and aggressive impulses. 

Factor V: Familial Discord. The following 
two scales had the highest loadings:


Pd1 Familial Discord (.76, .74, .68, .72)
Sc1A Social Alienation (.46, .41, .50, .37)


. . . strivings, . . . misunderstood, . . . There
. . . first two scales. Some . . . items of Pd1 and
. . . factor on the Pd scale was evident, although
Comrey and Marggraff’s (1958) analysis of
the Sc scale split the Sc1A items among two
factors.

Factor VI: Inhibition and Apathy. Two
variables had high loadings on this factor:

DS Depression-Subtle (.51, .43, .69, .50)
D2 Psychomotor Retardation (.46, .49, .60, .71)

No other subscales had loadings greater
than .39 for at least two samples. The commu- 
nity S groups had higher loadings on this fac- 
tor than the patients. The Cf factor yielded 
indices of .86, .83, and .85 with the other
three factors. 

The 28 items from these two scales were 
described in the following terms: inhibition, 
overcontrol, rigidity, obstinacy, feelings of 
apathy and inertia, emotional constriction, 
etc. Ds items were distributed on two of 
Comrey’s (1957a) D factors. Although only 
two variables had sufficiently high loadings to 
define this factor, it is nonetheless a common 
factor according to the factor model employed 
in this study, i.e., using SMCs. However, one 
might prefer to call such factors as these near 
specifics (Wrigley, 1957). The psychological 
or statistical significance of such factors is, 
however, an issue apart from the common- 
specific distinction and such factors will be 
included if replicated, pending their further 
investigation. 

Factor VII: Social Nonconformity. Only
one scale had consistently high loadings in all 
four groups: 


Pd2 Authority Conflict (.69, .41, .71, .71)


The second highest loadings for the two
patient groups appeared on Ma . . . ,
with loadings of .56 and .77 for males and fe-
males, respectively. In the community groups
there was no consistency. Since the Pm factor
yielded the highest agreement indices with the
other factors, i.e., .81, .81, and .82, and since
the patients had a second scale comparable in
loadings to the first (Pd2), it was decided to
consider this a patient factor, defined by both
scales.

. . . agreement was evident in the com-
ments of the judges, e.g., rebelliousness, argu-
mentativeness, immaturity, hostility directed
specifically against authority, potential for
acting out, nonconformity, . . . , and
egocentricity. Only . . . items
from two scales represented this factor. Pd2
items did not cluster together in Comrey’s
(1958c) analysis of the Pd items.


Factors Found in Three Analyses 


Factor VIII: Hostility-Alienation-Projection
(Pm, Pf, and Cf). Two scales had high load-
ings in all three samples:


Pd4A Social Alienation (.47, .53, .66)
Pa1 Ideas of External Influence (.75, .89, .86)


Additional subscales with loadings greater 
than .39 were: 

PdO Psychopathic Deviate-Obvious (.33, .47, Ad)
PaO Paranoia-Obvious (.72, .83, .19)
Pa2 Poignancy (.41, .42, .14)
Sc1A Social Alienation (.51, .50, .31)

The Pf factor had indices of similarity of 
.95 and .84 with the other two factors. There 
were 29 items in the first two scales. This 
would appear to be primarily a patient factor. 

Factor IX: Somatic Complaints (Pm, Pf,
and Cm). There were only two scales with 
loadings greater than .39 in at least two 
samples: 

Hy4 Somatic Complaints (.66, .74, .65)
Sc3 Sensorimotor Dissociation (.46, .50, .78)

The Pm factor yielded indices of .89 and 
.71 with the other two factors. This too is
probably a patient factor. There were 33 
items in these two scales. 

Factor X: Poignancy (Pf, Cm, and Cf). 
Only one scale had consistently high loadings: 


Pa2 Poignancy (.67, .81, .73)


The Cm factor had indices of .80 and .89 
with the two female factors. In the commu- 
nity groups, PaO had the next highest load- 
ings (.59 and .71, respectively), while in the 
patient groups, this scale was associated with 
Pa1, which appeared on Factor VIII. There
were only 9 items in the above scale and this 
factor was considered to be a near specific. 

Factor XI: Poor Health (Pm, Cm, and 
Cf). The three groups had one scale in com- 
mon with high loadings: 


D3 Physical Malfunctioning (.63, .58, .48)


The Cm factor had an index of .83 with the 
Cf factor, but only a borderline index of .70
with the Pm factor. There were 11 items in
this scale. In addition to D3, the community
samples had loadings of .61 and .46 on Hy3
Lassitude-Malaise, and consequently this fac- 
tor was considered to be a nonpatient factor. 


Factors Found in Two Analyses


Three factors which accounted for 6.3% of
the variance collectively appeared too small
and doubtful to deserve anything more than
a mention. Factor XII had high loadings on 
MaO and Ma, for the community groups; 
Factor XIII had high loadings on MaS for 
the Pm and Cf groups; and Factor XIV had 
high loadings on Mas for the two patient 
groups. 


Discussion 


The present study of six standard MMPI 
scales has demonstrated a dimensionalization 
which is inconsistent with the two-factor posi- 
tions of both Wiener (1948) and Wheeler 
et al. (1951). Instead, the MMPI appears to 
have: (a) a complexity not coextensive with 
the presently constituted six scales, and (b) 
a greater simplicity than might be inferred 
from the 28 titles given to the Harris sub- 
scales (1955). Implications of the present 
findings will be discussed under the following 
topic headings: (a) the multidimensional po-
tentials of the MMPI, and (b) the “obvious-
subtle” distinction.


The Multidimensional Potentials of the 
MMPI 


The term potentials has been chosen ad- 
visedly, since a test does not per se possess 
dimensionality, but only in relation to a speci- 
fied population or set of operations which pro- 
duce the test responses. The results of this 
paper would lead one to conclude that the 
MMPI can reliably measure considerably 
more than two dimensions, e.g., neuroticism 
or psychoticism. 

Of the 15 factors extracted there are seven 
which have emerged from four analyses (ac- 
counting for approximately 64% of the total 
observed variance). There are a further four 
derived from three analyses, accounting for 
an additional 19% of the variance. Admit- 
tedly, the number of factors obtained is 
partly a function of the number of variables 
included and may be to some extent an arti- 
fact resulting from item overlap. Neverthe- 
less, to conclude that because the first two 
or three factors account for roughly half the 
variance that the test is just measuring this 
or that aspect of personality, e.g., sickness or 
general maladjustment, or is influenced pri- 
marily by some response set, is not justified 
on the present evidence.

Despite the increased confidence in the fac-
tors resulting from replication in a number of
heterogeneous samples, the logical possibility 
is here raised of committing what might be 
called the identity error, i.e., inferring identi- 
cal factors from identical covariance patterns. 
It may well turn out that different groups re- 
spond in like manners to the same items for 
entirely different reasons, either because the 
items have different meanings to different in- 
dividuals or that the items themselves are 
complex factorially and have differing validi- 
ties. Scales are being developed based on the 
replicated factors to test this hypothesis on 
various groups of Ss. 


The “Obvious-Subtle” Distinction 


An inspection of the rotated Varimax load- 
ings on the eight Wiener scales revealed the 
O scales were scattered over at least three 
factors in all four analyses. In general, the 
same findings applied to the S scales. Further- 
more, as in the case of the PaO items, this 
scale was associated with different factors, 
i.e., with Pa1 in the patient samples and with
Pa2 in the community samples. If the Ob-
vious and Subtle scales from Hysteria had
been added, HyO would have probably ap- 
peared on Factors I, IX, and XI, while HyS 
would probably have come out on Factors II 
and IV. Whatever else the Obvious-Subtle 
items might be measuring, Wiener’s conclu- 
sions, based on the intercorrelations among 
these scales and other variables, do not fully 
explain the above results, i.e., “They (the
Obvious and Subtle scales) seem to have the 
same meaning and to perform the same func- 
tions for each scale” (Wiener, 1948, p. 169).6
Wiener’s contention would seem to be on 
firmer ground in respect to the O scales, since 
they do yield similar sign patterns across a 
large number of factors (they are positively 
interrelated). It would appear, however, that 
an analysis of his scales in the context of the 
Harris subscales makes previously considered 
specific variance now common. This addi- 
tional source of common variance is, more- 
over, greater than that attributable to obvi- 
ousness per se, i.e., content does seem to make
an important difference.


6 Italics and parentheses present author's. 


It would appear from the present evidence 
that obviousness and subtleness may have 
different as well as common meanings and 
functions within each scale and that the dis- 
tinction Wiener has made, while useful in in- 
vestigating response sets, must be accordingly 
modified to admit of many kinds of obvious 
and subtle items. This conclusion would ap- 
pear to be supported by that of Hanley 
(1959) in respect to his investigation of ac- 
quiescence. 


Conclusions 


It can be said that this study has demon- 
strated the multidimensional potential of the 
MMPI and has lent indirect support to the 
configural approach to interpretation ad- 
vanced by the authors of this test. Although 
item overlap may have introduced extra 
sources of variance, thus inflating the SMCs, 
it is felt that the subscale approach used here 
is a valuable one for clarifying factor struc- 
tures. Because only six of the clinical scales 
were analyzed, the inclusion of other MMPI 
scales might have modified, clarified, or in- 
troduced factors to some undetermined de- 
gree. One might expect, however, that Pt 
would have appeared on Factor I and that 
Hs would probably have split on Factors III, 
IX, and XI had they been included. 

It is believed that this study complements 
that of Comrey to the extent that some ob- 
jective evidence can be adduced that the fac- 
tors he obtained within scales are similar from 
scale to scale. This evidence is, of course, in- 
direct, since while there is congruence be- 
tween some of his factors and the Harris sub- 
scales, the correspondence is far from perfect. 
Factor scales to be developed based on this 
analysis are intended to supplement and not 
to replace the standard MMPI scales, al- 
though they might well replace the Harris 
subscales as an interpretative aid. 


SUMMARY 


This investigation was concerned with test- 
ing the merits of the two-factor versus the 
multifactor hypothesis of the MMPI as rep- 
resented by Wiener and by Harris, respec- 
tively. Harris classified MMPI items under 
28 categories, while Wiener chose only 2, i.e.,
Obvious and Subtle, as an adequate descrip-
tion of the test’s complexity.

From the 36 subscales analyzed (represent- 
ing six clinical scales) in four samples of sub- 
jects (patients and nonpatients, males and 
females), seven factors were replicated in 
four analyses and four in three. The 9 of 
these 11 factors which could be readily identi- 
fied were: General Maladjustment (I), De- 
nial of Social Anxiety (II), Loss of Control 
(III), Denial of Distrust and Hostility (IV),
Familial Discord (V), Inhibition and Apathy 
(VI), Social Nonconformity (VII), Hostility- 
Alienation-Projection (VIII), and Somatic 
Complaints (IX). 

It was concluded on the basis of the analy- 
ses that: (a) reliably more than two dimen- 
sions can be demonstrated; (b) the “obvious- 
subtle” distinction, while having some perti- 
nence for the study of response sets, cannot 
fully explain the present factor results and 
that one must admit of many kinds of obvi- 
ousness and subtleness; (c) while the present 
study is far from definitive, the fruitfulness 
of performing analyses on the basis of sub- 
scales rather than composite scales is sug- 
gested; and (d) evidence was offered relating 
the series of factor studies carried out by 
Comrey at the item level to the scale analysis
of this study.


THE RESULTS OF PSYCHOTHERAPY WITH CHILDREN:
A REVALUATION


Recently Levitt (1957) published a survey
of the studies concerning the effectiveness of 
psychotherapy with children on the model of 
Eysenck’s (1952) now classic enquiry with 
adults. Levitt concludes that the published 
figures do not support the contention that 
psychotherapy with children is more effective 
than nonintervention. His baseline data are 
derived from two studies: that by Witman 
and Keller (1942) and that by Lehrman et al. 
(1949). Averaging these studies shows that 
untreated controls have an improvement rate 
of 72.5%, whereas a survey of 18 studies 
of the outcome of psychotherapy with chil- 
dren shows an average improvement rate of 
67.05%. More strictly comparable with the 
baseline data studies is the averaging by 
Levitt of 17 follow-up studies of the effects 
of psychotherapy on children. These yield a 
figure which shows that 78.22% of the cases 
were improved. 

Levitt’s article is a challenge to all workers 
in the field of child psychotherapy, indicating 
as it does the relative ineffectiveness of this 
field of endeavor. Levitt is careful to point 
out that it is not proven that psychotherapy 
is ineffective; but the converse, that psycho- 
therapy is effective, is also not proven. 

The present study was undertaken as an 
attempt to isolate from the published litera- 
ture some of the factors contributing to suc- 
cess or failure in psychotherapy as rated at 
the close of treatment. Follow-up studies were
excluded since criteria for follow-up success 
were not detailed in any definable way. 


THE SOURCES OF UNCERTAINTY


The problem of establishing a baseline. At 
this juncture the writer wishes to note the 
difficulty of arriving at baseline data for the
evaluation of the effectiveness of psychother-
apy. The method used in the two studies
(Lehrman et al., 1949; Witman & Keller, 
1942) accepted by Levitt has serious draw- 
backs. These studies used as controls cases 
who had made an approach to the clinic and 
had been put on the waiting list but who, for 
one reason or another, did not accept the 
offer of treatment when it was later made 
available. Levitt says that such cases were 
in all respects similar to treated cases, except 
for the fact that they received no treatment. 
While the published literature contains no 
studies where the controls have been ob- 
tained by any other method (so that no bet- 
ter statistics are available as a basis for com- 
parison), it is, however, not entirely a valid 
method of obtaining controls. 

For one thing, too little is known about the 
factors intervening between first clinical con- 
tact and later acceptance for treatment, and, 
for another, it is common experience at all 
clinics that many cases on the waiting list 
clear up before treatment can be given. Too 
little is known about the dynamics of this, 
but at least two factors are known to account 
for it: (a) cases where the approach to the 
clinic is made in a temporary crisis, which 
subsides while the case is pending, and (b) 
cases where a healing dynamic process is ini- 
tiated by the parents’ recognition of their need 
for help, so that by the time help is available 
it is no longer needed. 

As all these factors are operative for both 
the treated and untreated groups, is it not 
possible that the treated groups represent the 
more serious, intractable problems, while at
least a proportion of the untreated groups are 
minor or transient problems? 

However, in the absence of any better sta-
tistics, these must be accepted, albeit with
reservations, whose very nature demands that 
conclusions drawn from them should be highly 
tentative. 

Comparison of the studies showing high 
success with those reporting low success. The 
four studies reporting the highest percentage 
of success (Cohen & Davis, 1934; Irgens, 
1936; Newell, 1943; Witmer & others, 1933) 
were compared with the four studies report- 
ing the lowest (Brown, 1947; Christianson, 
Gales, & Coleman, 1934; Cunningham, West- 
erman, & Fischhoff, 1956; Maas, Kahn, 
Stein, & Sumner, 1955).* This brought to 
light several features that were blurred in 
Levitt’s (1957) treatment of the data. 

The influence of the date of the study. 
Most unexpectedly, it was found that the 
cases treated in clinics in the high success 
studies all date from 1931-1932 despite the 
divergence of dates of publication; whereas 
three of the four low success studies date 
from after 1945. The one exception among 
the low success studies (Christianson et al., 
1934), which also dates from 1931-1932, is 
differentiated from the other studies, which 
all deal with major clinics in major cities, by 
being concerned with the northern New Jer- 
sey clinics—i.e., the possibility must not be 
ignored that this study may reflect a lower 
level of service than the others. 

The entire 18 studies were then arranged 
into two categories, dichotomized at the av- 
erage level of success (67%), with the fol- 
lowing findings as regards dates: 


Highs (over 67%) 


Author Percentage Date 
85.7 1936 
Irgens 85.0 
Cohen and Davis 80.7 1934 
Newell 79.2 1943 
Witmer & others 79.0 1933 
Lee and Kenworthy 78.6
Burlingham 160
Barbour 74.3 1951
Hubbard and Adams 73.0
Reid and Hagan 72.8 1952
Carpenter 67.7 1939


Lows (under 67%)


Author Percentage Date
La More 66.0 1941
Lehrman et al. 65.3 1949
Albright and Gambrell 65.0 1938
Christianson, Gales, & Coleman 62.7 1934
Cunningham, Westerman, & Fischhoff 59.8 1956
Brown 59.3 1947
Canaday 47.8 1940
Maas, Kahn, Stein, & Sumner 43.1 1955


From immediate inspection it can be seen 
that, of the Highs, 7 of the 10 are stud- 
ies prior to 1940. Of the studies dated later 
than 1940, one (Barbour, 1951) is a British 
study and not strictly comparable with the 
other United States studies; another (Newell, 
1943), though published in 1943, is con- 
cerned with data gathered 10 years earlier, 
in 1933; and a third (Reid & Hagan, 1952) 
is related to residential treatment while all 
the others are concerned with nonresidential 
treatment. 

Among the Lows, on the other hand, only 
two of the eight studies are dated 1940 or 
earlier. One of these (Christianson et al., 
1934) has been discussed above and is not 
regarded as strictly comparable with the 
others in this group. The other (Canaday, 
1940) has not been available for detailed 
study, for reasons given above. 

Consequently, if the Barbour (1951) and 
the Reid and Hagan (1952) studies are 
omitted from the Highs for the reasons given, 
and the Christianson et al. (1934) study 
from the Lows, one finds that all the Highs 
date from before 1939, and six out of seven 
of the Lows date from after 1940. 

The problem of length and intensity of 
treatment. It was remarkable how little in- 
formation could be obtained about the im- 
portant questions: (a) How much treatment 
did each case receive, and (b) Was the per- 
son concerned with the treatment of the 
child a qualified psychotherapist?² This is
particularly curious in view of the vast body
of information that was reported anent presenting
symptoms, age, sex, socioeconomic status, parental
attitude, and other background variables. Only one
study that was reviewed (Maas et al., 1955) included
a breakdown of the success-failure percentages in
terms of length of treatment. This study, which
reports the lowest over-all improvement rate (only
43.1%), shows that successful outcome is directly
related to length of treatment.


                                                      Percentage
No. of Interviews                              N      Improved
None or single family contact                  137    [illegible]
Brief service (2-9 interviews)                 77     22.1
Short-term treatment (10-39 interviews)        144    53.8
Long-term treatment (40 or more interviews)    144    80.2


[...]se lines: “Two in-[...]

Related to the problem of length of treatment is,
of course, that of the qualifications of the
therapist. Once again there is a dearth of
information, but in the only study which does
specify the type of skills of the therapist, the
resulting information is interesting. Witmer
and students (1933), who report an overall
improvement rate of 79.0%, also broke down
the figures to indicate whether the therapist
was a psychiatrist or a social worker. Her
findings are not as dramatic as Maas’, showing
percentages varying from 73 to 82 for
both psychiatrist and social worker for less
than 15 interviews; but at over 15 interviews
the psychiatrist achieves 97% success (N =
30), and the social worker 84% (N = 208).

2 The writer is aware that the term “psychotherapist”
is open to various divergent interpretations. It
is here being used to refer to anyone having
specialized training in child psychotherapy, irrespective
of theoretical approach, and including psychiatrists,
clinical psychologists, and child analysts, but
excluding social workers, teachers, etc.

The problems of the inclusion of unsuitable 
cases. It was noted that, where information 
was included about presenting symptoms, in 
every instance there were included small 
numbers of cases that are presumably either 
incurable (e.g., mongoloidism) or curable by 
somatic medicine (e.g., endocrine dysfunction- 
ing). Though the numbers are never large 
(the percentages are never in excess of 10% 
of all cases), their exclusion slightly improves 
the percentages for successful treatment. 


DISCUSSION 


In a sense, Levitt (1957) has put the con- 
temporary child guidance clinic to the test 
and found it wanting. But is this in fact 
so? Two main arguments suggest it is not. 
First, there are serious inadequacies (detailed 
above) in the data relating to the recovery 
rate of untreated children which render them 
suspect as a basis for comparison with treated 
cases. Secondly, the studies straddle a period 
in which child guidance clinics have not only 
been growing and modifying their practices 
but also adjusting to rapid social changes in 
the community as a whole, so that all other 
considerations apart (such as lack of uni- 
formity of procedures in individual child 
guidance clinics), it is unsatisfactory to ar- 
rive at conclusions based on the amalgama- 
tion of these studies, for amalgamation treats 
them all as equal; whereas, as has just been 
suggested, it is more probable that a close 
examination will reveal that child guidance 
practice in, say, 1933, had little in common 
with that obtaining in 1950 or 1955. How- 
ever, his study does show that present-day 
child guidance clinic practices are not highly 
effective in dealing with children’s psycho- 
logical malfunctionings, though it does ap- 
pear that they were considerably more effec- 
tive in the early 1930’s than they are now. 
Why should this be so? 

No definite and final answer can be given 
here, but is it not possible (from the internal 
evidence of the various studies reviewed) that 
during the past three decades child guidance 
clinics have undergone a variety of changes 
both in the nature of the service rendered to 
the community and, even more importantly, 
in the severity of the disturbances they at- 
tempt to deal with? 

It is suggested that during the past two 
decades, child guidance clinics have increas- 
ingly been dealing with deep-seated cases of 
disturbances requiring extensive psychother- 
apy and less with relatively minor cases that 
can be treated by manipulation of the
environment.³ Probably there is a multiplicity of
reasons for this change, including increasing 
public awareness of severe childhood disturb- 
ance, increasing availability of trained thera- 
pists, possibly also an increased actual inci- 
dence of more severe disturbance, reflecting 
the social dislocation in the past two decades, 
and possibly many other factors as well. 

It is the writer’s contention that the ques- 
tion “How effective is psychotherapy with 
children” is still unanswered. 


SUMMARY 


In an attempt to throw further light on 
Levitt’s (1957) findings on the apparent in- 
effectiveness of psychotherapy with children, 
the unsatisfactoriness of the baseline data 
was discussed. The literature was re-reviewed, 
and the method of extreme groups was applied.
The four studies reporting the highest percentage
of success were contrasted with the four reporting
the lowest. It was found that:

1. The studies reporting high percentages
of success relate to cases treated during the
years 1931-1932, whereas three of four studies
reporting low percentages are postwar
studies. This relationship was found to hold
good when a total of 18 studies were
dichotomized at the average percentage.

2. There was little agreement among the
studies regarding factors responsible for higher
successes, except that length of treatment was
related positively to successful outcome, as
was also the nature of the skills of the
therapist.

3. The wide divergences in criteria for
success, and methods of treatment, render the
averaging of a large number of studies an
unsatisfactory method of evaluating the
effectiveness of psychotherapy with children.

It is contended that, as yet, there is no
satisfactory answer to the question of how
effective psychotherapy is with children.

3 This is borne out to some extent by the fact that
early studies make [...] mention of psychotic
chil[...]. Also, where treatment is specified, earlier
studies rely heavily on environmental manipulation,
while later studies [...]ing numbers of interviews
with the patient himself.
REPLY TO HOOD-WILLIAMS 


EUGENE E. LEVITT 
Indiana University Medical Center 


The commentary by Hood-Williams (1960) 
casts some interesting and thought provoking 
light on my review of the results of psycho- 
therapy with children (Levitt, 1957b). Hood- 
Williams’ contentions are certainly worthy of 
discussion, but they hardly constitute a head- 
ing so encompassing as “revaluation,” as I 
shall attempt to demonstrate. 

The influence of the date of the study. If 
one accepts Hood-Williams’ analysis, its effect 
is to emphasize the inference that treatment 
has been ineffective. We must allow that the 
work since 1945 is more representative of 
what we now call psychotherapy than work 
done previous to that time. Hood-Williams 
suggests, as an explanation for the date fac- 
tor, that clinics are now dealing with more 
serious problems than they used to. Possibly. 
It appears more likely, though, that the lower 
recent improvement rate is a function of ad- 
vances in diagnostic methods, greater knowl- 
edge of personality dynamics, and increased 
research sophistication. 

On the other hand, one should not hasten
to accept Hood-Williams’ analysis. In my
Table 5 (Levitt, 1957b, p. 194), I pointed to
a potential positive relationship between
outcome and number of points used in the
outcome rating scale. All of the reports using
4- and 5-point scales show above average
improvement rates, and all but one were
carried out prior to 1936. Both of the reports
using dichotomous scales have below average
improvement rates; they were reported in
1955. Thus there are two possible causal
factors here.
trol, and Hood-Williams is in error when he 
says that there have been no other methods 
published. At least five control techniques 
have been employed in the past (cf. Levitt, 
Beiser, & Robertson, 1959). 

The evidence accumulated since the publi- 
cation of my 1957 paper suggests that the 
idea that the treated cases are serious dis- 
turbances, while the defectors are transient 
disorders, is also erroneous. One study (Levitt, 
1957a) shows that treated and defector child 
cases do not differ on some 61 factors, in- 
cluding two clinical estimates of severity of 
disturbance, and eight other factors relating
to symptoms. A second study (Levitt, 
1958a) reported that expert judges found no 
difference in severity of symptoms between 
treated and defector child cases, judging from 
case records. The mean severity ratings, based 
on a five-point scale, were 2.98 for treated 
cases and 3.02 for defectors. 

Follow-up interviews with parents of 142 
defector cases (Levitt, 1958b) disclosed that 
only 13% attributed defection to improve- 
ment in the child’s symptoms. Over half 
blamed either the clinic or environmental cir- 
cumstances. In an earlier telephone follow-up 
of 40 cases (Inman, 1956) 14, or 35%, gave 
improvement as a reason for defection. How- 
ever, all but three of these parents saw fit to 
volunteer at least one other reason as well,
which hints at some defensiveness about the
improvement explanation.

Putting the two studies together, 18% of 
the parents gave improvement in the child as 
a reason for defection. This percentage does 
not seem large enough to warrant the state- 
ment that pretherapy remission distinguishes 
treated cases and defectors, though it may 
have affected the intergroup comparison to 
some extent. However, we have no independent
evidence to suggest that the same proportion
would not hold for the treated cases as 
well. A parent might well feel that the child 
had improved and yet not see fit to break 
contact with the clinic. 
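
The pooled figure of 18% can be checked with a short calculation. The sketch below is illustrative only. It assumes that "13%" of the 142 interviewed parents corresponds to about 18 cases (the exact count is not given in the text), and the variable names are hypothetical.

```python
# Pooling the two follow-up studies cited above.
# Assumption: the 13% figure for 142 parents is converted to a case count
# by rounding; the exact count is not reported in the text.
levitt_n = 142
levitt_improved = round(0.13 * levitt_n)

# The telephone follow-up reports its count directly: 14 of 40 cases.
inman_n = 40
inman_improved = 14

# Pooled proportion of parents who gave improvement as a reason for defection.
pooled = (levitt_improved + inman_improved) / (levitt_n + inman_n)
print(f"pooled proportion citing improvement: {pooled:.1%}")
```

Under that rounding assumption the pooled proportion lands within a percentage point of the 18% quoted in the text.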

I do not claim that any of these studies or 
lines of reasoning is conclusive, but taken to- 
gether, their weight seems opposed to the hy- 
pothesis that the defector cases are less dis- 
turbed as a group than cases who go on to 
receive treatment. 

The problem of length and intensity of
treatment. It was not my intention to deal
with these factors in my review, primarily
because the needed information was lacking in
most reports, as Hood-Williams notes. The
problem is certainly noteworthy, especially
since it is by no means as simple to grapple
with as it might appear at first glance.
Consider Maas, Kahn, Stein, & Sumner’s (1955)
table presented by Hood-Williams. If we
accept these data as representative, and
compare them with the defector results, the
inference is not simply that “successful outcome
is directly related to length of treatment.”
The data show that improvement is a function
of at least 40 treatment interviews; a
lesser number actually makes the patient
worse. Would the practicing psychotherapist
care to support the contention that 35 treatment
interviews is worse than no treatment
at all? Is there an hypothesis that could
explain this peculiar phenomenon? And why is
it that only 7% of Maas’ sample having
no contact, or only one family contact, with
the clinic showed improvement, while the figure
is 10 times as great for approximately the
same degree of contact among defectors? 

Attempts to determine the relationship between
outcome at close of treatment and [...]
[inter]views are confounded by a [...]. [...]enty
interviews usually span [...] period than five
interviews. [...]ive relationship appear, we
cannot be certain that it is [...] simply time
alone, quite apart from [...] [inter]views. This
point i[...] [...]ncreases in im[...] without
reference [...]

I agree entirely with Hood-Williams that
the matters of intensity and type of treatment,
and the qualifications of therapists are
potentially significant, and should be thoroughly
discussed in any evaluation study. (In
the review, I referred to the “generally poor
caliber of methodology and analysis” in the
studies covered therein.) The fact that this
kind of information is usually lacking suggests
that one should be circumspect in drawing
inferences from the review. It does not
impeach the findings. 

The character of the evaluated treatment is 
taken up in some detail in the evaluation 
follow-up study at the Institute for Juvenile 
Research in Chicago (Levitt et al., 1959). 
We report that almost half of the cases were 
treated by students or staff members with 
one year of experience or less. Only one-third 
were seen by therapists with more than three 
years of experience. A third had to switch 
therapists at least once during the course of 
therapy, and some had been handled by three 
or even four therapists. Ninety per cent of 
the individuals were treated on a once-a-week 
basis, and only 15% by psychiatrists. 

If similar conditions of therapy have been 
prevalent among child guidance clinics, then 
Hood-Williams’ point might have greater im- 
port. But we do not know, and probably will 
never know. In fact, we do not even know for 
certain that experienced therapists will be 
more effective, though it seems to be a plau- 
sible hypothesis. 

Hood-Williams has raised some reasonable 
points. If his paper serves to focus additional 
attention on the need for continuing evalua- 
tion of our therapeutic efforts, and on the 
necessity for tightening up methodology and 
technique, then its publication is amply justi- 
fied. But his conclusions concerning the re- 
view of results are too sweeping and incau- 
tious for the limited and equivocal analyses 
which he carried out. 
DEVIANT RESPONSES IN COLLEGE ADJUSTMENT CLIENTS: 
A TEST OF BERG'S DEVIATION HYPOTHESIS ¹


AUSTIN E. GRIGG AND JOSEPH S. THORPE 


University of Texas 


Berg, in a series of provocative papers 
(1955, 1957, 1958), has insisted that indi- 
viduals when taking almost any type of non- 
achievement test make many responses which 
are similar to responses made by most others, 
but they also make responses which are un- 
like those of others in the general population 
and are more like responses made by mem- 
bers of some special subpopulation to which 
the particular individual belongs. Thus, some
individuals respond to test items of various
kinds in a manner more typical of lawyers
than of engineers or of the general population.
Some make responses more typical of
psychotics than of psychopaths or of the
general population.

Berg believes that the particular test content
is not important, but that the use of
clearly delineated criterion groups and an
analysis of response tendencies peculiar to
the particular group are the basic essentials
in using deviant responses to study personality.
He hypothesizes that the tendency to
give deviant responses is a generalized one
and is exhibited in situations other than the
testing situation. In regard to personality
maladjustment, he postulates, in effect, that those
who give maladaptive responses in everyday
living will give “deviant” responses when
scored against a key of typical responses for
most any kind of psychological test. 

Using Berg's approach, Barnes [...] MMPI
cor[...] Scale 7 and .93 with Scale 8. [...]
this same author (1955) [...]minate between a large
sample of controls and psychiatric patients
on the basis of how the subjects responded
when checking personal preferences for a
small number of abstract geometrical designs.
Barnes also studied clinical subgroups by the
deviant response method.

1 This research was supported by funds awarded
to the University of Texas Testing and Counseling
Center by the Hogg Foundation for Mental Health.

Although Berg and his students are cur- 
rently championing the deviant response ap- 
proach, it should be remembered that as 
early as 1910, Kent and Rosanoff used a 
“deviation” approach when they altered the 
Jungian word association technique and ap- 
praised results in terms of common and un- 
common associations to stimulus words as 
diagnostic of personality maladjustment. 

Following Berg’s reasoning, if one takes a 
normal sample of college students and studies 
the common and uncommon responses they 
make when asked to describe themselves on 
an adjective check list and then constructs an 
adjective list from the most commonly and 
the most rarely selected words, one would pre- 
dict that with a new population those who be- 
come therapy clients would check more of 
the rare words and fewer of the common 
words than would the nonclient members of 
the population. The present study tests the 
validity of this prediction. 


METHOD 


In February 1958, during their regular freshman
testing sessions, the entering freshmen checked their
self-descriptions on the 300-item Gough Adjective
Check List (1955). An item analysis was made of
the responses by the 181 males and 92 females to
ascertain the most commonly checked adjectives and
the least commonly selected adjectives. The original
analysis was made separately for each sex, but it
was observed that certain “sex-biased” adjectives
could be eliminated from the data and the results
would be applicable to both sexes. (The word
“sweet,” e.g., was commonly selected by girls but
rarely by boys.) Seventy-two items of commonly
checked and uncommonly checked adjectives were
selected for the revised form of the test by including
all adjectives selected by 84% or more of the
freshmen and also all adjectives selected by fewer
than 16% of the group. The final selection of 72
adjectives consisted of 33 commonly selected words
and 39 uncommonly selected words. 
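
The selection rule just described (keep adjectives endorsed by 84% or more, or by fewer than 16%) can be sketched in a few lines. This is a minimal illustration, not the authors' procedure as run: the function name is hypothetical and the endorsement rates are invented for the example.

```python
def select_key_items(endorsement, high=0.84, low=0.16):
    """Split adjectives into commonly and uncommonly selected sets.

    `endorsement` maps each adjective to the proportion of the norm
    group that checked it. The cutoffs follow the text: 84% or more
    counts as common, fewer than 16% as uncommon.
    """
    common = sorted(w for w, p in endorsement.items() if p >= high)
    uncommon = sorted(w for w, p in endorsement.items() if p < low)
    return common, uncommon


# Hypothetical endorsement proportions (not data from the study).
example = {
    "friendly": 0.91,
    "capable": 0.84,
    "moody": 0.40,
    "dull": 0.16,
    "cruel": 0.03,
}

common, uncommon = select_key_items(example)
print("common:", common)
print("uncommon:", uncommon)
```

Adjectives falling between the two cutoffs ("moody" and "dull" in this made-up example) drop out of the key entirely, which is how a 300-item list reduces to a much shorter form.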

In September 1958, this revised 72-item adjective 
check list was given to approximately 1,400 enter- 
ing freshmen. The freshmen were asked to check the 
adjectives that best described them. At the close of 
the academic year this pool of data was divided into 
those freshmen who had reported during the year 
for psychiatric help at the Student Health Center or 
to the University of Texas Testing and Counseling 
Center for personal adjustment counseling, those 
who had reported for other types of counseling such 
as vocational-educational counseling, and into those 
who did not seek any type of professional counsel- 
ing or psychiatric aid. In the present study, the 
counseling cases were classified as to type after each 
interview by the counselor, and those who remained 
as educational-vocational did not receive assignment 
to personality maladjustment by their counselors. A 
study was made of the “deviant” responses of the 
psychiatric treatment group, the two counseling 
groups, and a randomly selected control group from 
the nonclient group. A deviant response was defined 
as each time a typical adjective was not checked 
(from the 33 commonly selected adjectives) or each 
time an atypical adjective was checked (from the 
39 uncommonly selected adjectives). The four groups 
studied were: 24 psychiatric treatment cases, 37 per- 
sonal counseling cases, 186 educational-vocational 
cases, and 150 randomly selected nonclient controls. 
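
The scoring rule defined above (one point for each common adjective left unchecked, one point for each uncommon adjective checked) can be written out as follows. This is a sketch under stated assumptions: the function name and the short word lists are hypothetical stand-ins for the 33 common and 39 uncommon adjectives, which the text does not list.

```python
def deviant_score(checked, common, uncommon):
    """Count deviant responses for one respondent.

    A deviant response is a common adjective that was NOT checked,
    or an uncommon adjective that WAS checked.
    """
    checked = set(checked)
    missed_common = sum(1 for word in common if word not in checked)
    checked_uncommon = sum(1 for word in uncommon if word in checked)
    return missed_common + checked_uncommon


# Hypothetical key, standing in for the 33 common and 39 uncommon items.
common_key = ["friendly", "capable", "honest"]
uncommon_key = ["cruel", "dull"]

# One respondent omits "capable" and checks "dull".
score = deviant_score(["friendly", "honest", "dull"], common_key, uncommon_key)
print(score)
```

With the full 72-item key, scores under this rule can range from 0 (every common word checked, no uncommon word checked) to 72.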


RESULTS AND DISCUSSION 


The means and standard deviations of the
deviant response scores for each of the four
groups are presented in Table 1. The psychiatric
and personal counseling groups gave
more deviant responses than did the
educational-vocational counseling group or the
control group.


TABLE 1

MEANS AND STANDARD DEVIATIONS OF DEVIANT RESPONSES
OF PSYCHIATRIC, PERSONAL COUNSELING, VOCATIONAL
COUNSELING, AND CONTROL GROUPS

Group          N      Mean           SD
Psychiatric    24     14.29          [illegible]
Personal       37     [illegible]    [illegible]
Vocational     186    9[...]         [illegible]
Control        150    9.77           [illegible]


TABLE 2

ANALYSIS OF VARIANCE OF DEVIANT RESPONSE SCORES
OF PSYCHIATRIC, PERSONAL COUNSELING, VOCATIONAL
COUNSELING, AND NONCLIENT CONTROL GROUPS

                                                         Mean
Source of Variation                              df      Square      F
Between groups                                   3       186.57      3.91*
  Psychiatric & Personal vs.
    Vocational & Control                         (1)     (484.66)    10.15*
  Psychiatric vs. Personal                       (1)     (71.14)     1.49
  Vocational vs. Control                         (1)     (4.92)      —
Within groups                                    393     47.72
Total                                            396

* Significant at the .01 level of confidence.


The summary of an analysis of variance of
the deviant responses is given in Table 2. 
The between-group means F test, which 
yielded an F value of 3.91, significant at the 
.01 level, was followed by three orthogonal 
comparisons between the various groups. The 
first of these comparisons indicates that the 
psychiatric group and the personal counsel- 
ing group, when combined, differ significantly 
from the vocational group and the control 
group taken together, when compared for av- 
erage number of deviant responses. The two 
groups with personality problems give sig- 
nificantly more deviant responses, a finding 
consistent with Berg’s hypothesis. The re- 
maining comparisons show that the psychi- 
atric group does not differ significantly from 
the personal counseling group in deviant re- 
sponse scores and that the vocational group 
does not differ significantly from the control 
group in average number of deviant responses. 
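
The F values in Table 2 follow from dividing each mean square by the within-groups mean square, and the degrees of freedom follow from the four group sizes. The sketch below recomputes both from the printed figures. It is a consistency check under the assumption that the table entries are transcribed correctly, not a reanalysis of the raw data, and the variable names are hypothetical.

```python
# Mean squares as printed in Table 2. Each single-df comparison has a
# mean square equal to its sum of squares.
MS_WITHIN = 47.72
mean_squares = {
    "Between groups": 186.57,
    "Psychiatric & Personal vs. Vocational & Control": 484.66,
    "Psychiatric vs. Personal": 71.14,
    "Vocational vs. Control": 4.92,
}

# F is the ratio of each mean square to the within-groups mean square.
f_ratios = {source: ms / MS_WITHIN for source, ms in mean_squares.items()}
for source, f in f_ratios.items():
    print(f"{source}: F = {f:.2f}")

# Degrees of freedom implied by the group sizes given in the Method section.
group_sizes = [24, 37, 186, 150]
df_between = len(group_sizes) - 1
df_within = sum(group_sizes) - len(group_sizes)
df_total = sum(group_sizes) - 1
print(df_between, df_within, df_total)
```

The recomputed ratios may differ from the printed F values in the second decimal place, presumably because the published figures were derived from unrounded mean squares.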

The differences in the data are consistent 
with Berg’s Deviation Hypothesis. The two 
groups who sought professional help for per- 
sonal adjustment problems, the psychiatric 
group and the personal counseling group, pre- 
sumably exhibited more personality disturb- 
ance than did members of the nonclient con- 
trol group or members of the educational- 
vocational counseling group. In the present 
study, although most of the deviant adjectives 
are of the type that would be selected by sub- 
jects with a response set to describe them- 
selves in self-depreciating terms, only 39% of 
the adjectives in the present study occur in 
Gough’s list of “least favorable” adjectives, 
and only one of the 30 items of Gough’s Ad- 
justment Scale from his Adjective Check List 
(1955) occurs in the total pool of commonly 
and uncommonly selected adjectives of the 
present study. 

We call attention to a novel element in 
our study. The traditional manner of deriv- 
ing an empirical discrimination between two 
groups has been to give a test to two criterion 
groups, e.g., well-adjusted and maladjusted, 
select the discriminating items by an item 
analysis, and then to cross-validate the em- 
pirical key with new groups. In the present 
study, modeled to test Berg’s hypothesis, the 
empirical test key was established by study- 
ing a general population in terms of typical 
and atypical responses. Then various criterion 
groups drawn from a new population were 
used to ascertain the validity of the deviant 
response key. 


CONCLUSION 


A list of commonly selected and uncom- 
monly selected adjectives from the Gough 
Adjective Check List was established on the 
basis of responses of a group of university 
freshmen. When this list of frequently chosen 
and infrequently chosen words was given to 


a new sample of freshmen, it was found that 
the deviant response scores of freshmen who 
later became personal counseling or psychi- 
atric treatment cases were significantly greater 
than the deviant response scores of a control 
group of nonclients. A group of freshmen who 
later applied for educational-vocational coun- 
seling and were retained in that classification 
by their counselors did not deviate signifi- 
cantly from the control group in their deviant 
response scores. These results are consistent 
with Berg’s Deviation Hypothesis. 
INDIVIDUAL AND SOCIAL CORRELATES OF PRISON ESCAPES ¹


Many prison administrators believe prison escapes
are primarily situational, based on opportunity
and triggered by stresses of the prison
environment. The fact remains that some inmates
will escape and others will not even when
opportunities are equal. This study attempts to
identify some personal and social characteristics
of the “escape risk” inmate and, in so doing,
help fill a curious scarcity of empirical research
on the subject. 

The Ss were 200 white male inmates of the 
Louisiana State Penitentiary. Half had success- 
fully escaped over a two-year period. The re- 
maining 100, matched by month and year of ad- 
mission and degree of custody, were nonescapees. 
Each escapee was thus matched with a non- 
escapee imprisoned for the same length of time 
and under the same degree of supervision at the 
time of escape. 

Twenty-two variables were subjected to chi
square analysis. Eight variables discriminated
the criterion groups above the .05 confidence level.
Traits representative of the escape groups were
as follows: fewer dependents, more prior out of
state penal commitments, more property crimes
committed, more juvenile commitments, younger
at first arrest, from smaller cities, from other
states, and fewer years residence within Louisiana.

1 An extended report of this study may be obtained
without charge from F. E. Stockwell, Louisiana
Department of Institutions, Baton Rouge, Louisiana,
or for a fee from the American Documentation
Institute. Order Document No. 6114, remitting $1.25
for microfilm or $1.25 for photocopies. 

Three additional variables were significant be- 
tween the .05 and .10 levels. Traits predominant 
in the escape group were: longer sentences, lower 
tested educational achievement, and younger at 
admission. 

Variables which did not discriminate between 
the two groups were: Otis IQ, marital status, 
broken homes, urban backgrounds, father’s occu- 
pation, number of siblings, previous prison com- 
mitments within state, veteran status, psychiatric 
history, years school completed, and number of 
institutionalized siblings. 

Significant variables were intercorrelated and 
cluster analyzed using Tryon’s modification of 
Holzinger and Harman’s B coefficient. The first 
cluster, with a B coefficient of 2.41, included the 
following variables: years residence within Louisi- 
ana, distance to home state, number penitentiary 
commitments elsewhere, and size of community. 
This cluster was labeled “Transient Criminality.” 

The remaining four variables were included 
within one additional cluster with a B coefficient 
of 2.64. The cluster, identified as “Early Crimi- 
nal History,” included number of juvenile com- 
mitments, age at first arrest, number dependents, 
and commission of property crimes. 

The results suggest that with situational factors 
held reasonably constant, the risk of a given 
prisoner’s escaping is related to certain precom- 
mitment life patterns. 
A DIMENSIONAL ANALYSIS OF CLINICAL JUDGMENT ¹


JOHN W. DONAHOE 


University of Kentucky ²


Commonly, studies of clinical judgment se- 
verely restrict the mode of response open to the 
judge (e.g., psychiatric diagnosis). This is done 
in order to yield a readily quantifiable dependent 
variable (e.g., number of correct classifications). 
Unfortunately, such decisions constitute an un- 
representative sample of the kinds of judgments 
that clinicians are called upon to make. Rather, 
what appears to be needed is an instrument allow- 
ing for more diversified responses so as to provide 
a knowledge of the dimensions along which 
clinical judgments vary. The present study was 
an attempt to develop a rating scale to fulfill 
this function. 

The rating consisted of 50 descriptive and 25
dynamic seven-step scales. The descriptive
scales were chosen from adjectives and phrases
used by Beck in the interpretation of test
materials. Terms of opposed meanings (e.g.,
constricted – spontaneous) were selected as scale
endpoints. The 25 dynamic scales dealt with the
chief sources of conflict and the mechanisms of
defense according to psychoanalytic theory (e.g.,
never uses projection – very frequently uses
projection).

The Rorschach productions of six hospitalized
mental patients were scored and summarized.
The diagnostic categories represented were
schizophrenia, depressive reaction, personality trait
disturbance, and chronic brain syndrome. Twelve
clinical psychologists with an average of 3.1 years
of diagnostic experience served as judges. All
judges were successively presented with the six
protocols and asked to place a check mark on
each of the 75 scales and also to diagnose the
patient in accord with conventional psychiatric
nomenclature.

1 An extended report of this study may be obtained
without charge from John W. Donahoe, Psychology
Department, University of Kentucky, Lexington,
Kentucky, or for a fee from the American
Documentation Institute. Order Document No. 6115,
remitting $1.25 for microfilm or $1.25 for
photocopies.

2 This study was conducted while the author was
employed at the Veterans Administration Hospital,
Lexington, Kentucky.

The median interjudge reliability coefficient was
.51, with a range from .18 to .98. In order to
examine the structure of the rating form, a scale
matrix was constructed. Those scales with
squared vector lengths of less than 5.0 were not
included, thereby reducing the number of
descriptive scales from 50 to 42 and the number of
dynamic scales from 25 to 6. The reduced
matrix was analyzed by the D method of factoring.
The first factor was defined by the scale
endpoints much hostile conflict (pivot), neurotic,
unstable, insecure, and anxious. These are terms
intimately associated with the concept of
neuroticism. The second factor dealt with a continuum
from introversion to extroversion as the scales
having highest coordinates were cautious (pivot),
introversive, constricted, and withdrawn. The
third factor, while not as homogeneous as the
others, was concerned with psychosis. Scales
heavily loaded on the third factor were organic
brain involvement (pivot), defenses holding
poorly, psychotic, and underreactive.

The judges were correct in only 38% of their 
diagnoses. When the matrix was analyzed with
respect to Rorschach protocols, however, the records
were appropriately grouped. Thus, while the
judges were capable of checking the rating form 
with some validity, they were unable to integrate 
these separate judgments into an accurate global 
statement. In short, the scaling technique should 
provide a means for assessing clinical judgment 
under a wide variety of conditions.


SOME CORRELATES OF MANIFEST ANXIETY IN CHILDREN 1


IRA ISCOE AND IRENE COCHRAN
University of Texas 


Two hypotheses were tested. 1. If a re- 
lationship exists between anxiety as measured by 
the CMAS and observable behavior, then children 
who score high on the CMAS should be judged 
as significantly more “maladjusted” than their 
classmates of the same sex who score low on 
this scale. 2. If Manifest Anxiety is related 
to adjustment, it should bear a significant re- 
lationship to the California Test of Personality, 
a frequently used instrument to assess “adjust- 
ment” in school age children. 

For Hypothesis 1, 118 boys and 96 girls, aver- 
age age 10.4 years, from six elementary school 
classrooms, were administered the CMAS. For 
each grade the five boys and five girls having 
the highest and lowest anxiety scores were se- 
lected. A total of 60 Ss was thus obtained. 
Mean HA was 25.5 and LA was 3.4. Homeroom 
teachers rated each S on “adjustment” compared 
to own sex group, using a five-point rating scale. 
They were unaware of CMAS scores of the students.
χ² was equal to 8.2 (df = 2), which is
significant at the .02 level. A rank order correlation
yielded a rho of .37, significant at the .01 level.
Manifest Anxiety would appear to be an observable
characteristic, noticeable to those familiar
with the Ss’ everyday school behavior.
Since the concept of Manifest Anxiety is derived
from Hullian theory, a consideration of the
relationship between high and low drive and
teachers’ judgments seems appropriate. Questioning
of teachers as to the basis of their judgments
revealed a tendency to equate “maladjustment”
with “restlessness,” lack of attention, and
inability or unwillingness to “settle down.” Good
adjustment indicated the opposite tendencies.
There is some support, therefore, for the belief
that drive level was a prominent observable
characteristic.

1 An extended report of this study may be obtained
without charge from Ira Iscoe, Department
of Psychology, University of Texas, Austin 12, Texas,
or for a fee from the American Documentation
Institute. Order Document No. 6118, remitting $1.25 for
microfilm or $1.25 for photocopies.

For Hypothesis 2, 228 fourth- and sixth-grade 
children were administered the California Test 
of Personality and the CMAS. High scores on 
the former indicate good adjustment. 

CMAS correlated −.48 with “Total Adjustment,”
−.49 with “Personal Adjustment,” and
−.59 with “Freedom from Nervous Symptoms.”
These r’s are highly significant (p = .01).
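The significance levels reported for the two hypotheses can be checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' own computation: it assumes that the rank order correlation was based on all 60 Ss, that the correlations with the California Test of Personality were based on all 228 children, and that the usual t approximation for a correlation coefficient is acceptable for rho. The function names are illustrative only.

```python
import math


def chi_square_p_df2(chi_square):
    """Upper-tail probability for a chi-square statistic with df = 2."""
    # With 2 degrees of freedom the chi-square distribution is an
    # exponential distribution with mean 2, so the tail area is exp(-x / 2).
    return math.exp(-chi_square / 2.0)


def t_from_r(r, n):
    """t statistic (df = n - 2) for testing a correlation against zero."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))


# Values taken from the report; the sample sizes are assumptions (see above).
p_ratings = chi_square_p_df2(8.2)           # about .017, i.e., below .02
t_rho = t_from_r(0.37, 60)                  # about 3.03 on 58 df
t_total_adjustment = t_from_r(-0.48, 228)   # about -8.2 on 226 df
```

Under these assumptions the computed values are in line with the levels stated in the text: the chi-square falls just under the .02 level, and both correlations give t values well beyond those needed for the .01 level.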

The results of Hypothesis 1 suggest a relation- 
ship between general body activity and judgments 
of adjustment. This is commensurate with the
concept of Manifest Anxiety as a measure of 
drive level. The results of Hypothesis 2 suggest 
a significant relationship between high drive level 
and adjustment and also the willingness to admit 
certain subjectively experienced discomforts and 
apprehensions. The possibility of similarity of 
item content on both instruments should also be 
considered. 

Despite certain shortcomings, the concept of 
Manifest Anxiety in children as a measure of 
drive level, if not of adjustment, has in it 
potentialities that deserve further study. 


(Received August 9, 1959)


Brief Reports


A CLUSTER ANALYSIS OF OBJECT SORTING BEHAVIOR 1


A. B. SILVERSTEIN 2
Pacific State Hospital 


Clinical psychologists are frequently called 
upon to assess a patient’s ability to think ab- 
stractly, and we possess a number of instruments 
with which we endeavor to measure this ability 
and gauge its impairment. Not much is known, 
however, concerning the relationships among the 
measures which these instruments yield. The 
aim of the present study was to clarify the inter- 
relations of certain scorable aspects of the Gold- 
stein-Scheerer object sorting test as used by 
Rapaport, Gill, and Schafer (1945).

The data were those reported by Rapaport
et al. for 54 randomly selected members of the
Kansas Highway Patrol.3 Thirteen variables were
selected for analysis. All distributions were
dichotomized as close to the median as possible,
and φ/φmax was employed as the index of association.
McQuitty’s elementary linkage analysis
was then utilized to treat the data.
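The two steps just described, computing φ/φmax for each pair of dichotomized variables and then grouping the variables by elementary linkage, can be sketched as follows. This is a minimal illustration under stated assumptions, not the author's procedure as actually carried out: the cell labels a, b, c, d are hypothetical names for the four cells of a 2 x 2 table, the φmax formula used applies only to positive association, and the linkage step assumes there are no tied highest entries. Under that last assumption, joining every variable to the variable with which it has its highest index yields the same groups as starting from reciprocal pairs and adding the variables that are most like a member.

```python
import math


def phi_over_phimax(a, b, c, d):
    """Return (phi, phi / phimax) for a 2 x 2 table of counts.

    a = high/high, b = high/low, c = low/high, d = low/low.
    phimax is the largest positive phi the marginal splits allow.
    """
    n = a + b + c + d
    row_hi, col_hi = a + b, a + c
    phi = (a * d - b * c) / math.sqrt(
        row_hi * (n - row_hi) * col_hi * (n - col_hi)
    )
    p_big = max(row_hi, col_hi) / n
    p_small = min(row_hi, col_hi) / n
    phimax = math.sqrt(p_small * (1 - p_big) / (p_big * (1 - p_small)))
    return phi, phi / phimax


def elementary_linkage(index):
    """Group variables from a symmetric matrix of association indices."""
    n = len(index)
    # Each variable's highest associate (the diagonal is ignored).
    best = [
        max((j for j in range(n) if j != i), key=lambda j: index[i][j])
        for i in range(n)
    ]
    # Join every variable to its highest associate; connected sets are clusters.
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, j in enumerate(best):
        parent[find(i)] = find(j)
    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)
    return sorted(clusters.values())


# Hypothetical 4-variable matrix, for illustration only.
demo = [
    [1.0, 0.8, 0.1, 0.2],
    [0.8, 1.0, 0.3, 0.1],
    [0.1, 0.3, 1.0, 0.7],
    [0.2, 0.1, 0.7, 1.0],
]
demo_clusters = elementary_linkage(demo)   # [[0, 1], [2, 3]]
```

Dividing φ by φmax removes the ceiling that unequal median splits place on φ, so that pairs of variables with different marginal proportions can be compared on the same footing before the linkage step is applied.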


1 An extended report of this study may be obtained
from Arthur ... Order ... 6117, remitting $1.25 for
microfilm or $1.25 for photocopies.

2 Formerly at the Psychiatric Institute, University
of Maryland.

3 The author is indebted to D. Rapaport and to
the Year Book Publishers for permission to make 
use of material appearing in Diagnostic Psychological 
Testing. 


exception was that Functional Definitions from 
both parts of the test formed a cluster of their 
own.

Rapaport et al. consistently distinguish between 
active and passive concept formation and place 
some emphasis on the diagnostic significance of 
differences in performance on Parts I and II of 
the test. The findings of this study indicate that 
the distinction between the two parts of the test 
is quite justified. Not all research workers make 
this distinction, however. A comparison of per- 
formance on Parts I and II as scored by the 
“conceptual area” analysis of McGaughran and 
Moran (1956) would appear highly desirable in 
the light of the present findings.

It has been suggested that more valid a 
of intelligence, e.g., on Wechsler’s vocabulary,
might be obtained if functional definitions were
to receive less credit than concrete definitions,
since the order abstract-concrete-functional appears
to be closer to the truth than the conventional
order abstract-functional-concrete. In contrast
to this, the results of the present analysis
suggest that while abstract and concrete definitions
may lie on a continuum, functional definitions
actually represent a thing apart.


PSYCHOLOGICAL TEST REVIEWS


New Tests 


Stott, D. H., and Sykes, Miss E. G. Bristol Social 
Adjustment Guides. Age 5-15.1 Adjustment guide 
(observation blank), scoring templates, and diag- 
nostic form (summary record blank) for (a) the 
child in school (boy) BG-1, and (girl) BG-2; 
(b) the child in residential care BG-3; and (c) 
the child in the family BG-4. Guides BG-1 and 
BG-2 use the same scoring templates and diagnostic
form. Guide BG-4 includes a life history
blank and is scored using a copy of the guide 
with overprinted key in lieu of templates. 

The Bristol Social Adjustment Guides constitute 
a careful attempt to render more objective and pre- 
cise the observations of teachers, social workers, and 
foster parents, with respect to children’s social 
adjustment. The Guides are intended to save time 
for the psychologist by providing accounts of be- 
havior more standardized and compact than anec- 
dotal protocols, and more specific than impression- 
istic ratings. Since the Guides encourage accuracy 
in observing children and provide a system for 
classifying symptomatic behavior, they have a place 
also as aids in clinical training. 

Each Guide consists of numerous adjectives and 
phrases grouped by topic and scrambled as to de- 
sirability, though with negative indications in the 
majority. The observer simply underlines those terms 
which apply to the child concerned. Translucent 
templates are provided on which critical items are 
keyed for maladjustment in specific areas such as 
“hostility to adults,” “anxiety,” “knavery,” and 
“psychosomatic symptoms.” The results can be 
quasi-quantified on the “diagnostic form” on which
the keyed items are recorded, by counting the significant
items marked in each category. Items in
certain areas are also grouped as to severity. However,
the author eschews any overall index of maladjustment.

Stott and Sykes are thoroughgoing empiricists, 
and their work is a good example of an inductive 
approach to conceptualizing maladjustment. Avoid- 
ing the constructs of current theories, they collected 
descriptions of specific children as reported in case 
conferences. By successive tryouts with teachers, 
social workers, etc., they developed lists of descriptive
terms which could be used comfortably and
unambiguously by nonpsychologists. Items were
grouped into approximately 15 “attitudes,” e.g.,
“depression” or “hostility to adults,” by a scalogram-like
procedure in which children’s names on one
axis of a large grid and their symptoms on the
other were ordered and recorded until coherent
syndromes of behavior emerged. Only those items
are now scored whose relative frequencies among
the stable vs. the maladjusted (omitting a middle
category of “unsettled children”) yielded significant
χ²’s. However, the “attitude” syndromes shade into
one another along a behavioral spectrum and have
merely descriptive significance.

1 With manual, The Social ... Pp. 46. London: University
of London Press, 1958.

Since the Adjustment Guides are intended only to 
elicit and organize the specifics of observed behavior, 
not all the usual criteria of test evaluation are rele- 
vant. The 46-page manual by Stott includes a 
lengthy defense of his scientific philosophy and a 
detailed historical account of the successive steps 
in the development of the Guides. Their reliability 
(interrater agreement) in a preliminary study is 
given as around .76. More data are needed, but 
reliability is likely to depend as much upon the 
raters’ skills and opportunities as upon the content 
of these forms. The validity of each item in dif- 
ferentiating between the stable and the maladjusted is 
presented, but without cross-validation. Normative 
data for the interpretation of total scores on each 
attitude are lacking. This is not a serious omission 
for American users who will need to develop their 
own norms anyway. 

Differences between the English and American 
languages constitute a more fundamental problem. 
The author’s success in finding meaningful descriptive 
phrases drawn from the colloquial speech of nurses, 
teachers, etc., e.g., “spivvish dress,” or “feckless,”
will occasionally create problems of understanding 
for Americans. Even clinical vocabulary has its dif- 
ferences—e.g., “unforthcomingness” for “shy,” or 
“unsettled” for “mildly maladjusted.” Those of us 
used to the euphemistic polysyllables of “psychopathic
deviate” may feel decidedly self-conscious
characterizing a child as a “knave.” The Guides, 
in summary, will probably not achieve their po- 
tential value in this country until someone undertakes 
to translate them from the authors’ English into 
ours.—Read D. Tuddenham.


Carroll, J. B., & Sapon, S. M. Modern Language
Aptitude Test. Ages 14—adult. 2 forms, long 
(60 min.) and short (30 min.). Hand or IBM 
scoring. Tape recording necessary for administra- 
tion of long form ($7.50) ; test booklets ($3.50 for 
25), IBM answer sheets ($3.50 for 50), scoring 
stencils ($0.50 per set), specimen set (without 
tape) ($0.75), manual, 27 pp. New York: Psycho- 
logical Corp., 1959. 

The Modern Language Aptitude Test, the outcome 
of a five year study, includes aural and visual materials.
The long form, which contains both types
of materials, requires a tape recorder for administration.
The two aural parts of the test, included only
in the long form, are “Number Learning” and
“Phonetic Script.” The other three are: “Spelling
Clues,” requiring recognition of incompletely spelled
words; “Words in Sentences,” testing sensitivity to
grammatical structure; and “Paired Associates,”
intended to tap memory.

Sufficient data bearing on reliability and validity 
are presented to warrant issuing the test and to 
establish confidence in its usefulness as a diagnostic 
device. The normative data, particularly for high 
school and adult norms, are weak.—E. S. B. 


Stark, R. H. Polyfactorial Study of Personality. 
Adults. Test booklets ($6.25 per package), IBM 
answer sheets ($2.75 per package), stencils ($2.00), 
profile sheets ($2.75), specimen set ($2.50), manual,
11 pp. ($0.75). New Rochelle, N. Y.: Bruce,
1959.


A 300-item inventory reportedly selected on the 
basis of factor analysis and correlations with un- 
specified criteria. The general impression is that the 
major criteria may have been other tests, e.g., Ror- 
schach, MMPI, and the H-T-P test. Ten scores 
are obtained and Tables of Correlations with MMPI, 
the Association Adjustment Inventory, and certain 
subtests of the WAIS are presented. Test-retest
reliability for a prison population suggests that the
scores are reasonably stable. No other validation
data are offered, even though reference is made to
various possible criterion populations. There are no
references to published reports. Why was this test
issued after so little developmental analysis?—E. S. B.


Cassell, R. N. Test of Social Insight. Two forms,
youth edition (junior and senior high school) and
adult edition. Untimed. Test booklet ($6.25 per
package); IBM answer sheets ($2.75 per package);
specimen sets ($2.50); profile sheet ($2.75 per
package); IBM scoring stencils ($1.00); manual,
... pp. ($1.75). New Rochelle, N. Y.: Bruce,


The Test of Social Insight consists of 60 five 
alternative multiple choice items dealing with home 
and family relations, authority figures and social 
agencies, play and avocational interests, and work 
and vocational interests. The two editions are 
parallel in content, only slightly altered to adapt to 
the different age groups. In addition to a total 
score, five part scores: withdrawal, passivity, cooperativeness,
competitiveness, and aggressiveness,
are obtained. There are two sets of norms for the
youth edition (junior and senior high school stu- 
dents) and two sets for the adult edition (college 
students and adults and guidance counselors and 
personnel psychologists). 

The odd-even reliabilities, even after Spearman- 
Brown corrections, are unsatisfactory, none above 
.88 and, in some populations, as low as .51. Validities
based on inadequately reported unpublished data
are more reassuring. Caveat Emptor!—E. S. B. 
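The reviewer's point about corrected odd-even reliabilities can be made concrete with the Spearman-Brown formula. The sketch below is illustrative only and is not taken from the test manual; the function names are assumptions. It shows what half-test correlation a corrected value of .51, the lowest figure cited above, would imply.

```python
def spearman_brown(r_half, k=2.0):
    """Reliability of a test lengthened k times, given the reliability
    r_half of the shorter form (k = 2 for the usual odd-even correction)."""
    return k * r_half / (1.0 + (k - 1.0) * r_half)


def uncorrected_split_half(r_full):
    """Invert the k = 2 correction: the half-test correlation implied by r_full."""
    return r_full / (2.0 - r_full)


implied_half = uncorrected_split_half(0.51)   # about .34
restored = spearman_brown(implied_half)       # back to .51
```

A corrected coefficient of .51 thus corresponds to a correlation of only about .34 between the odd and even halves, which is why such values remain unsatisfactory even after correction.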


Bruce, M. M. Bruce Vocabulary Inventory. Adult.
Untimed (20 min.); Specimen sets ($1.00); Test
booklet ($5.00 per package); IBM answer sheet
($2.75 per package); manual, 3 pp. ($0.75);
scoring keys, hand sets ($2.00), and machine sets
($4.50). New Rochelle, N. Y.: Bruce, 1959.

A 100-item multiple choice test in which the key
words were chosen at random from the even numbered
pages of the Oxford Universal Dictionary.
Norms are given for eight not very precisely defined
occupational groups and for the aggregate. The
groups vary from 61 to 495 in size. While the reported
reliabilities, odd-even, .92, and test-retest,
.84, are acceptable, the test correlates only slightly
more (approximately .80) with other vocabulary
tests than with intelligence tests (approximately .60).—E. S. B.


Bruce, M. M. Association Adjustment Inventory.
Adult. Untimed (10 min.); Test booklet ($5.00
per package); IBM answer sheets ($2.75 per package);
scoring keys, hand set ($2.00) and machine
set ($4.50); manual, 15 pp. ($0.75). New
Rochelle, N. Y.: Bruce, 1959.

The Kent-Rosanoff list of stimulus words has been 
cast up into a four alternative multiple choice 
format. Thirteen different scores are obtained, in- 
cluding a total (maladjustment) and an immaturity 
score, and 11 dimensions of “deviate ideation.” 
Tables of intercorrelation raise serious doubts as to 
whether there are that many independent scores. 
Norms for “institutionalized” and “general” popula- 
tions are based on large but vaguely defined popula- 
tions. Test-retest correlations for one week and 
one month intervals are satisfactory, but the base for 
claiming validity is very unsatisfactory. There is 
evidence of some general relationships with MMPI 
scales and with a largely unvalidated inventory.
Reference is made to unreported results of diagnostic
validity studies that are unpublished. It should
have been marked “For Experimental Use.”—E. S. B.


INTELLIGENCE: A CONCEPT IN NEED OF
RE-EXAMINATION 1


SHEPHARD LIVERANT 


Ohio State University 


The importance of the concept of intelli- 
gence in understanding and predicting human 
behavior appears so obvious that the value of 
this term in a science of behavior has seldom 
been questioned by psychologists. The inade- 
quacies inherent in present conceptions of in- 
telligence have been made explicit by a num- 
ber of authors (Anastasi, 1958a; Coombs, 
1952; Fromm & Hartman, 1955; McClelland, 
1951), but the reader is left with the impres- 
sion that the concept itself is essentially a 
useful and necessary one. By leaving open 
the possibility that the concept may be un- 
necessary, and even an obstacle, in explaining 
behavior, perhaps we can arrive at a better 
solution to the problems now subsumed under
the rubric of intelligence.2

This paper explores a number of problems
over which the intelligence concept has stumbled,
such as: establishing an unequivocal
definition, the contribution of heredity and
environment, measuring an innate capacity,
the constancy of the IQ, intelligence as a
general factor, the role of culture in defining
intelligent behavior, and the function of
nonintellective variables. The objective of
such an exploration is to re-evaluate these
persistent problems by analyzing the
logical and methodological adequacy of the
meanings carried by intelligence.

1 This paper was written in part at the Interdisciplinary
Program in the Behavioral Sciences at the
University of New Mexico, Summer 1958. The conference
was sponsored by the Behavioral Sciences
Division, Air Force Office of Scientific Research under
Contract AF 49 (638)-33. Their support is gratefully
acknowledged.

The author also wishes to express his appreciation
to Julian B. Rotter, Richard Jessor, and Edwin
Barker for their invaluable suggestions and criticisms.

2 In order to forestall misunderstanding, it is emphasized
at the outset that the term intelligence is
being questioned in its explanatory usage, not necessarily
as a descriptive concept. In other words, intelligence
is used in two ways; as a descriptive term
(e.g., A’s behavior is more intelligent than B’s) and
as an explanation (e.g., A solved more problems than
B because A is more intelligent). It is the validity of
the latter use which is being questioned.

Present uses of the term intelligence for in- 
dividual prediction and the difficulties inher- 
ent in such usage are best represented in the 
clinical setting. Consequently a brief look at 
the clinical approach to intelligence will serve 
as a point of departure for the subsequent 
analysis. 


CLINICAL APPROACH 


Clinical case studies have produced a greatly 
increased awareness of the failure of an IQ 
(or any other behavioral appraisal) to reflect 
accurately a constant ability to perform at a 
certain level in all situations. This awareness 
has led to the clinical practice of consider- 
ing the IQ as an index of present intellectual 
functioning rather than intellectual potential 
or capacity. However, the belief in the exist- 
ence of a “true” IQ or intelligence persists, 
and attempts to discover it in each individual 
continue. 

Largely on the basis of scatter or pattern 
analysis of intelligence tests, statements are 
made concerning the size of the discrepancy 
between what is now present in the way of 
intellectual functioning and some assumed 
absolute capacity. Were uneven test profiles 
produced only by deviant individuals such 
interpretations might be meaningful, but sub- 
test scatter is the rule rather than the excep- 
tion within any population in spite of efforts 
to build homogeneous measures (Schofield,
1952). As will be argued later, it is probably
more fruitful and certainly more empirically
justifiable to regard uneven test profiles
as the result of conditions (not necessarily 
abnormal ones) which have differentially af- 
fected the acquisition of certain skills and/or 
the manifestation of these skills in certain 
classes of situations. Then the task becomes 
one of determining what these conditions are 
and how they operate, rather than ascribing 
subtest patterns to various types of disease 
entities which affect another entity (intelli- 
gence). 

In general it appears that the host of evi- 
dence gathered over the last twenty or more 
years challenging the general and innate na- 
ture of intelligence has been subsumed under 
an interference hypothesis. Consequently, in- 
telligence can be and still is viewed as a gen- 
eral and innate source of variance, but it is 
increasingly recognized that in order to get 
at it from behavioral observation one must 
strip away an ever increasing number of com- 
plex variables which interfere with its effects 
on behavior. 

This point of view is illustrated by the work 
of the Chicago group (Eells, Davis, Havig- 
hurst, Herrick, & Tyler, 1951) who, after 
demonstrating the effects of culture on test 
performance, conclude that the answer to the 
problem is to develop “culture-free” intelli- 
gence tests. By contrast, Anastasi (1950) 
uses the same kind of evidence to question 
the validity of any test as a measure of an 
abstract conception of intelligence without re- 
gard to validation criteria. As Rotter (1958, 
p. 19) points out, “It is somewhat surprising 
that the emphasis on culture and training did 
not result in a discarding of the notion of 
over-all intelligence and the IQ in favor of a 
clear understanding of the value of varied 
tests in assessing what has been learned and 
what each test can predict. Rather the early 
trend to entitize and reify the IQ continues.” 
(Quoted by permission of McGraw-Hill.) 

The question being raised here is what if 
anything will be left as a contributor to un- 
derstanding behavior on a psychological level 
once the stripping away process is complete. 
If more and more of the variance which ac- 
counts for individual differences in intellectual 
performance can be attributed to other vari- 
ables, what is intelligence? 




THE PROBLEM OF DEFINITION 


Spiker and McCandless (1954) attempt to 
resolve the definitional problem by an appeal 
to early operationalism. In effect they conclude
that intelligence is what different intelligence
tests measure in different situations
with different groups and the relationship of 
these measures to other measures. Their solu- 
tion reduces intelligence to the status of an 
intervening variable or pure disposition term 
as described by Carnap (1956). There is 
nothing wrong with such a definition of in- 
telligence except that most people are not 
willing to accept it. Intelligence means more 
to most of us than a score on an intelligence 
test even though the “more” hasn't been made 
explicit (i.e., we use intelligence as a hypo- 
thetical construct whose validity depends on 
the verification or refutation of predictions 
derived from the theory which contains it). 

More recent thinking on the subject sug- 
gests that the ultraempirical approach is not 
a satisfactory solution to the problem of defi- 
nition. Several authors (Cronbach & Meehl, 
1955; Jessor & Hammond, 1957; Liverant, 
1958) have argued that the construction, in- 
terpretation and validity criteria of any test 
depend on the nomological network (mean- 
ings) surrounding the implicit or explicit con- 
struct(s) which the test is designed to meas- 
ure. Intelligence like any other construct takes 
its meaning from the assumed antecedent con- 
ditions and subsequent responses which it 
mediates. The controversies over nature-nur- 
ture on the antecedent side and general versus 
specific factors on the response side of the re- 
lationship characterize the difficulties in ar- 
riving at an accepted definition of intelligence. 
Thus, the next step is to examine first the 
nature of the antecedent conditions and then
the subsequent responses subsumed by intelligence.


ANTECEDENT SIDE OF THE PROBLEM

Heredity and Environment


As now evolved, the antecedents of intelli- 
gence do not refer to a set of conditions 
(either inherited or environmental) but rather 
to a conceptual model involving postulated 
genetic operations. Although stated in many
different forms, genetic intelligence seems to
embody the idea that each organism, from 
the moment of conception throughout the life 
span, has inherent within it an unmodifiable 
growth process determined by gene composi- 
tion which if given the proper environmental 
stimulation at the right time will result in 
maximum problem solving performance for 
that organism under suitable conditions of 
motivation, health, etc. Improper stimulation 
(degree and kind unspecified) may inhibit 
certain manifestations of this innate prop- 
erty, but given an adequate restoration of en- 
vironmental conditions (provided structural 
damage has not occurred) it will proceed 
upon its genetically determined course. How- 
ever, no amount of environmental manipula- 
tion will serve to modify the limits of intel- 
lectual performance prescribed by the genetic 
constituents. It is this last proposition which 
lends an air of crucial significance to this 
formulation, since knowledge of these innately 
fixed limits (intelligence) will permit us to pre- 
dict maximum attainment under optimal con- 
ditions—predictions which are invaluable in 
terms of educational and vocational planning. 

There are several implications of genetic 
intelligence which need clarification, if not 
complete refutation, in light of recent con- 
ceptual advances in both genetics and psy- 
chology. Paramount is the notion of intellec- 
tual limits set by heredity. As usually inter- 
preted this idea means that beyond a certain 
point (usually unspecified) any and all environmental
conditions cease to make a difference
as far as increments in problem solving
proficiency are concerned. But the conclusions
of modern day geneticists (e.g.,
Huxley, 1949; Stern, 1952) reveal a basic 
fallacy inherent in this line of reasoning. 
Geneticists postulate that the appearance of 
any characteristic (behavioral or otherwise) 
depends upon the interaction of a specific 
genetic structure operating within a specific 
environment. In this sense genes set the limits 
within any given environment but these limits 
may vary as the environment varies. Without 
in any way arguing against the Weismann-Morgan
theory of the immutability of the
germ plasm, it is no more logical to attribute
a limiting role to heredity than it is to assign
the same role to the environment. Perhaps
the irrevocability of genetic determinants
as compared to situational ones has contributed
to this mode of thought, but it is one
which contradicts the postulated interaction 
effect. 

The implication of a true interaction is not 
that only within a limited range (set by one 
of the variables) does the effect of the other 
make a difference, but that they both operate 
to produce their effects over any conceivable 
range. Now it does appear that heredity can 
influence the variety of conditions which will 
or will not have an appreciable effect on a 
given characteristic. For example, it can be 
said that species characteristics are deter- 
mined by heredity in the sense that the oc- 
currence or nonoccurrence of an extremely 
wide range of external conditions does not 
make a difference in the morphology neces- 
sary to identify a given species. Both theo- 
retically and empirically, however, a set of 
conditions could be and, in fact, have been 
produced which radically alter species characteristics.
Feeding thyroid extract to the
young axolotl to produce a creature adapted 
for the land rather than the usual aquatic life 
is a case in point (Jennings, 1930). On the 
other hand, intraspecies behavior is appar- 
ently affected by a much greater host of non- 
genetic variables. Furthermore, the impor- 
tance of situational variations in determining 
individual differences seems to increase as 
one ascends the phylogenetic scale. Beach’s 
(1954) findings regarding comparative varia- 
tions in sexual behavior emphasize the ever 
increasing role of situational factors in de- 
termining the sexual patterns displayed by 
increasingly complex organisms. It is likely 
that the same relationship is even more pre- 
dominant in the case of complex problem 
solving behavior. Since the evidence points to 
such a great number of circumstances which 
can affect intellectual performance it becomes 
conceptually difficult to speak of heredity as 
the determiner of intellectual limits, particu- 
larly in the case of human beings. 
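The logic of the interaction argument can be illustrated with a toy calculation. The sketch below is purely hypothetical: the multiplicative rule, the genotype values, and the environment values are invented for illustration and carry no empirical weight. It shows only that, once performance depends jointly on both factors, the "limit" observed for a genotype is a property of the range of environments sampled rather than of the genotype alone.

```python
def performance(genotype, environment):
    # Purely hypothetical interactive rule: the effect of either factor
    # cannot be stated without naming the level of the other.
    return genotype * environment


def ceiling(genotype, environments):
    """Highest performance a genotype reaches within a set of environments."""
    return max(performance(genotype, e) for e in environments)


usual_range = [1, 2, 3]            # environments ordinarily encountered
wider_range = [1, 2, 3, 4, 5, 6]   # the same, plus conditions not yet produced

low_g_usual = ceiling(1, usual_range)    # 3
high_g_usual = ceiling(2, usual_range)   # 6
low_g_wider = ceiling(1, wider_range)    # 6
```

In this toy case the lower genotype appears limited to 3 only so long as the environment is restricted to the usual range. Within the wider range it reaches the level that seemed to be the prerogative of the higher genotype, which is the sense in which limits "may vary as the environment varies."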

From this line of reasoning it follows that 
although no practical way may be known to 
effect appreciable positive changes in an individual’s
performance level, it cannot be assumed
that such changes are precluded by
genetically determined limitations. On the
contrary, the implication of genetic theorizing
in this respect is that changes in any
characteristic would result from sufficient en- 
vironmental changes. After producing previ- 
ously unknown patterns of performance in 
pigeons (i.e., bowling behavior), Skinner
(1958, p. 96) concludes that, “It is danger- 
ous to assert that an organism of a given spe- 
cies or age cannot solve a given problem.” 
The implications of this reasoning in the field 
of mental deficiency are apparent. 

A further failure to recognize the signifi- 
cance of the continuous interaction between 
genes and environment in producing behavior 
is indicated by the one-way effect of inter- 
action implied by the intelligence model. The 
implication is that training can only have an 
effect after neural maturation paves the way 
and that it is by controlling the matura- 
tional rate that heredity plays its vital role in 
the development of intelligent behavior. The 
“Johnny and Jimmy” type of studies (Mc- 
Graw, 1935) are cited as evidence in this 
connection. But this approach neglects the 
reciprocal relationship between physiological 
maturation and behavior. The “learning how 
to learn” phenomenon demonstrated by Harlow
(1949) strongly suggests that the rate of
acquisition of problem solving ability is di- 
rectly dependent on neural changes which 
occur as a function of previous problem solv- 
ing performance. In other words, previous 
experience as well as genes “shapes up” the 
organism’s maturational readiness for new 
training procedures. 

Acceptance of the interaction thesis con- 
cerning heredity and environment invalidates 
the concept of innate capacity unless the term 
includes the operations of an environment. 
Consequently, the prediction of behavior based 
on genetic antecedents involves the problem 
of specifying situational conditions, a prob- 
lem with which the present definition of in- 
telligence as an inherited capacity has not 
dealt. There are two kinds of situational con- 
ditions which need to be specified: first, the 
conditions of development, growth and learn-
ing, and second, the criterion conditions (i.e.,
the actual performance situations). The em-
phasis in this section is on the former condi-
tions. The criterion conditions will be dis-
cussed more fully in the subsequent response
section.


Environmental Specificity


Perhaps the most pronounced conceptual 
weakness of the present model lies in its in- 
ability to cope with environmental specificity. 
The position that individual differences in 
problem solving are a function of heredity 
given a relatively constant “demographic” en- 
vironment (e.g., similar cultural opportunities 
or similar family life) grossly oversimplifies 
the specific and still unknown effects of the 
“psychological” environment on the heredi- 
tary potential. 

Recent research concerned with the effects 
of early experience upon later behavior has 
brought into sharp focus the preeminent role 
of experiential factors in influencing problem 
solving processes. For example, evidence has 
been gathered that rats (Beach & Jaynes, 
1954), dogs (Clarke, Heron, Fetherstonhaugh, 
Forgays, & Hebb, 1951), and humans (Stone, 
1954) reared in impoverished perceptual and 
social environments show “irreversible” prob- 
lem solving impairment in adulthood. Exten- 
sive clinical case material now supported by 
still tentative research findings (Axline, 1949) 
indicates that much more is involved in a 
child’s inability to profit from the most “well 
conducted” training procedures than lack of 
the proper genetic constituents. The crucial 
role of interpersonal relationships in the ac- 
quisition of problem solving behavior is be- 
coming increasingly evident. 

In order to avoid confusion, it is important 
to recognize that no amount of empirical evi- 
dence relating intelligent behavior or the lack 
of it to environmental events can directly in- 
validate the model in question. As now con- 
ceptualized the position concerning genetic 
determinants is that they exist and somehow 
make a difference, a position which it would 
be absurd to deny. The most that data can 
accomplish is to question the usefulness of 
the genetic intelligence scheme and indicate 
lines along which a potentially more produc-
tive one may develop. The network of studies
beginning to elucidate the complex set of in- 
terrelated past and present situational fac- 
tors involved in determining the adequacy 
of intelligent behavior tends to place a he- 
redity explanation of problem solving ability 
in the position of an appeal to ignorance. In 
the present stage of conceptualization in this
area, it is the absence of knowledge con-
cerning situational determinants which brings
forth genetic ones.


The Constancy of the IQ 


In the final analysis the value of any sci- 
entific approach must be judged in terms of 
the number of valid hypotheses which it gen- 
erates. The questionable validity of one of the 
two major hypotheses stipulated by the ge- 
netic theory of intelligence (i.e., intelligence 
as a general factor) will be discussed in the 
subsequent response section. The inability to 
validate or refute the other major hypothesis 
concerning the constancy of the IQ repre- 
sents still another instance of the essential un- 
testability of the genetic model. 

It may well be that given a theoretically 
constant environment individuals would main- 
tain the same relative position in problem 
solving adequacy over any given time period 
as a function of genetic differences. However, 
the indications that the inconstancy as well
as the constancy of the IQ can be better ex- 
plained by alternative hypotheses (other than 
genetic ones) seriously challenge the existing 
model. Disregarding individual differences in
patterns of IQ change, the empirically ob- 
served “constancy” of intellectual status at 
the upper childhood age ranges has been ac- 
counted for on the basis of an “overlap” hy- 
pothesis. Anderson (1940) among others has 
proposed that the similarity in IQ from one 
age level to the next depends on the propor- 
tion of similar test items included in the total 
score at various age levels. 
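
The overlap hypothesis lends itself to a small worked illustration. The sketch below is not Anderson's own procedure; it assumes, purely for illustration, that every test item contributes independently and with equal variance to the total score, and that the later score is made up of all the items in the earlier score plus additional new ones. Under those assumptions the expected correlation between the two totals is the square root of the proportion of shared items. The function name and its parameters are hypothetical.

```python
import math


def overlap_correlation(shared_items: int, total_items: int) -> float:
    """Expected correlation between an earlier and a later total score.

    Illustrative assumptions (not taken from the source):
      * items are independent and have equal variance;
      * the earlier score sums `shared_items` items;
      * the later score sums those same items plus new ones,
        `total_items` in all.

    Then Cov = k * var, Var(earlier) = k * var, Var(later) = n * var,
    so r = k / sqrt(k * n) = sqrt(k / n).
    """
    if not 0 < shared_items <= total_items:
        raise ValueError("need 0 < shared_items <= total_items")
    return math.sqrt(shared_items / total_items)


# The larger the proportion of shared items, the higher the expected
# correlation between the two testings.
for shared in (25, 50, 75, 100):
    print(shared, round(overlap_correlation(shared, 100), 2))
```

On these assumptions a later test sharing one quarter of its items with an earlier one would be expected to correlate about .50 with it, without any appeal to a constant underlying capacity.
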

A more crucial test of the constancy hy- 
pothesis is provided by the longitudinal stud- 
ies (Bayley, 1955) explicitly considering in- 
dividual differences. These conclusively dem- 
onstrate the lack of IQ constancy over any 
childhood period. Sontag, Baker, and Nelson 
(1958), in probably the most thorough and 
well designed study of its kind, found that 
62% of their children changed more than 15 
IQ points in either direction sometime during
the developmental period from the age of 
three through ten years. Most significant is 
the finding that the direction and degree of 
change in rate of “mental growth” are re-
lated to personality dimensions (primarily a
need for achievement). 


Reductionism 


It can be and has been argued that envi-
ronmental explanations in and of themselves
must always remain incomplete since genetic 
differences undoubtedly do play a role. The 
demand for the incorporation of genetic op- 
erations in behavioral explanatory conven- 
tions brings us to the problem of reduction- 
ism, i.e., the question of the necessity and/or 
feasibility of seeking explanations of the phe-
nomena described in the language of one
discipline by reference to the descriptive lan- 
guage of another. Reductionism, as far as psy- 
chology is concerned, usually refers to at- 
tempts to explain molar behavior in terms of 
physiological (i.e., neurological) events. Since 
genetics is still lower in the hierarchy, in the 
sense that its effects on behavior must be 
mediated through physiological processes, the 
criticisms cited in reference to physiological 
reductionism should be even more pertinent 
in the case of genetic reductionism. Jessor 
(1958) cogently argues against the logical 
possibility of reducing constructs designed to 
abstract organism-environment interactions to
physiological operations, since the latter lack 
terms to describe the behavioral environment. 
This argument seems to be particularly ap- 
plicable to any attempts to seek the causal 
antecedents of problem solving behavior (cer-
tainly a complex organism-environment in-
teraction) in genetic laws (certainly lacking
terms to describe the behavioral environment). 

Whatever the eventual relationship between 
genetic determinants and complex behavior 
proves to be, it is obvious that this relation-
ship is a very distant one.³ Bridging this gap
necessitates taking into account a number of 
intervening processes at different levels of or- 
ganization. And the simpler or more molecu- 
lar the process, the closer is the relationship 
to the genes. Thus, initial sensitivity to stimuli 
may be more genetically determined than dif- 
ferential responding to these stimuli. Since 
these processes preceding complex perform-
ance are continuously changing as a function
of innumerable environmental variations in
large part brought about by the behaving or-
ganism, the more complicated the behavior,
the greater is the potential error in prediction
based on genetic antecedents.

3 Anastasi (1958b) elaborates this same point in
much more detailed and analytic fashion in a recent
article.

In this sense, it is not at all surprising to 
find a lack of correlation (Bayley, 1955) be- 
tween infant scales which of necessity sample 
lower (more molecular) levels of behavior and 
intelligence tests dependent on higher order 
processes. But the major conclusion to be 
drawn from our attempts to develop infant 
scales and culture free tests is that the more 
we narrow our performance sample to elimi- 
nate or minimize the effects of environmental 
variables, the less likely we are to arrive at 
valid measures of intelligent behavior as we 
now conceptualize it. 

The enormous difficulties involved in estab- 
lishing predictive relationships between genes 
and complex behavior raise the question of 
the usefulness of any single construct which 
attempts to mediate such a relationship. Ful- 
ler’s (1954, p. 16) statement is relevant in 
this connection, “On a priori grounds a bi- 
ologist would suspect that intelligence would 
not be a very good subject for genetic analy-
sis, since it is a complex character depending
upon the coordination of many separate proc-
esses.”


SUBSEQUENT RESPONSE SIDE OF
THE PROBLEM


Analogous to the previous discussion of its
antecedents, intelligence does not refer to a
well defined set of subsequent responses, but
to an elusive set of notions concerning prob-
lem solving [...] intelligence as the
general something (i.e., energy, capacity, po-
tential) which is necessary to solve [...]
directly [...] However, [...] have follo[wed] [...]
other frames [...] problem solving.


Intelligence as a General Factor 


Analysis of intelligence as a psychological
construct raises the logical problem of pointing to
the behavioral referents which are placed in
the class, intelligence. The same analysis also 
requires determination of the grounds upon 
which certain aspects of behavior are classi- 
fied. For, depending upon the basis of classi- 
fication, the invented construct may fulfill its 
goal in expediting the understanding of be- 
havior or as suggested by Peak (1953, p. 248) 
it may prove, “. . . artifactual and without 
systematic significance.” Unfortunately it is 
not clear which are the behavioral referents 
for intelligence or which principles are in- 
volved in isolating the intelligence concept. 

Intelligence as a prescientific concept (i.e.,
its meaning in everyday language) apparently 
refers to an unspecified but certainly very 
wide range of problem solving abilities. It is 
important to note that this usage does not 
refer directly to the behavior involved in solv- 
ing problems but to the adequacy of the re- 
sults of the behavior. In this sense the im- 
mediately distinguishable referents for intelli- 
gence are not behaviors but problems. The
importance of all this lies in the recognition 
that selecting a sample of problems from an 
unknown universe of problems on some his- 
torical, cultural and/or common sense basis 
to measure intelligence (a procedure tradi- 
tionally followed in intelligence test construc- 
tion) may involve a number of behaviors 
which are not functionally related. However,
the construct validity of intelligence as refer-
ring to all intellectual problems (i.e., a gen-
eral factor) depends upon the functional re-
latedness of all or at least most of the be-
haviors involved in problem solving. That is,
in terms of one set of operations, the de-
scribable events within the class, intelligence,
must correlate as a function of a common
source of variance.

Although factor analysis has repeatedly 
demonstrated the unrelatedness of a striking
number of behaviors traditionally grouped
under intelligence, there still persists the hy-
pothesis of a general ability which now, as the
result of training, interest, motivation, etc.,
has become differentiated into the various 
abilities demonstrated by correlational meth- 
ods. The persistence of a general factor is 
based on the low but positive correlations 
often found between the factors which emerge 
from the test matrix (i.e., a second-order fac-
tor). However, even discounting similarities
inherent in the content of the tests usually 
placed in the matrix, it cannot be assumed 
a priori that these correlations confirm the 
existence of general intelligence as usually 
understood. The communality which is found 
may, for example, be the result of the testing 
situation rather than any inherent property 
within the individual which transcends all 
situations. A series of studies conducted by 
Sarason and his coworkers (Gordon & Sara- 
son, 1955; Mandler & Sarason, 1952; Man- 
dler & Sarason, 1953; Sarason & Mandler, 
1952) indicates that variables (test taking 
attitudes or test taking anxiety) operating as 
a function of the interpersonal competitive 
situation represented by the test may con- 
tribute significantly to test performance. It 
seems reasonable to hypothesize that these 
situational variables may to a large extent 
account for a general factor, and surely, this 
is not what is meant by intelligence. By 
systematically varying the conditions under 
which the same tests are given to the same 
individuals, it may be possible to demonstrate 
concomitant changes in the measures as a 
function of these situational variables. 

Regardless of the accuracy of the hypothe-
sis concerning the role of situational factors 
in accounting for G, the criticism of the exist- 
ence of a general factor based on factor ana- 
lytic methods stands. Guilford (1956, p. 290) 
in an extensive review of factor analytic 
studies of intellectual tests concludes that, 
“The methods of multiple factor analysis,
which do not find a general psychological fac-
tor at the first order level, find no
second-order factor that can properly lay
claim to the title of intelligence.”

It is becoming increasingly clear that con-
structs which explicitly attempt to abstract
functionally related behaviors in a situation
class are potentially more useful than con-
structs which do not. For example, Liverant
(1958) found increased predictability by di-
viding need for recognition into the need
for recognition in academic situations, and
the need for recognition in social situations.
It is reasonable to assume that greatly lower-
ing the level of generality of intelligence by
subcategorizing it in terms of specific classes
of situations will significantly increase ac-
curacy of prediction. This likelihood has long
been recognized within such categorization as 
social intelligence, academic intelligence, me- 
chanical intelligence, etc. But somehow, we 
persistently return to a general intelligence, 
perhaps, because the meaning of the term 
proclaims the existence of one. 


Cultural Definition 


Whatever the most useful classification of 
intellectual behaviors proves to be, the very 
generality of intelligence makes it impossible 
actually to sample a unidimensional popula- 
tion of behaviors subsumed by intelligence 
since in one way or another most, if not all, 
of behavior sooner or later becomes involved 
in problem solving. Even scratching one’s 
epidermis may be viewed as solving the prob- 
lem of itching. However, such behavior tends 
not to be included in the typical intelligence 
test. In fact, the criterion of what is or what 
is not intelligent behavior involves a cultural 
value judgment. This makes for a great deal 
of difficulty in arriving at a consistent sample 
since by definition values are different in dif- 
ferent cultures and tend to change with time 
in a given culture. 

Rapaport (1953) suggests that in the case 
of terms having value connotations it is mean- 
ingless to ask what is X. Rather our problem 
becomes one of finding commonly accepted 
criteria for X. According to this point of view, 
rather than asking what is intelligence, we 
should be seeking established criteria for in- 
telligent behavior which will vary from one 
class of situations to another. In this sense
intelligence is divorced from its explanatory
significance and serves a descriptive function,
a state of affairs which may prove very de-
sirable.


The Role of Nonintellective Variables


Reference to the situational context in 
which intellectual behaviors operate brings 
into focus the role of variables other than the
intellectual behaviors per se in the problem
solving process. While it is generally conceded 
that nonintellective factors are important in 
all actual measures of problem solving ability, 
within the intelligence model no clear concep- 
tualization of what these variables are and 
how they function has yet been made. As 
testers, we somehow assume that by holding
nonintellective variables constant (vaguely re- 
ferred to as establishing rapport or providing 
the proper conditions) during the testing pe- 
riod, an index of an individual’s fundamental 
capacity to solve problems can be obtained 
apart from these nonintellective factors. But 
the manifestation of any problem solving abil- 
ity in or out of the testing situation must be 
in accord with certain laws of behavior which 
include the operation of these so-called non- 
intellective variables. 


[...] necessary to solve
a given problem for either of two reasons: [...]


Neither intelligence tests nor the intelli- 
gence model provides an objective means of 
making a decision regarding which of the 
two reasons is operating in the case of failure 
to solve problems. Wechsler (1958) attempts
to circumvent this problem by including the 
operation of nonintellective variables within 
his practical definition of intelligence. In prac-
tice, however, it is the inability to identify 
and formulate the operation of these variables 
which results in the lower predictive validity 
of intelligence tests, since it is these very vari- 
ables which change as the situation changes. 
As a consequence of these situationally modi- 
fiable variables we have the fairly common 
case of the child who functions so well in 
school but behaves so “stupidly” with his 
peers or, what is even more puzzling within 
the confines of intelligence, does admirably on 
an intelligence test and abominably in school. 
It is only within a theory of problem solving 
which attempts an explicit formulation of all 
the variables necessary to predict behavior 
that these apparent contradictions become un-
derstandable. 


MENTAL DEFICIENCY 


An adequate discussion of mental deficiency 
in light of the present suggestions concerning 
intelligence is beyond the scope of this pres- 
entation. It is mentioned at this point, how- 
ever, because it is in relation to the intellec- 
tually handicapped that the discarding of in- 
telligence as an explanatory construct may 
have its most far reaching effects. Fortu- 
nately, a number of prominent students
(Hutt & Gibby, 1958; McCandless, 1952; 
Sarason, 1959) of this problem have antici- 
pated and demonstrated the detrimental fal- 
lacy of regarding certain kinds of problem 
solving inadequacies as largely a matter of an 
inherited lack of capacity to perform any and 
all tasks at a certain level of difficulty. How-
ever, continued use of such terms as limited 
intelligence, inferior intellectual capacity, low 
IQ, feebleminded, mental defective, etc. (all
referring to lack of intelligence as an explana- 
tion for inadequate performance) hinders the 
understanding and the development of the 
point of view being expressed by these authors 
because of the surplus meanings inherent in 
the term intelligence. What is necessary from 
our point of view is recognition of the logical 
basis for discarding a contradictory and out- 
moded terminology which persists through 
tradition and not through theoretical and/or 
empirical justification. 


AN APPROACH TO CHANGE 


Inherent in the reasons for relegating in- 
telligence to a nonexplanatory usage are sug- 
gestions for an alternative approach to un- 
derstanding problem solving proficiency. Es-
sentially what is [...] intelligence be replaced
by a new term but that it be relegat[ed] [...]
which can [...] As argued by Mc-
Candless (1952) and Rotter (1958) among
others, the constructs of modern learning
theories would appear to be a likely point of
departure for this endeavor. 

Rather than regarding intelligence tests as 
measures of a single dimension (general in- 
telligence) it is more in keeping with the 
evidence to interpret them as assessing cul- 
turally expected standards of academic pre- 
paredness involving a number of complexly 
interrelated variables (e.g., implicit reaction 
patterns, motives, habits, expectancies, rein- 
forcements, etc.) to account for the results. 
Attempting to incorporate the varieties of 
socially learned behaviors operating in the 
testing situation within the domain of de- 
veloping theories of learning has the salutary 
effect of replacing a notion which is essen- 
tially untestable with constructs which have 
been selected with a view to functioning in 
potentially fruitful theories. 


SUMMARY 


An attempt has been made to question criti- 
cally the validity of the term intelligence as 
an explanatory construct. It has been argued 
that the term intelligence carries with it es- 
sentially untestable and empirically unjusti- 
fiable connotations which preclude coherent 
and systematic organization of the awesome 
variety of activities ascribed to its domain. 

An analysis of the antecedent and subse- 
quent conditions defining intelligence reveals 
the following difficulties inherent in this con- 
cept:

1. The genetic model representing the ante- 
cedent conditions of intelligence is essentially 
untestable and not in accord with recent con- 
ceptual advances in both genetics and psy- 
chology.

2. The usefulness of postulating genetic 
antecedents for complex problem solving be- 
haviors prior to a much more thorough un- 
derstanding of how heredity and environment 
interact in a given case (a long and tortuous 
process at best) appears quite dubious both 
on theoretical and practical grounds. But
more to the point are the more immediate
possibilities of understanding and predicting
intelligent behavior by continuing to pursue
lines of investigation designed to reveal the
situational conditions which facilitate or
hinder the acquisition of various problem
solving skills and the conditions which facili-
tate or hinder the expression of these skills in
certain kinds of situations. 

3. On the subsequent response side of the 
problem, the large majority of the evidence 
(primarily from factor analytic studies) in- 
dicates that a single dimension is not capable 
of subsuming the great varieties of behaviors 
now assumed to be related to intelligence. 

4. The very generality of the term prohibits 
the isolation of a functionally related class of 
behaviors. Any attempt to establish a stable 
functional unity is further complicated by 
the definitional necessity of involving cultural 
value judgments in selecting a population of 
intelligent behaviors, judgments which vary 
from one culture to another and tend to 
change with time in a given culture. 

5. Perhaps most crucial is the failure of in- 
telligence theory to specify systematically the 
effect of situational variables on the behaviors 
it is purported to predict. 

6. Finally, the arguments against the ex- 
planatory usage of intelligence suggest a 
fruitful approach to be the incorporation of 
the behavioral realm typically ascribed to in- 
telligence within the confines of modern learn- 
ing theory, particularly a social learning 
theory. 


PERSONALITY DIFFERENCES AND LEVEL
OF ASPIRATION


A number of previous investigations have
sought to study the relationship between per- 
sonality differences and different patterns of 
level of aspiration. While many of these stud- 
ies have found no stable relationships, a few 
(Cohen, 1954; Gruen, 1945; Klugman, 1948; 
Sears, 1940, 1941) find a recurrent relation-
ship between “maladjustment” and particular 
patterns of goal setting. Specifically, two pat- 
terns appear to be more often characteristic 
of maladjusted subjects (Ss) than others, 
either very high (compensatory or wishful)
or very low (protective) levels of aspiration. 


Cohen (1954), in discussing his results, asks
what individual differences might explain why
some Ss with strong feelings of self-rejection
deal with the level of aspiration situation by
making low, self-protective statements while
others deal with the same situation by mak-
ing high, unreal statements.

The present study concerns goal setting be- 
havior as it relates to two independent dimen- 
sions of maladjustment in personality func- 
tioning. These two dimensions, designated A 
and R, were measured by two special MMPI 
scales devised by Welsh (1956). These scales, 
obtained by factor analytic methods, are un- 
correlated with each other and may be used 
singly or in combination. While the scales are 
at present designated only as A and R, Welsh 
(1956) notes that high A scores are related 
to “disability of a dysthymic and dysphoric
nature in which anxiety is prominent” while
high R scores are “characterized by repres-
sion and denial.” Preliminary work with psy- 
chiatric populations indicates that when the 
two scales are used together to classify Ss the 
resulting four groups are distinguishably dif-
ferent in psychiatric diagnosis (Welsh, 1956).
Investigations with college students ([...],
1957; Pepper, 1957; Saute, 1958) have like-
wise demonstrated differences in behavior be-
tween groups selected on the basis of position
[...].

The present study was designed to investi- 
gate two hypotheses: (a) Ss scoring higher on 
A than on R (sensitizers)¹ will more charac-
teristically employ a protective or low pat- 
tern of goal setting in a failure situation; (b) 
Ss higher on R than on A (repressors) will 
more often employ a compensatory or high 
pattern of goal setting when confronted with 
failure. These two hypotheses are consistent 
with a picture of the sensitizer as an indi- 
vidual who anticipates and fears failure and 
criticism and who deals with the situation by 
“admitting it first” and with a picture of the 
repressor as one who avoids discomfort by 
denying negative things about himself. 


METHOD 


Seven sections of elementary psychology courses 
were administered an adapted group form of the 
MMPI, containing the A and R scales, and a group 
level of aspiration measure during one of their regu-
lar class meetings. The group level of aspiration
measure consisted of three sets of 15 anagrams and 
one set of 15 number series. 

These four tasks were presented in booklet form 
with the following instructions: 


On the following pages you will be asked to take 
a number of short experimental tests. These are 
new tests which we are just trying out. Prelimi- 
nary work indicates that they may be good tests 
of intelligence or general academic aptitude. We 
are also interested in how well people can predict 
their own test performance before taking a test; 
therefore, before each test you will be asked to make
a prediction of the score you think you are most 
likely to obtain.

On each of the next three pages you will find 15 
series of scrambled letters. If the letters are prop-
erly rearranged they will make a common English
word. No proper names or trick words are in-
cluded. Do any that you can; you need not work
on them in order. You will have 2½ minutes to
work on each set of anagrams.

1 The terms “sensitizers” and “repressors” will be
used here for convenience. They refer only to the
operational definitions given.

When I say “Go” you may turn the page and 
begin. But before beginning, please write in the 
blank below the number of words you think you 
will be able to complete. On each of the series of 
anagrams the average college student is able to 
complete 8 of the 15 words. Nearly all students 
complete 4; a few students complete 12. Please 
do not turn the page until you are told to do so. 


The instructions were read aloud by E while Ss 
followed them in their test booklets. Ss were asked 
to fill in identifying data on the front page as well 
as to make an estimate of their probable perform- 
ance on the first set of anagrams. On a signal from 
E, all Ss turned to the second page and began work- 
ing. They were told to stop at the end of 110 sec- 
onds. At the bottom of the second page they then 
recorded their estimates of their probable perform- 
ances on the second set of anagrams. On a signal 
from E they turned to the third page and began
work.

The procedure was repeated for the third and
fourth pages; however, on the fourth page they were
told:


On the next page you will find 15 number series. 
Four numbers are given and they are arranged ac- 
cording to some system. Figure out the system and 
then circle the one of the alternative numbers in 
the second column that fits the system. Three ex- 
amples are given below. 

The average college student gets 7 correct solu-
tions out of 15 in 4½ minutes; nearly all complete
4, a few get 13. Please estimate in the blank below 
how many you think you will be able to com- 
plete. Do not turn the page until you are told to 
do so. 


The time limit of 110 seconds for the sets of ana- 
grams was so selected that the majority of Ss, if 
they set their aspiration close to the supposed group 
average of 8 correct solutions, could not succeed. 
(Although Ss were told that they were to be per- 
mitted a longer time to work than was actually al- 
lowed, no S questioned the imposed time limit.) For 
the number series they were actually allowed 4½
minutes to work. This longer time limit combined
with the lower supposed “group average” was de- 
signed to let most Ss reach or exceed their aspira- 
tion estimate and hence to obtain some success at 
the end of the experiment. 

Personality test and aspiration data were obtained 
from 147 Ss. In order to test the hypotheses regard- 
ing differences in aspiration behavior of sensitizers 
and repressors, Ss were selected from this larger 
group. Only male Ss were included in the experi- 
mental groups, since previous research has shown sex 
differences in level of aspiration behavior (Sheehan 
& Zelen, 1955; Sumner & Johnson, 1949). 

A and R scores were translated into T scores by 
means of a table provided by Welsh (1956). Sensi- 
tizers were defined as those with A scores above 50, 
R scores below 50, and with at least a 10-point dif- 
ference between the two scores. Repressors were simi-
larly defined. For comparison purposes a control 
(A=R) group was also selected; controls obtained 


TABLE 1


MEANS AND STANDARD DEVIATIONS OF PERFORMANCE SCORES, ASPIRATION ESTIMATES,
AND PERFORMANCE-ASPIRATION DIFFERENCES


                                              Tasks
                        Anagrams 1      Anagrams 2      Anagrams 3      Number Series
Groups                   M      SD       M      SD       M      SD       M      SD
Sensitizers (N = 17)
  Performance           5.88   2.15     4.53   1.58     3.82   2.21     7.35   2.06
  Aspiration            8.00   1.61     6.41   1.82     5.24   1.22     6.88   1.37
  Difference           -2.12   2.57    -1.88   3.14    -1.42   2.44
Repressors (N = 25)
  Performance           5.44   1.65     4.08   1.92     4.52   1.72     7.08   2.50
  Aspiration            8.48   1.58     7.32   2.05     6.52   2.10  [illegible] 1.68
  Difference           -3.04   2.31    -3.24   2.72    -2.00   3.08
Controls (N = 26)
  Performance           6.15   2.19     4.65   1.99     4.69   2.72     7.42   2.70
  Aspiration            8.46   1.43     7.15   1.59     6.13   1.52     7.27   2.41
  Difference           -2.31   2.83    -2.50   2.07    -2.04   2.33




A and R scores within the range from 45 to 55 and 
had no more than 5 points difference between their 
two scores. In addition no S was included in any 
group who had actually succeeded in meeting or ex- 
ceeding his aspiration estimate on any of the three 
anagram tasks. 
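
The selection rules above can be restated as a short sketch. The function names are illustrative only, and the repressor rule is an assumption: "similarly defined" is read here as the mirror image of the sensitizer rule (R above 50, A below 50, at least 10 points apart).

```python
from typing import Optional, Sequence


def classify(a_t: float, r_t: float) -> Optional[str]:
    """Assign a group label from Welsh A and R T scores, or None if no rule fits."""
    diff = abs(a_t - r_t)
    if a_t > 50 and r_t < 50 and diff >= 10:
        return "sensitizer"
    # Assumption: repressors are the mirror image of the sensitizer rule.
    if r_t > 50 and a_t < 50 and diff >= 10:
        return "repressor"
    # Controls (A = R): both scores within 45 to 55, no more than 5 points apart.
    if 45 <= a_t <= 55 and 45 <= r_t <= 55 and diff <= 5:
        return "control"
    return None


def failed_all_anagrams(performance: Sequence[float],
                        aspiration: Sequence[float]) -> bool:
    """True when S fell short of his aspiration estimate on every anagram task."""
    return all(p < a for p, a in zip(performance, aspiration))


# Hypothetical T scores, not data from the study.
print(classify(62, 45), classify(44, 58), classify(52, 49), classify(53, 47))
```

An S would enter a group only when `classify` returns a label and `failed_all_anagrams` is true for his three anagram trials.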


RESULTS 


Means and standard deviations of actual 
performances, aspiration estimates, and dis- 
crepancies between performance and aspira- 
tion on each task are given in Table 1. Mean 
differences between groups were evaluated by 
t tests. For each task (Anagrams 1, 2, and 3 
and Number Series) the mean score of the 
sensitizers was compared with the mean score 
of the repressors; means of each of these 
groups were also compared with the mean of 
the control group. No differences between the 
groups in actual performance were significant. 
Likewise, the groups did not differ in size of 
discrepancies between performance and as- 
piration for the same task (which might be a 
measure indicative of intensity of failure). 
However, the mean aspiration estimate of the
sensitizers for the third set of anagrams was
significantly different from either the mean of
the repressors or the mean of the control
group. Values of t were, respectively, 2.25
(p < .02 > .01, df 40) and 3.43 (p < .01 >
.001, df 41). None of the other means of as-
piration estimates differed significantly among
the groups.
Table 2 presents means and standard devia- 
tions of d scores, i.e., the discrepancy between 
aspiration estimate and performance on the 
previous trial. Mean differences were tested 
in the same manner as for the data in Table 1. 
None of the mean differences between groups
in the first d scores are significant, although
the sensitizers had lower scores than the re-
pressors as predicted (t = 1.84, p < .10 >
.05, df 40). The mean difference between
sensitizers and repressors in the second d
score is highly significant (t = 2.94, p < .01
> .001, df 40). The same mean of the sensi-
tizers approaches being significantly different
from that of the control group (t = 1.82, p
< .10 > .05, df 41). None of the comparisons
of means of the third d score, which was
based upon aspiration for the number series,
were significant.


TABLE 2


MEANS AND STANDARD DEVIATIONS OF d SCORES
(ASPIRATION ESTIMATE MINUS PERFORMANCE
ON PREVIOUS TRIAL)


                                Scores
                d1 = A2 - P1     d2 = A3 - P2     d3 = A4 - P3
Groups            M      SD        M      SD        M      SD
Sensitizers      .53    2.53      .65    1.68     3.06    2.97
Repressors       .96    2.39     2.44    2.06     2.60    [illegible]
Controls        1.00    2.47     2.08    1.64     2.58    [illegible]
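
As an illustration of how a d score is formed, the following minimal sketch subtracts performance on one trial from the aspiration estimate given for the next. The function name and the sample values are hypothetical and are not data from the study.

```python
from typing import List, Sequence


def d_scores(aspiration: Sequence[float],
             performance: Sequence[float]) -> List[float]:
    """d_k = aspiration estimate for trial k+1 minus performance on trial k."""
    return [aspiration[k + 1] - performance[k]
            for k in range(len(aspiration) - 1)]


# Hypothetical S: three anagram trials followed by the number series.
aspiration = [8, 6, 5, 7]
performance = [6, 4, 4, 7]
print(d_scores(aspiration, performance))  # [0, 1, 3]
```

A d score near zero means the S set his next goal close to what he had just achieved, which is the protective pattern attributed to the sensitizers.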

Examination of Tables 1 and 2 reveals that 
all groups begin the experiment with roughly 
equivalent levels of aspiration, perform in 
about the same manner, and fail to about the 
same extent. All groups tend to shift their 
aspiration estimates downward over the three 
trials of anagrams and to raise them again 
when the number series is introduced. How-
ever, the sensitizers as a group tend to shift
their aspiration levels downward to a more
marked degree than the other two groups,
thus keeping the discrepancy between their
estimate for any trial and their previous per-
formance relatively small as compared with
either the repressors or the control group. 
These data might be considered to confirm 
tentatively the hypothesis that sensitizers 
would be characterized by the use of a pro- 
tective pattern of goal setting in a failure 
situation. The hypothesis that repressors 
would be characterized by a high, compen- 
satory pattern fares less well. Although the 
repressors as a group do tend to keep the 
discrepancy between their last performance 
and next aspiration relatively larger than do 
the sensitizers, their d scores are not larger 
than those of an equal tendency control
group.


DISCUSSION


The results, while suggesting that the sensitizer
is prone to adopt a protective pattern of goal
setting in a failure situation, give no clear
indication that the repressor adopts
a compensatory pattern. Inspection of the
data also failed to confirm an alternative hy-
pothesis that repressors might be utilizing
two or more patterns rather than just one.
Both repressors and controls in making esti-
mates either retained the estimate given for
the previous trial or else lowered their old 
estimates slightly, i.e., there were no marked 
differences between repressors and controls in 
their responses to this experimental situation. 

Although the original intention of the study 
was to use the A and R scales as two dimen- 
sions of maladjustment, the choice of opera- 
tional definitions for sensitizers and repres- 
sors would argue that few, if any, Ss included 
in either group could legitimately be charac- 
terized as markedly deviant. Clearer differ- 
ences between sensitizers and repressors and 
between both these groups and the control 
group might have been obtained if more ex- 
tremely deviant Ss had been available. 

In a carefully executed study, Miller (1951) 
investigated level of aspiration in psychiatric 
patients. Among his several criteria for inclu- 
sion of patients in one of the four diagnostic 
groups studied were patterns of scores from 
the clinical scales of the MMPI. Welsh’s
(1956) demonstration of relationships between
the A-R classification and profile configura-
tions on the MMPI suggests that the findings
of the present study utilizing a “nondeviant”
population may be profitably compared with
Miller’s findings from a “deviant” population.
Miller found that neurasthenics produced on 
the average the highest positive d score, that 
both paranoid schizophrenics and a group of
nonpsychiatric hospital patients produced d 
scores just slightly above zero, and that 
character disorders and conversion hysterics 
produced slightly negative d scores. Welsh 
found that elevations on D, Pt, and Sc were
prominent in the profiles of high A/low R
Ss and that psychiatric diagnoses of severe
psychoneurosis, anxiety state, and paranoid
schizophrenia occurred most often in this 
group. High R/low A Ss were more likely to 
show elevations on Hs and Hy and a marked 
lack of elevations on the psychotic scales; 
they were more likely in a psychiatric popu- 
lation to obtain diagnoses of mild psycho- 
neurosis, hysteria, and conversion reaction. 
Examination of Miller’s definition of neuras-
thenia (elevation on Hs, D, and to a lesser
extent on Hy; lack of elevation on P, Sc,
and Ma) indicates a fair degree of corre-
spondence to Welsh’s diagnostic expectations
for the high R/low A group, and, as in the
present study, this is the group found to have
the highest positive d scores. Similarly, Mil- 
ler’s diagnoses of character disorder and para- 
noid schizophrenia correspond rather clearly 
to the high A/low R scoring group described 
by Welsh. And again the observations of both 
studies are congruent; both groups show a 
low average d score, implying a self-protective 
orientation toward goal setting. 

There are, however, some discrepancies in 
the findings of the two studies. Miller’s nor- 
mals set goals more like his paranoid schizo- 
phrenics and character disorders, while in the 
present results normals set goals more like 
repressors than sensitizers. The second de- 
parture in the findings is the negative d score 
of Miller’s conversion hysterics. Following 
Welsh, we would expect these Ss to behave 
like the repressor group, obtaining positive 
rather than negative d scores. Miller’s cri- 
teria for the diagnosis of conversion hysteria 
seem, however, to have been somewhat more 
stringent than those employed in the con- 
struction and validation of the Hy scale, 
which may account in part for the discrep- 
ancy of results. The over-all degree of corre- 
spondence between the present findings and 
Miller’s lends considerable support to the 
general idea that characteristic patterns of 
goal settings and defending against failure 
are a significant aspect of different person- 
ality organizations all along the continuum 
of adjustment. 

The present study also differs in several re- 
spects from many previous studies. One im- 
portant difference is that Ss were subjected 
to consistent failure throughout the first three 
trials of the experiment; consequently, the 
obtained results could not be generalized to 
anything other than a failure situation. The 
influence of personality trends on goal setting 
might be expected to operate differently in 
situations where Ss succeed or where success 
and failure are interspersed. This experiment
also utilized a relatively short series of trials; 
the character of the results might change if 
Ss were subjected to more prolonged failure. 


SUMMARY 


Groups of college students classified as 
“sensitizers” or “repressors” by means of the 
Welsh A and R scales (MMPI) were com-
pared on a level
of aspiration measure. The situation was ar-
ranged so that Ss experienced failure on the
anagram tasks. Sensitizers employed a goal set-
ting pattern closer to their real previous per-
formance than did repressors or an equal
tendency control group (A = R). Repressors
did not show an hypothesized tendency to set 
high or compensatory goals. Limitations of 
the findings are discussed.


MANIFESTATIONS OF HOSTILITY IN NEURODERMATITIS*


Neurodermatitis is a relatively common
chronic skin disorder, characterized by in- 
tensely pruritic excoriations, and is usually 
attributed to psychogenic factors. The nature 
of these psychological disturbances is de- 
scribed as centering around the suppression 
of intense hostile impulses (Dunbar, 1954; 
Obermayer, 1955). Neurodermatitis patients 
thus present an opportunity to investigate a 
specific personality pattern in a relatively 
objectively classified population. The purpose 
of this research was to test by an experiment 
the conclusions obtained through clinical ob- 
servations, and to contribute to a better un- 
derstanding of the disorder. 

Early studies using rating scales and per- 
sonality inventories (Allerhand, Gough, & 
Grais, 1950; Greenhill & Finesinger, 1942; 
Lynch, Hinckley, & Cowan, 1945) unani- 
mously reported a higher incidence of hos- 
tility in neurodermatitis patients than in vari- 
ous control groups. None of these investiga- 
tors, however, reported a cross-validation of 
their findings. 

Several projective studies are reported in 
the literature (Cleveland & Fisher, 1956; 
Fiske, 1953; Fiske & Obermayer, 1954; 
Geist, 1957; Kepecs, Rabin, & Robin, 1951; 
Levy, 1952). All investigators concurred with 
the finding that neurodermatitis patients 
manifest more hostility than do normal per- 
sons on projective tasks. Little is known, 

1 This study is based on a doctoral dissertation 
submitted in partial fulfillment of the requirements 
for the PhD degree at Teachers College, Columbia 
University. Thanks are extended to Joel R. Davitz, 
Paul E. Eiserer, and especially to Laurance F. Shaf- 
fer, who was of invaluable assistance during all 
stages of this study. The author is indebted to A. C. 
Carr and G. S. Baroff for serving as judges. 

2 Now at the Institute for the Crippled and Dis-
abled.


however, of the relationship between covert 
hostility as manifested in projective proto- 
cols and overt expression of aggression. In an 
effort to correlate various levels of assessment 
of behavior, Leary (1957, pp. 395-396) con- 
structed a multilevel personality battery. Re- 
porting on skin patients he described his find- 
ings: “The multilevel pattern is that of a 
sweet, responsible facade with underlying 
rage and bitterness . . . they exhibit an in-
tense conflict between this bland exterior
and sadistic rage.” Unfortunately, no control 
group was used for comparison with the skin 
patients. 

An outstanding therapeutic experiment con- 
ducted by Seitz (1953) emphasized the sup- 
pression of hostility in dermatitis patients 
and based a therapeutic procedure on this 
characteristic. Reporting on a series of 25 pa- 
tients, he arrived at the conclusion that the 
alleviation of skin eruptions could be achieved 
through a short series of therapeutic inter- 
views focusing only on hostility and aggres- 
sive impulses. In a period of 10 sessions he 
explored possible sources of annoyance to the 
patient and encouraged him to talk about 
them. Only 13 of the 25 patients remained in 
therapy; 12 of the 13 were completely re- 
lieved of symptoms. 

Summarizing the present state of knowl- 
edge, the neurodermatitis patient may be de- 
scribed as a generally hostile person who ex- 
pends great effort on successful control of his 
impulses. Outwardly he appears calm and 
collected, yet covertly he harbors deep re- 
sentment. In psychotherapy such a patient 
will readily respond to favorable interper- 
sonal conditions by verbalizing a great deal 
of hostility. This research was designed to 
test these formulations.


METHOD 


The instruments were chosen to reflect the central
issues of the research. They were (a) a Manifest
Scale of Hostility to test the hypothesis that neuro-
dermatitis patients do not see themselves as more
aggressive than normal persons; (b) seven Thematic
Apperception Test (TAT) cards selected to elicit
hostility and thus to tap the covert level of this
specific behavior; and (c) a Sentence Formation
Task (SFT) on which sentences were constructed
and which could be reinforced. It was postulated
that the SFT best reflects the therapeutic situation
and thus represented a laboratory analogue to Seitz’s
method.


The Manifest Scale of Hostility 


A manifest scale on which the subject (S) evalu- 
ated himself with respect to hostile behavior was 
chosen for comparing overt hostility. Inasmuch as 
the Ne scale (Allerhand et al., 1950) had been es-
pecially designed for use with neurodermatitis pa-
tients and contained a number of items dealing with
the central issue of this research, its use seemed ap-
propriate.

Twenty-four items from the Ne scale were in-
cluded in the instrument, supplemented by four ad-
ditional items in order to obtain greater reliability.
The score was the sum of the items judged by two
psychologists to pertain to hostility. In such a man-
ner 15 items were included in the final score.3 In
spite of the added items in an effort to obtain a re-
liable measure, the index of split-half reliability was
low (r = .39).


The Thematic Apperception Test


For the purpose of obtaining a covert hostility
score, a series of seven TAT cards was administered
in the standard manner. Cards 3 BM, 6 BM, 8 BM,
12 M, 13 MF, 18 GF, and 18 BM were used. The
stories obtained were independently analyzed by two
clinical psychologists according to the scoring instruc-
tions developed by Fisher and Hinds (1951). It was
emphasized that scoring should not involve inter-
pretation, but mere classification of content. Seven
categories of hostility were classified from story
content: (a) openly hostile stories, (b) verbally ex-
pressed hostility, (c) rationalized hostility and sui-
cide, (d) illness and accidents, (e) ambivalently ex-
pressed hostility, (f) neutral content, and (g) posi-
tive or favorable content. Interjudge reliability for
the scoring categories ranged from .79 to .85 on the
different scales. In those instances where judges dis-
agreed, a third clinician was consulted to resolve the
issue.

The scores were weighted in the following manner:
a score of 3 was assigned to Category a (openly hos-
tile stories), a score of 2 to Categories b and c
(verbal expressions of hostility and rationalized hos-
tility), a score of 1 to Categories d and e (illness,
accidents, and ambivalently expressed hostility), a
score of -1 to neutral content (Category f) and of
-2 to positive content (Category g). The final score
was the algebraic sum of the scores of the seven
stories for one individual.

3 After the inception of the research Buss and
Durkee published a Hostility-Guilt inventory which
may have been particularly useful in this study. The
publication came too late, however, to be incor-
porated in the research design (Buss & Durkee, 1957).
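
The weighting scheme can be restated as a short sketch. The weights are those given above; the function name and the sample protocol are hypothetical.

```python
# Weights for the seven content categories, as stated in the text.
WEIGHTS = {"a": 3, "b": 2, "c": 2, "d": 1, "e": 1, "f": -1, "g": -2}


def tat_hostility_score(categories):
    """Algebraic sum of the category weights over one S's stories."""
    return sum(WEIGHTS[c] for c in categories)


# Hypothetical protocol: one category letter per story, seven stories.
stories = ["a", "c", "f", "d", "g", "b", "e"]
print(tat_hostility_score(stories))  # 6
```

Because neutral and positive stories carry negative weights, a protocol can yield a total at or below zero.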


The Sentence Formation Task 


While some previous studies used [illegible]
measures, few, if any, attempted to [illegible] the
process of psychotherapy itself. Psychotherapy has
been viewed as a process of selective reinforcement
by the therapist (e.g., Shaffer, 1947). Inasmuch as
the elicitation of hostile responses is central to the
psychologic treatment of neurodermatitis patients, it
was believed that verbal reinforcement of hostile re-
sponses is analogous to the therapeutic situation. A
recent review by Krasner [illegible] the
available research in this area.

In order to elicit reinforceable responses, a sentence
completion task was constructed. On
3” X 5” cards four verbs were typed, one of which
was of hostile stimulus value. The S was presented
with one card at a time and instructed to construct
one sentence using any one of the four verbs pre-
sented.

The hostility value of a verb was determined by 
two clinical psychologists. Each was given a list of 
82 verbs selected for possible hostility value, and 
was asked to indicate which of the verbs Aa be 
termed “hostile” when used in a sentence. Both 
judges agreed on 74 verbs, and these were used in 
the construction of the instrument. 

Three scores were obtained for each individual 
consisting of the number of verbs of hostile stimulus
value used in each of the three conditioning periods: oper-
ant, reinforced, and extinction. A split-half reliability
coefficient was computed for each of the three ex- 
perimental conditions. The correlations for hostile re- 
sponses were .60, .62, and .77, respectively. 


Procedure 


The manifest scale was administered in a mimeo-
graphed form on which the patient had to indicate
by circling either T or F, whether a statement ap-
plied to him or not.

The seven TAT cards were administered in the
standard manner.

The Sentence Formation Task was introduced by
the following statement: “I want to see how good
you are at making sentences. I am going to show
you some cards, each of which has four verbs. You
can choose to make a sentence with any of the
verbs.” The first 30 cards were presented without
any comment in order to establish an operant level
of response. On the next 30 cards, whenever the S
chose the hostile verb to make a sentence, he was
reinforced by the experimenter who said, “yes,”
“good,” “fine,” “uh-huh,” or “hmm-mm,” as the oc-
casion warranted. The presentation of the last 14
cards was regarded as an extinction period and no 
reinforcement was administered. The cards were pre- 
sented in a random order. 
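
The three SFT scores can be illustrated with a minimal sketch. It assumes that "percent possible score" means the number of hostile-verb choices in a period divided by the number of cards in that period; the function name and the sample record are hypothetical.

```python
# Period lengths as described in the procedure: 30, 30, and 14 cards.
PERIODS = (("operant", 30), ("reinforced", 30), ("extinction", 14))


def sft_percent_scores(hostile_choice):
    """hostile_choice: booleans in presentation order, True when the
    hostile verb was chosen on that card."""
    total_cards = sum(n for _, n in PERIODS)
    if len(hostile_choice) != total_cards:
        raise ValueError(f"expected {total_cards} cards")
    scores = {}
    start = 0
    for name, n in PERIODS:
        block = hostile_choice[start:start + n]
        scores[name] = 100.0 * sum(block) / n  # percent of possible score
        start += n
    return scores


# Hypothetical record for one S.
record = ([True] * 3 + [False] * 27      # operant: 3 of 30
          + [True] * 6 + [False] * 24    # reinforced: 6 of 30
          + [True] * 7 + [False] * 7)    # extinction: 7 of 14
print(sft_percent_scores(record))
```

Expressing each score as a percentage puts the 14-card extinction period on the same scale as the two 30-card periods.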

The testing session required from 45 to 60 minutes. 


Subjects 


The Ss of the investigation were 40 patients re- 
ferred through the Department of Dermatology of 
the Presbyterian Hospital and the College of Physi- 
cians and Surgeons of Columbia University as well 
as from one private dermatologist. Three experi-
mental Ss were obtained through the latter source,
while the rest of the neurodermatitis patients, as
well as all the controls, were obtained through the former.

In order to control for the psychological effects of 
skin disorder per se, all control Ss were derived from 
the same general population as the experimental 
group, except for their diagnosis. Only patients af-
flicted with clearly nonpsychogenic disorders (e.g.,
contact dermatitis, lupus, etc.) were accepted. The
experimental Ss were diagnosed independently by a
board-certified dermatologist or by the staff of the
clinic.

All Ss were white, over 18 years of age, and were 
born or had received their schooling in this country. 
The experimental group consisted of 5 males and 15 
females, and the control Ss were 8 males and 12 fe-
males. No significant differences were found between
the experimental and control groups with respect to 
age (the means were 38.75 and 38.10, respectively) 


and years of school attended (means 12.35 and 12.85, 
respectively). 


RESULTS AND CONCLUSIONS 
Overt Hostility 


No significant differences between the neuro- 
dermatitis patients and the control group were 
found (t = 1.02 with 38 df). One must bear
in mind, however, that the reliability coeffi-
cient of the scale used was only .39, so that
the acceptance of the null hypothesis applies
only to the measure used. Further research
with a more reliable scale would throw addi- 
tional light on the matter at hand. 


Covert Hostility 


It had been hypothesized that neuroderma- 
titis patients show greater hostility than the 
control group on instruments tapping covert 
personality factors, such as measured by the 


4 The author is indebted to Carl T. Nelson, Ex-
ecutive Officer of the Department of Dermatology,
for his permission to use the Ss and for his active
cooperation in the study, and to D. P. Torre and
J. McCarthy for their assistance and cooperation.


TABLE 1

COMPARISONS OF MEANS AND STANDARD
DEVIATIONS ON TAT SCORES


                ND                      Statistics of
Statistic    Patients     Controls      Comparison        p
Mean           6.7          3.3         [illegible]
SD         [illegible]      2.8         [illegible]

TAT. This hypothesis is borne out by the re- 
sults of the research. Table 1 shows both sig- 
nificantly higher mean hostility scores and 
greater variance for the experimental Ss in 
comparison with the dermatology control 
group. These results show that neuroderma-
titis patients both emit more hostile responses
and are as a group more variable on a pro- 
jective test designed to tap covert person- 
ality variables. 


Response to Reinforcement 


From the reports of psychotherapists, it be- 
came evident that neurodermatitis patients re- 
spond favorably to verbal reinforcement of 
hostile responses. It was therefore hypothe- 
sized that such patients manifest different re- 
sponse frequencies of verbal hostile behavior 
under experimental reinforcement conditions 
than normal persons. 

Three measures were computed for pur- 
poses of inter- and intragroup comparisons, 


TABLE 2


COMPARISONS OF MEANS AND STANDARD DEVIATIONS
OF HOSTILITY SCORES ON SFT


(In percent possible score)


               Experimental                 Statistics
Condition      ND Patients     Controls     of Comparison      p
Operant
  Mean            10.8           14.3        t = 2.04         .05
  SD               5.0            5.6        F = 1.26         NS
Reinforced
  Mean            18.2           18.5        t = 0.83         NS
  SD               9.4            8.6        F = 1.19         NS
Extinction
  Mean            18.5           18.5        t = 0.00         NS
  SD               9.8            9.2        F = 1.13         NS


corresponding to operant, conditioning, and ex- 
tinction periods. As can be seen from Table 2, 
neurodermatitis patients show a significantly 
lower hostile response frequency on the SFT 
under nonreinforced conditions than control 
Ss. Under favorable (i.e., reinforced) condi- 
tions, however, they emit no fewer hostile re- 
sponses than the control group. 
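
The operant-period comparison can be approximately checked from the summary figures in Table 2. The sketch below assumes a pooled-variance t test for two groups of 20 Ss each and a variance ratio with the larger variance in the numerator; the source does not state which formulas were used, so the function names and these choices are assumptions.

```python
import math


def pooled_t(m1, s1, n1, m2, s2, n2):
    """Pooled-variance t for two independent groups, from summary statistics."""
    pooled_var = ((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / (n1 + n2 - 2)
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / se


def variance_ratio(s1, s2):
    """F as the larger variance divided by the smaller."""
    big, small = max(s1, s2), min(s1, s2)
    return (big / small) ** 2


# Operant period: controls (M = 14.3, SD = 5.6) vs. patients (M = 10.8, SD = 5.0).
t = pooled_t(14.3, 5.6, 20, 10.8, 5.0, 20)
f = variance_ratio(5.0, 5.6)
print(round(t, 2), round(f, 2))  # 2.08 1.25
```

These values fall close to the tabled t = 2.04 and F = 1.26; the small differences are of the size that rounding of the published means and standard deviations would produce.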

As it was originally stated, the hypothesis 
with respect to increased verbal response 
rates of neurodermatitis patients under fa- 
vorable conditions was not substantiated. 
These patients differ from the control group 
not in response to reinforcement, but in their 
base level of verbal hostility. Reinforcement 
enables these persons to achieve a normal 
rate of emission of verbal hostile responses. 
The results are consistent with the clinical
observations.


Hostility and Neurodermatitis 


Certain conclusions with respect to the 
manifestations of hostility in neurodermatitis 
patients may be drawn from the test battery 
used in this research. Assuming that the 
Manifest Scale of Hostility measures overt, 
and the TAT covert, expressions of aggres- 
sive impulses, the neurodermatitis patients 
emerge as persons with a covert hostility 
drive which they do not manifest overtly. 
This conclusion is substantiated by their low 
operant level of emittance of hostile verbs. 
The spontaneous, unreinforced responses
constitute a sample of the verbal hostility
rate of the person in everyday life. A low
operant level thus reflects infrequent use of
hostile words and consequently placid verbal
behavior.


DISCUSSION


The results of this study demonstrate the 
feasibility of constructing a test battery spe- 
cifically designed to measure overt and covert 
levels of a specific personality dimension, as 
well as the verbal emission of certain response
categories. Such batteries may, in the future,
be used to investigate the nature of the psy-
chotherapeutic process and may contribute
to a better understanding of the verbal in-
teraction between therapist and client.


The research study concerned itself with a 
specific medical-diagnostic group of patients
and confirmed impressions formulated by
clinical workers with respect to one character 
dimension (hostility), but was not designed 
to test all hypotheses concerning the etiology 
of the disorder. It is entirely possible, for ex- 
ample, that other personality characteristics, 
such as anxiety, are concomitants of the neuro- 
dermatitis syndrome. Furthermore, it is pos- 
sible and quite likely that disturbances in the 
psychologic sphere are necessary contributing 
factors to the etiology of the disorder, but 
that an organic predisposition constitutes a 
prerequisite for the formation of the syndrome. 

The findings of the study are limited by 
the deficiencies of the instruments used. The 
Manifest Scale of Hostility in particular was 
found to be so low in reliability that the ac- 
ceptance of the null hypothesis with respect 
to the two groups may well have resulted 
from the inadequacy of the scale rather than 
from a real absence of difference in response 
pattern. Inasmuch as available research pre- 
sents conflicting evidence with respect to 
overt and covert relationships, the data of 
this study are insufficient to demonstrate 
conclusively that the hostility manifested in 
neurodermatitis patients is demonstrable on 
a purely covert level. Taken at face value, 
the results obtained on the TAT and the SFT 
are consistent with theories postulating an in- 
verse relationship with respect to overt and 
covert hostility patterns. The lower unrein- 
forced use of hostile verbs on the latter meas- 
ure can be regarded as a behavioral measure 
of verbal hostility. 

The experimental procedure in the research 
was designed in such a manner as to permit 
future applications to research with the proc- 
ess of psychotherapy. The SFT appears 
suited to simulate an experimental psycho- 
therapeutic relationship. 


SUMMARY 


Substantial clinical evidence suggests that 
patients afflicted with neurodermatitis suffer 
from a personality disturbance in which the 
suppression of intense hostility is a central 
feature. This research was designed to obtain 
empirical evidence concerning the manifesta- 
tions of hostility on overt and covert levels 
in such patients.

It was hypothesized that neurodermatitis
patients (a) evidence no more hostility than
control Ss suffering from nonpsychosomatic
skin diseases on an instrument tapping overt
behavior, (b) manifest more hostility on a
covert measure of hostility than the control
Ss, and (c) differ from the control Ss in
verbal hostile responses on a conditioning task.

The results were consistent with all three 
hypotheses. No difference between the two 
groups was found on the Manifest Scale of 
Hostility, but the low reliability of the scale 
does not permit more than a tentative ac- 
ceptance of the null hypothesis. Differences 
on the TAT, as rated by two independent 
clinicians, were significant at the .01 level 
with respect to both higher mean and larger 
variance in favor of the neurodermatitis 
group. On the Sentence Formation Task the 
control patients emitted significantly more 
hostile responses on the operant base level 
than the experimental Ss. The experimental 
Ss increased their hostile responses under re-
inforcement conditions, reaching a level equal
to that of the control Ss. Some of the impli-
cations of the methodology with respect to 
future research were discussed.


A COMPARISON OF CHANGES IN PSYCHONEUROTIC
PATIENTS DURING MATCHED PERIODS OF
THERAPY AND NO THERAPY *


Despite a very large increase in the num-
ber of published research reports on the ef- 
fects of psychotherapy, the more basic ques- 
tion as to its effectiveness seems still to remain 
unanswered. This question has been framed 
most often as one of the effectiveness of 
therapy versus spontaneous remission. It has 
been dealt with in these terms in a review by 
Eysenck (1952) and in recent research stud- 
ies by Barron and Leary (1955), and Dy- 
mond (1955). Barron and Leary concluded 
from their careful study of MMPI changes 
that therapy patients did not improve signifi- 
cantly more than did waiting list controls. 
Cartwright (1956), in a note on their paper, 
pointed out that though therapy patients did 
not improve more, there was significantly 
more change with regular, formally defined
psychotherapy than without it. This was in-
dicated by the difference in the variances of
the changes for the treated and untreated
groups. Dymond (1955) reported a compari-
son of improvements made by a small sam-
ple of waiting list “improvers” and therapy
“improvers.” The analysis strongly suggested
the improvement of the two groups to be dif-
ferent in kind.

Although neither study gives any definitive
answer to the basic question, each supplies 
some evidence of therapy having some meas- 
urable impact not found in its absence. The 
study to be reported here is another of this
kind: it appears to help clarify the argument
that therapy does have some special effective-
ness with psychoneurotic patients by specify-
ing the conditions of the effectiveness more
fully.


HYPOTHESES


The major hypothesis can be stated most 
generally as: 

1. There will be more change in measured 
adjustment for a self-referred group of psy- 
choneurotic patients during a period of time 
in which they are meeting regularly with a 
therapist than during a period of time equal 
to it during which no regular formally consti- 
tuted therapy is taking place. 

Several subhypotheses will also be investi- 
gated. If it is supposed that therapy has more 
effect than no therapy, then: 

2. Longer therapy periods will be associ- 
ated with more improvement than shorter 
therapy periods. 

3. Longer waiting time periods will not 
be associated with more improvement than 
shorter waiting periods. 

4. Patients seen by experienced therapists 
will improve more in a therapy period than 
those seen by inexperienced therapists. 


METHOD 


Design and Subjects 


Thirty subjects (Ss) who applied to the Univer-
sity of Chicago Counseling Center were
asked to participate in a research study which would
involve testing at four points in time: (a) immedi-
ately upon their agreement to participate (Pre-wait
test), (b) immediately before beginning counseling
(Pre-therapy test), (c) sometime during the course
of their therapy at a point exactly equal in time to
the length of time between Test 1 and Test 2 (In-
therapy test), and (d) when therapy had been com-
pleted (Post-therapy test).

Of the 30 Ss who were initially tested, 8 remained 
TABLE 4 


EFFECT OF PRE-WAIT STATUS ON DIRECTION OF 
CHANGE AND LENGTH OF WAIT (TAT) 


(Direction of Change on TAT while Waiting) 


                                  +     0−    Total 
High (10+) Pre-Wait status 
    Long Waits (8-24 Weeks)       3     3       6 
    Short Waits (3-7 Weeks)       1     4       5 
Low (9−) Pre-Wait status 
    Long Waits                    6     0       6 
    Short Waits                   2     3       5 
Total                            12    10      22 


the three variables—length of time (waiting 
or In-therapy), original status, and direction 
of change—a two-by-two-by-two analysis was 
used. The first of these was designed to an- 
swer the question “Does the original Pre-wait 
Status on the TAT affect the association be- 
tween the length of waiting time and direction 
of change on this measure?” Separate two-by- 
two tables were set up to look at the direc- 
tion of change positive as against no change 
or negative change for long wait and short 
wait Ss, one for those whose original status on 
the TAT was high and the second for those 
whose Pre-wait status was low. 

These two tables were then combined in 
terms of number of cases on the diagonal 
and number off the diagonal. The difference 
between the proportions on the diagonal for 
high and low original status was tested for 
significance using the Fisher and Yates exact 
method. In this case, the difference between 
the proportions 7/11 and 9/11 is not signifi- 
cant. The answer to the original question, 
then, is that long wait Ss improve and short 
wait Ss do not improve or deteriorate on the 
TAT irrespective of their original status. 
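
The condensation and exact test described above can be sketched in a few lines of Python. This is an illustration only, not the authors' computation: the function names are hypothetical, the counts are copied from Table 4, and a two-sided test is assumed because the text does not say whether the Fisher and Yates method was applied one-sided or two-sided.

```python
from math import comb

# Table 4 counts (TAT change while waiting). For each Pre-wait status the
# rows are (long wait, short wait) and the columns are (+, 0 or -).
TABLE_4 = {
    "high": ((3, 3), (1, 4)),
    "low": ((6, 0), (2, 3)),
}


def condense(two_by_two):
    """Return (on_diagonal, off_diagonal) counts for one 2x2 table."""
    (a, b), (c, d) = two_by_two
    return a + d, b + c


def fisher_exact_two_sided(row1, row2):
    """Two-sided Fisher exact p-value for the 2x2 table [row1, row2].

    Sums the hypergeometric probabilities of every table with the same
    margins that is no more probable than the observed table.
    (Two-sidedness is an assumption; the source does not specify it.)
    """
    (a, b), (c, d) = row1, row2
    n1, n2, col1 = a + b, c + d, a + c
    lo, hi = max(0, col1 - n2), min(n1, col1)
    # Integer weights avoid floating-point ties between equally likely tables.
    weights = {k: comb(n1, k) * comb(n2, col1 - k) for k in range(lo, hi + 1)}
    observed = weights[a]
    extreme = sum(w for w in weights.values() if w <= observed)
    return extreme / comb(n1 + n2, col1)


# Condense Table 4 into on-diagonal / off-diagonal counts (Table 5).
table_5 = {status: condense(t) for status, t in TABLE_4.items()}
p_value = fisher_exact_two_sided(table_5["high"], table_5["low"])
print(table_5)
print(round(p_value, 3))
```

Under these assumptions the condensed counts are 7 on and 4 off the diagonal for high status and 9 on and 2 off for low status, and the two-sided probability is about .64, which agrees with the statement that the difference between 7/11 and 9/11 is not significant.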

A second analysis of this type was done to 
determine if deterioration of the high Pre- 
therapy TAT scorers and improvement of the 


6 This technique was suggested by Willi 
TABLE 5 


CONDENSATION OF TABLE 4 


                           On-         Off- 
                         Diagonal    Diagonal    Total 
High Pre-Wait status        7           4         11 
Low Pre-Wait status         9           2         11 
Total                      16           6         22 


low Pre-therapy scorers during a therapy pe- 
riod was affected by the length of the time in 
therapy. The proportion on the diagonal for 
the long therapy group in this case was 7/10 
and not significantly different from the pro- 
portion on the diagonal for the short therapy 
group 9/12. Thus the relationship found in 
Table 3 holds: those with high TAT scores 
before therapy begins tend to deteriorate or 
show no change, while those with initially low 
scores tend to improve these scores regardless 
of the length of time involved in the In-ther- 
apy period. It can now be stated in relation 
to Hypothesis 2 that longer therapy is not re- 
lated to more improvement on either test. 
Hypothesis 3 that longer waiting would not 
be related to more improvement is confirmed 
for the Q adjustment score, but not for the 
TAT, where longer waiting was found related 
to more improvement regardless of original 
status. 


Testing Hypothesis 4 


Hypothesis 4 states that Ss seen by experi- 
enced therapists will improve more on the 


TABLE 6 


RELATION OF CHANGE IN THERAPY TO 
EXPERIENCE OF THERAPIST 


(Change while in therapy) 


Q Adjustment TAT Score 
+ 0− + 0− 
Exper. 6 6 3 : 
Inexper. ri 3 i 5 
NS χ² = 7.245* 
* Significant at .01 level. 




two test measures than those seen by inex- 
perienced therapists. To determine if there 
were any bias in the selection of the cases 
seen by those two groups, the direction of 
change while waiting was examined. The ex- 
perience of the therapist-to-be-seen was found 
to be unrelated to changes before therapy 
begins. Table 6 shows the relation of the di- 
rection of change while in therapy to the ex- 
perience of the therapist. Before the very 
high degree of significance in Table 6 is taken 
as clear evidence of the positive effect of ex- 
perienced therapists on the TAT changes of 
their patients, several other factors must be 
explored for their possible interaction. 

1. Since length of In-therapy time showed 
a nonsignificant trend (.10) in the direction 
of longer cases changing negatively on the 
TAT, the first question of this kind is: are 
the experienced therapists seeing more cases 
that waited a shorter time to begin therapy 
and so had a shorter In-therapy period and a 
bias toward positive change? This was found 
to be a nonsignificant relationship in a chi 
square test. 

2. Since Pre-therapy status on the TAT 
was found to be negatively related to In-ther- 
apy direction of change (see Table 3), are 
the experienced therapists perhaps seeing more 
cases with initially low scores? This too was 
found not to be the case using a chi square 
test. 

An on-diagonal, off-diagonal analysis con- 
firmed these findings. The relation of the ex- 
perience of the therapist to the direction of 
the TAT change was not significantly affected 
by the Pre-therapy status nor by the length 
of time in therapy. It seems clear that the ex- 
perience of the therapist is the most powerful 
influence of those investigated here in produc- 
ing positive change in the patient’s TAT men- 
tal health rating during therapy. 

Up to this point then, it can be said that 
there were two conditions under which the 
TAT mental health rating improved signifi- 
cantly: (a) without therapy, where the wait- 
ing period before beginning therapy was 8 to 
24 weeks (Long-wait), and (b) with therapy 
where the patient was seen by an experienced 
therapist. As far as the TAT is concerned, it 
seems a reasonable conclusion that over the 
time periods involved here, no therapist at 
all was better than an inexperienced one. 
Whether the gains made while waiting would 
have been sustained if no therapy had been 
entered into remains an open question, but a 
comparison of the mean changes under these 


TABLE 7 


MEAN SCORES FOR EXPERIENCED AND INEXPERIENCED GROUPS 


            Pre-    Pre-       In-        Post-        t          t 
            Wait    Therapy    Therapy    Therapy    Pre-In    Pre-Post 
oee =CO*O*C« yi q 
Exper. A 
oN ‘es 390 46.0 a 
ae x Tnexper. : 
7 2.163% 
| va 3 a 3281 4 40.25 =e eal 
| TAT Scores 
2.58 2.98% 
ie A e 1027 i 12.72 l 2.190* 
5 age 80 11.40 
X 7 n Da or i 10.25 pa ua 


* Significant at .05. 
** Significant at .02. 
conditions confirms the above conclusion. 
The mean waiting period improvement for 
the Long-wait group was +2.9. The mean 
in-therapy time improvement for those seen 
by experienced therapists was +2.0. The t 
between these, .677, was not significant. The 
mean In-therapy change for those seen by 
inexperienced therapists was −1.4. The t be- 
tween this and the improvement of the Long- 
wait group is 3.014, significant beyond the 
.01 level. Is the conclusion, then, that there is 
no difference between spontaneous remission 
and therapy change brought about by an 
experienced therapist and that inexperienced 
therapists have, if anything, a deleterious ef- 
fect on their patients? As this conclusion 
would have strong implications, it is impor- 
tant that it be explored carefully. For this 
reason, the analysis was extended to the post- 
test data. 

Table 7 reports the mean scores for both 
tests at the four test points for those seen by 
the two groups of therapists. A t test analysis 
for the differences between these means was 
used. The significant improvement between 
the Pre-therapy and the In-therapy test on 
the TAT of those in the experienced thera- 
pists’ group has already been noted. A further 
result appeared to be a significant improve- 
ment on the Q adjustment score during this 
time of those in the inexperienced therapists’ 
group. However, when an on-diagonal, off- 
diagonal analysis was used to examine the re- 
lationship of Q change in therapy and thera- 
pist experience for high and low Pre-therapy 
Ss, it was found that the proportion (8/11 
on diagonal for low scorers) was significantly 
different from the proportion (4/11 off-diago- 
nal for high scorers) at the .05 level. Thus, the 
improvement on Q adjustment for Ss in ther- 
apy with inexperienced therapists is largely a 
function of bias in Pre-therapy status. The Ns 
for the comparisons between Pre- and Post- 
therapy scores were reduced to 19 by the loss 
of 3 cases; 1 from the experienced and 2 from 
the inexperienced group. The results here 
show that patients seen by experienced thera- 
pists improve significantly by the end of ther- 
apy on both measures, whereas those seen by 
inexperienced therapists failed to improve sig- 
nificantly on the Q score and, in fact, came 
close to a significant deterioration on the 
TAT. As has been noted, those who waited a 
long time improved significantly on the TAT, 
but there was no commensurate shift of the 
Q adjustment score. 

One difference between improvement with 
and without therapy has been found in this 
study. Those who waited a long time before 
beginning improved on the projective meas- 
ure of their mental health but did not incor- 
porate this into their self-picture as reflected 
by the Q score. Those who completed therapy 
with experienced therapists reflected their 
“deep” improvement on the conscious level 
in an improved self-description. Those com- 
pleting therapy with inexperienced therapists 
not only did not improve their self-picture, 
but bordered on a deterioration in health. 


SUMMARY AND CONCLUSIONS 


Twenty-two self-referred psychoneurotic pa- 
tients were tested at four points in time: (a) 
Pre-wait—on first being accepted as therapy 
cases and placed on the waiting list; (b) Pre- 
therapy—after a waiting time which varied 
in length; (c) In-therapy—after therapy had 
begun at a point equal to the waiting inter- 
val; and (d) Post-therapy. The instruments 
employed were the (a) Butler and Haigh Q 
sort, a self-descriptive test from which the Q 
adjustment score was computed, and (b) the 
TAT, a projective, from which a diagnostic 
rating was made. Four hypotheses were tested: 

1. There will be more change during a pe- 
riod of therapy than an equal period of no 
therapy. This was confirmed for the Q ad- 
justment score but not for the TAT. 

2. Longer periods of therapy will be asso- 
ciated with more improvement than shorter 
periods. This hypothesis was not confirmed 
for either test. 

3. Longer waiting periods will not be asso- 
ciated with more improvement than shorter 
periods. This was confirmed for the Q ad- 
justment score but not for the TAT. Those 
who waited longer did in fact improve on the 
TAT, and those with shorter waits did not. 
The long wait improvers appear to be a spon- 
taneous remission group with respect to the 
TAT. 

4. The fourth hypothesis that patients in 
therapy with experienced therapists would 
improve more in the In-therapy period than 
those in therapy with inexperienced thera- 
pists was confirmed for the TAT but not for 
the Q score. The fact that the positive effect 
of the experienced therapists was clearly in- 
dependent of the Pre-therapy level of the 
TAT score and of the length of time in ther- 
apy made a strong case for the special effec- 
tiveness of these therapists. This in-therapy 
improvement could not be attributed to a 
continuation of a spontaneous remission trend. 

The Post-therapy tests showed those in 
therapy with experienced therapists to have 
improved significantly on both tests, whereas 
those in therapy with inexperienced thera- 
pists had not improved significantly on 
either and, in fact, bordered on a sig- 
nificant decrease in health on the TAT. 

It would appear that this study could con- 
clude: 
1. Therapy affects adjustment based on 
self-description, whereas equal waiting time 
does not. 

2. A long waiting period for this type of 
patient improves the mental health picture as 
revealed by the TAT. 

3. Equally long therapy does not neces- 
sarily improve the mental health picture. 

4. An experienced therapist is effective in 
improving the mental health both as revealed 
by the TAT and as self-described. 

5. An inexperienced therapist is ineffective 
in improving the self-picture and may actu- 
ally be effective in decreasing health as 
revealed by the TAT. 

6. From the last two conclusions, it ap- 
pears that therapists have special effects, de- 
pending on the level of their experience. 
In recent years, the literature concerning 
the nature of normality in human personality 
has shown increasing support for an “idealist” 
conception of psychological normality. Nor- 
mality is viewed as an approximation to an 
ideal or combination of ideals, such as health, 
rationality, honesty, integration, maturity, or 
morality. This literature has justifiably criti- 
cized the traditional approaches to this prob- 
lem, i.e., a “symptom-free” conception, speci- 
fied as absence of pathological stigmata, or a 
“centrist” conception, specified either as the 
statistical average of normally distributed 
characteristics or as the balance between 
tendencies, which, in their extremes, are con- 
sidered pathological, like introversion and ex- 
troversion. This paper will present a critique 
of normative perspectives in personality the- 
ory, focusing on the “idealist” position (criti- 
cism of traditional approaches may be found 
in works by Hartman, 1939; Maslow, 1954; 
Mowrer, 1954; Shoben, 1957, and others), and 
propose an alternative to these approaches. 

A common starting point among authors 
proposing an idealist conception of normality 
is the recognition of the enormous extrabio- 
logical potential for variation inherent in hu- 
man beings as evidenced by complex cultural 
activity, symbolization, and altruism. They 
stress that these inherently human qualities 
deserve more serious attention than they 
are getting from students of personality and 
should be embodied in the concept of nor- 
mality. Shoben (1957, p. 185), for example, 
states, “the fundamental contention advanced 
here is that behavior is ‘positive’ or ‘integra- 
tive’ to the extent that it reflects the unique 


1 Part of this paper was prepared while the writer 

attributes of the human animal.” As the 
“unique attributes” of humans are extremely 
varied and frequently repugnant, the writers 
go on to specify what they mean. Maslow 
(1954, Ch. 12) lists 17 characteristics of self- 
actualizing people including acceptance of 
self, others, and nature; spontaneity; prob- 
lem centering; the quality of detachment; 
autonomy; democratic character structure; 
creativeness; and resistance to enculturation. 
Bond (1952) mentions freedom to focus en- 
ergy on main purposes, ability to work and 
love with ease, and to achieve happiness and 
efficiency somewhat in proportion to circum- 
stances. Shoben (1957) writes of self-control, 
personal responsibility, social responsibility, 
democratic social interests, and ideals. Mc- 
Laughlin (1950) enumerates emotional inde- 
pendence and self-reliance; balance between 
giving and getting; relative freedom from ego- 
tism, inferiority feelings, and excessive com- 
petitiveness; conscience; genital sexuality; 
constructive aggressiveness; solid sense of re- 
ality; flexibility; and adaptability. 

Several major difficulties and limitations are 
inherent in this approach. First, it establishes 
as absolute desiderata patterns of behavior 
and values that grow out of a particular cul- 
tural context during a particular period of its 
history. It assumes that what is good for us 
is good for everybody everywhere. Several au- 
thors explicitly recognize that these criteria 
of normality involve assumptions in the na- 
ture of value judgments. Other writers do not, 
but cultural anthropology and history pro- 
vide evidence that freedom from inferiority 
feelings or genital sexuality or constructive 
aggressiveness, for example, are not and have 
not been universally considered desirable goals 
for human behavior. 


Second, this approach generally does not 
take into account the circumstances under 
which the personality is to function. It im- 
plies that variations in circumstances do not 
determine or even influence the functioning 
of the personality and precludes the study 
of how behavior and circumstance covary. 
Third, it largely assumes that postulated cri- 
teria of normality are positively correlated, or 
can be positively correlated. As to whether 
they are correlated, reference can be made to 
several attempts to study “normal, healthy 
individuals.” Maslow (1954) and Bond 
(1952), for example, went to great efforts 
to obtain suitable examples. Bond guarded 
against his own selection biases by studying 
almost complete populations of student coun- 
cil members and concluded nonetheless that 
57% could benefit from ideal mental health 
services. Maslow grudgingly devotes two 
pages to the imperfections of self-actualizing 
people. Roe’s (1950) studies of eminent sci- 
entists (to be sure, using criteria of success 
and not of normality) amply indicate that 
nonnormal characteristics regularly accom- 
pany achievement, creativity, and success. 
Redlich (1952) reviews several studies of 
“normal” populations in which “rich patho- 
logical” material was found. 

Perhaps even more significant is the ques- 
tion of whether it is reasonable to anticipate 
that characteristics of normality such as those 
under discussion can be positively correlated. 
Consider, for example, such positively valued 
attributes as love and independence. Ideal 
patterns of loving devotion and enduring com- 
mitment to one’s beloved, coupled with inde- 
pendence, assertiveness, and freedom from in- 
fantile demands of succorance, are generally 
accepted as positive characteristics of the 
normal; but the conflictual nature of multiple 
human needs and aspirations is obscured by 
an approach which does not recognize that 
love and commitment necessarily and inevi- 
tably restrict independence. 

Finally, a value based conception of nor- 
mality tends arbitrarily to slant research so 
as to preclude investigations of the relation- 
ships between cel Ae of behavior 
i tterns of behavior. 
Eros ne ue are not to be construed as 


arguments for banning values from considera- 
tion in personality theory. The question is 
where and how to deal with them in a way 
that is scientifically appropriate. Values and 
value systems may be studied objectively as 
variables, and the task of psychology is to 
assess their relationships with other variables 
and with behavior. Such an approach does not 
imply that every moral and ethical system is 
as good or as workable as any other nor that 
the search for the “good” in either individual 
or culture should be minimized, but it does 
indicate a way in which such problems may 
be considered at levels of discourse which are 
germane to science. In this manner science is 
not faced with questions it cannot answer. 
It is significant that the proponents of the 
point of view under discussion seem to be all 
for the good life but are rather vague as to 
what they mean by it, while theologists and 
ethicians are quite explicit and vigorous de- 
fenders of the positions they take. The ap- 
parent proclivity for tolerance, understand- 
ing, and minimization of differences, charac- 
teristic of this school of thought (with few 
exceptions), conceals and overrides hard dif- 
ferences in the approach to life embodied in 
different value systems.² Yet, to consider 
values as variables seems definitely uncon- 
genial to those who hold that the values they 
profess are the only values worthy of con- 
sideration and study. For example, Shoben 
(1957), while arguing that a normative ap- 
proach based on a statistical average pre- 
cludes the criticism of a culture, confuses a 
conflict of values with a scientific contro- 
versy when he disdains the possibility that 
the storm trooper would be considered the 
prototype of integrative adjustment in the 
Nazi culture. Disapproval of Nazi culture has 
nothing to do with understanding how a man 
raised in Nazi Germany might be expected to 
become a storm trooper. Science can do no 
more than predict the consequences of a 
value system. Political, religious, or cultural 
conflicts are fundamentally not resolved in 
the laboratory or on the couch. 

2 See Herberg (1957) for a critical discussion of 
the ethical inadequacies of theoretical orientations in 
psychology which minimize differences in value al- 
ternatives and preclude the conceptualization of the 
realities of moral conflict in the individual. 

Writers adopting an idealist approach con- 
cerning psychological normality take a posi- 
tion that is steeped in contradiction. On the 
one hand, their conception of normality has 
an absolutist quality that is laced with cul- 
ture-bound value assumptions. On the other 
hand, they show marked reservations concern- 
ing a conception of normality based on social 
conformity and they abhor any taint of abso- 
lutism. Maslow (1954), for example, specifies 
autonomy, independence of culture and envi- 
ronment, and resistance to enculturation as 
characteristics of the self-actualizing (normal) 
person. In doing so, Maslow evidently means 
to divorce himself from purely statistical con- 
ceptions of social conformity. Certainly he 
does not mean compulsive autonomy, or com- 
pulsive resistance to enculturation, yet his 
whole perspective, shared with many others, 
including Fromm (1941), Mowrer (1954), 
and Shoben (1957), is one of meeting a speci- 
fied ideal, and this inevitably involves some 
form of cultural bias, some form of limita- 
tion, some constriction of freedom. 

In a discussion of this contradiction, Knight 
(1946) deals with the problem by (a) speci- 
fying that freedom is largely a subjective 
matter; (b) showing that many subjective 
feelings of freedom are spurious, and (c) 
concluding that genuine freedom and mental 
health involve the sense of inner compulsion 
and conformity based on a rational apprecia- 
tion by the individual of the assumptions, 
conditions, motives, and values in his life. It 
is this bedrock of rationality to which all of 
these writers ultimately arrive. 

Kubie (1954) takes perhaps the extreme 
position and asserts that the sole criterion for 
normality is the predominance of conscious 
and preconscious over unconscious factors in 
the determination of an act. Since Kubie 
rules out consciousness, per se, as a criterion 
(a delusion is conscious but not normal), he 
can only be interpreted to mean that rational 
mental behavior is normal. This seems to be a 
satisfactory solution to the problem. It posits a 
criterion which is neutral as far as value is 
concerned and can conceivably lead to objec- 
tive assessment procedures. However, such a 
point of view seemingly would label all non- 
rational behavior and motivation as abnor- 
mal and in doing so would devalue and im- 
plicitly condemn any manifestation of emo- 
tional spontaneity. Arguing, as Kubie does, 
that it is the origin of the act and not its 
subsequent automatic execution that must be 
assessed, would nonetheless require, at some 
point, meticulous evaluation of matters of 
taste, preferences in art, religious values and 
beliefs, and love objects in order to fulfill this 
criterion. Kubie’s disclaimer (1954, p. 187), 
“Here is no unreal fantasy of a ‘normal’ in- 
dividual out of whom all the salty season- 
ing of secondary unconscious motivations has 
been dissolved,” when followed in the next 
paragraph by “What was unconscious . . . 
must become accessible enough to self-inspec- 
tion to become conscious when needed,” sug- 
gests that his conception of “salty seasoning” 
is not so spicy after all. 

When the search for the normal leads to 
the elevation of the rational, a host of prob- 
lems concerning the nature of reason, logic, 
and rationality may be overlooked. These in- 
clude the distinctions between a rational proc- 
ess and rational content (Hartman, 1956; 
Reider, 1950), the illogical and prelogical fac- 
tors which contribute to the development of 
intelligence and reason (Piaget, 1953), the 
role of unconscious and regressive factors in 
creativity (Kris, 1952), and the process of 
change in form and content of knowledge and 
logic through a history which has not ended 
and whose future course is unpredictable by 
any reliable means. 

Finally, and this criticism is germane to 
practically every attempt to define normality, 
it assumes that rational, conscious, and pre- 
conscious processes, once established, carry 
on of their own accord without environmental 
support and stimulation. Recent studies on 
the effects of sensory deprivation (Hebb, 
1955), the classic studies of perception under 
distorted conditions (Lawrence, 1949), and 
many studies of behavior under stress (Laza- 
rus, Deese, & Osler, 1952) indicate that this 
is simply not the case. To specify conscious- 
ness or rationality as the essence of normality 
merely begs the question. 

A truly objective, scientific solution to the 
problem of defining normality when one is 
concerned with something more than averag- 
ing existing traits and behaviors, when one 
seeks to embody in this definition a particu- 
lar conception of what man may be, is im- 
possible to attain. As soon as values, sym- 
bols, altruism, and the like are specified in the 
definition of normality, objectivity yields to 
time-bound and culture-bound assumptions 
which are not subject to scientific verification 
and which are of limited generality. That 
each person deliberately or unconsciously has 
a scheme of values, beliefs, and ethics which 
imparts to him his uniquely human character, 
that, indeed, to be a human requires such a 
scheme, is a proposition that does not estab- 
lish a particular system of values or logic as 
an absolute criterion of normality. 
Psychology should stop asking the question 
“What is normal?” Personality theory re- 
quires another orientation. It seems to me 
that, for purposes of scientific theory con- 
struction and also for practical clinical pur- 
poses, a rather different and potentially more 
fruitful approach can be taken. This would 
entail the viewpoint that human beings have 
a variety of potentialities, and that the 
achievement of certain potentialities may en- 
tail certain limitations and that achievement 
and limitation vary with conditions. 
Consider: it is the distant, withdrawn per- 
sonality who is unable to form close relation- 
ships who, it has been said, stands the best 
chance of enduring the stress of a prolonged 
flight into space. The suspicious mistrusting 
individual, difficult to get along with, may, 
because of these very characteristics, be an 
excellent tax inspector, research scientist, or 
counterintelligence agent. The vain narcis- 
sistic personality can become a great enter- 
tainer. Reverse the point. Is great acting pos- 
sible without narcissism? Is creative research 
possible without mistrust? Is prolonged iso- 
lation tolerable for a gregarious individual? 
The legitimate task of science is not to stand 
back smugly and label the various distribu- 
tions of energy and effort as abnormal or nor- 
mal. The task of theoretical psychology is to 
provide the concepts and principles which 
would enable us to comprehend achievements 
and failings and their interrelationships. 
Such an approach focuses on man’s be- 
havior as the data available for scientific 
study. Existing terms and concepts such as 
repression, motive, conformity, defense mecha- 
nism, and habit need not be discarded as 
conceptual tools, but with this perspective 
the scientist does not imply damnation or 
praise when he uses them to characterize a 
psychological process. This approach sets no 
limits on the sophistication of the theories 
developed to encompass the facts of behavior 
but it lessens the likelihood of confounding 
facts and theories and value assumptions. It 
avoids positing criterion value systems but 
freely admits the incorporation of value sys- 
tems as variables contributing to or limiting 
different potentialities. It sets the stage for 
research into the development and exercise 
of man’s potentialities for what, on other 
grounds, may be considered good or evil, but 
does not prejudge or condemn. It permits the 
description and understanding of man as he 
was, is, and can be. 

Several features of the present position have 
been elaborated in the literature. Marmor and 
Pumpian-Mindlin (1950) and Jahoda (1953), 
for example, have stressed the interaction of 
personality and situational variables in the 
determination of behavior. Hartman (1951) 
points out that conflicts are inherent to hu- 
man existence and (1939) that rationality 
and freedom do not necessarily imply health, 
while regression and defense are not neces- 
sarily maladaptive. Schafer (1954, p. 172) 
writes, “The growth and organization of per- 
sonality appears to require that certain of 
each individual’s potentialities be cultivated 
and others neglected or even plowed under.” 
Marzolf (1947) concludes, “In fact there is 
no need for the term ‘mental disease.’ ” Sev- 
eral writers (Darrah, 1939; Jones, 1942; 
Wile, 1940) are doubtful about the existence 
of “normal” personalities. But in one way or 
another, they all (with the exception of Wile, 
1940) cling to the traditional perspective and 
express or reflect some concept of normality. 

If the framework proposed is theoretically 
acceptable, objections may nevertheless be 
raised on practical grounds. If no valid con- 
ception of normality is possible, is treatment 
ever indicated and, if so, toward what goals 
is treatment directed? At present, certain 
personality patterns can be labeled abnormal 
and treatment is presumably directed toward 
changing abnormal personalities or patterns 
of behavior into normal ones. The definition 
of what is normal is considered critical to 
clinical practice and its elimination apparently 
leaves an enormous conceptual void. 
Actually, there are several unwarranted as- 
sumptions implicit in the view that a concep- 
tion of normality is central to clinical prac- 
tice. First is the assumption that treatment is 
initiated with the finding of abnormality, 
however defined. Operationally, treatment is 
initiated when motivation from some source 
exists for treatment and when therapists are 
motivated and prepared to undertake treat- 
ment. There is a reciprocal relationship be- 
tween professional personnel and patient pool 
in that the professional proffers the expecta- 
tion of relief from distress and contributes to 
the development of this anticipation in po- 
ce of some relevant form 
seems to be one of the 
ailure in the treatment of 
by predominantly middle 
and upper-class Professionals (Hollingshead & 
e point is that the initia- 


zed or treated psychothera- 
plies that he is “normal” js 
prig.) Furthermore, as al- 


and confi: : 
expense of oth onflicts, this may be at the 
within the personality seems indicated in certain
types of schizophrenia, in severely schizoid
personalities or in antisocial personalities.
Very many of these cases require that a way 
of life be taught. In other words, certain cases 
require far more than reorganization of the 
personality; they require education itself and 
the inculcation of values. Once again, it seems 
to me that science can contribute to an un- 
derstanding of the educative process but that 
the content of this education is largely a mat- 
ter that must be decided on other grounds. 
Here, the articulation and specification of 
the values under which or toward which the 
therapist operates is of great importance, but 
these choices are not scientific matters. We do 
the best we can much like any parent living 
at a particular time and in a particular place. 

This point, that values, symbols, and the 
like are involved in psychotherapy (Hacker, 
1945), brings the discussion full circle. Davis 
(1938, p. 65) in a little cited analysis of the 
mental hygiene movement has argued that the 
social function of the mental hygienist “is not 
that of a scientist but that of a practising 
moralist in a scientific, mobile world.” It
seems to me that what Davis is saying is that 
professionals dealing with “ways of life” in- 
evitably have to rely on some consistent 
framework to fill gaps and resolve conflicts 
where they exist and that this framework 
willy-nilly will be that prevalent in the cul- 
tural group to which the professionals belong. 
If this issue may be obscured in the treatment 
of adults, it is blatantly apparent and quite 
inevitable in the treatment of children and in 
the advice given to parents about techniques 
of child rearing (Erikson, 1950). As a con- 
sequence, it seems likely that psychological 
treatment is a conservative force in any so- 
ciety no matter how radical the theories about 
psychotherapy sometimes appear to be. But 
the implicit or explicit commitment to values 
involved in the practice of psychotherapy is 
an issue separate and distinct from a scien- 
tifically useful specification of the concept of 
normality. The future development of a sci- 
entific understanding of personality depends 
on a less restrictive perspective which, ironi- 
cally, should benefit clinical practice as well. 

A shift in perspective in personality theory
from one which looks upon man as normal or
abnormal or somewhere in between to one
which views him as having varying potentiali- 
ties and limitations under varying conditions, 
seems to offer an orientation more viable, 
secure, and fruitful than that prevailing at 
present. The viable aspect of this position 
comes out of its capacity to conceive of man 
in different times and different circumstances. 
The security features result from the elimina- 
tion of the necessity for the clinician to ra- 
tionalize about his own normality (Reider, 
1950), permitting his work to be evaluated 
objectively in terms of its results. Finally, the 
fruitfulness of this approach rests on its reluctance
to condemn, to label abnormal, and
its emphasis on the potentialities of every person
under the proper conditions. It sets the
stage for more extensive investigations into 
the nature of these potentialities and the con- 
ditions of their achievement, all the while 
keeping before us the pitfalls and limitations 
that such achievement may involve. 


One of the relatively unexplored areas in
objective personality measurement concerns 
age differences in test scores. Many manuals 
for personality tests state that an age correc- 
tion must be made in the raw scores before 
the scores can be referred to the test norms, 
or norms are given for different age groups 
without any apparent interest in what these 
age differences mean in terms of possible 
changes in personality integration. A second 
question relates to possible age differences in 
the internal structure or interrelationships 
among the scales included in the personality 
inventory. Most of these inventories have 
been developed, by factor analysis or some 
other statistical technique, on relatively young 
groups of subjects (the traditional college 
sophomore being widely used), and the scale 
scores are subsequently interpreted under the 
assumption that each scale means the same 
thing or measures the same underlying per- 
sonality dimension at all age levels. This ap- 
pears to be a hazardous assumption in the ab- 
sence of data on the stability of personality 
test scores over the mature age range. 

A personality inventory that is widely used
with nonpsychiatric populations is the Guilford-Zimmerman
Temperament Survey (Guilford
& Zimmerman, 1949) which contains 10
scales: General Activity (G), Restraint (R),
Ascendance (A), Sociability (S), Emotional
Stability (E), Objectivity (O), Friendliness
(F), Thoughtfulness (T), Personal Relations
(P), and Masculinity (M). These scales are
revised and condensed versions of 13 scales
that had been developed by factor analyses
of test items and published in previous inventories
(Guilford, 1940; Guilford & Martin,
1943a, 1943b). Although the scales presumably
measure 10 orthogonal or independent
first-order personality dimensions, scores from
certain scales typically show rather large in- 
tercorrelations. For example, correlations be- 
tween the A and S scales usually are about 
.68 and between the E and O scales average 
around .66. However, the correlations among 
other scales are quite low. This suggests that 
scores from these scales may be organized at 
a higher level of personality into a smaller 
number of more general factors describing 
broader syndromes of behavior. Factor analy- 
ses of the correlations among the GZTS scales 
would provide evidence as to the number of 
such factors involved and an empirical method 
of describing these syndromes. However, as 
pointed out by Guilford and Zimmerman 
(1956, p. 2), these second-order or “super”
factors represent factors common to the inventory
scores and do not necessarily represent
the second-order organization among the
original first-order inventory factors which 
the scores were designed to measure. 

The research reported below was designed 
to provide evidence as to the factor organiza- 
tion of the GZTS scores and to investigate 
possible age differences in both the average 
scale scores and the factor loadings of the in- 
dividual scales. 


PROCEDURE 


Subjects. The Ss were 400 men whose GZTS scores 
were available from the files of a local psychological 
service agency.1 Over 800 male and female Ss were 
tested during the year 1953, and the cases were ran- 
domly selected from this larger group with the re- 
strictions on randomness being: (a) only male Ss 
were included; (b) 100 Ss were selected from each 
of the following age (at time of testing) ranges— 
20-29, 30-39, 40-49, and 50-59 years; and (c) the 


1 The author is indebted to James Horgan and the 
research staff of Psychological Service of Pittsburgh 
for their kindness in making these data available. 
TABLE 1

AGE DIFFERENCES IN MEAN SCORES OF MEN (N = 400)
ON THE GUILFORD-ZIMMERMAN TEMPERAMENT SURVEY

                     Age Groups                       Significance of Age Differences (F)

GZTS     20-29       30-39       40-49       50-59     Error
Scale  Mean  SD    Mean  SD    Mean  SD    Mean  SD     MS    Groups  Linear  Quad.  Cubic
G 18.86 4.69 18.87 5.06 18.85 17.38 5.16 2.15 3.90* 2.15 
R 18.87 4.21 20.18 4.32 20.64 20.66 4.00 433" 10.30% 254 ia 
A 19.83 4.97 17.61 16.74 5.65 6a7 © 19.33" 01 Se 
S 23.77 501 21.84 20.60 6.22 6.56 19.56 11 a 
E 2187 475 20.96 8 63 47 ‘91 D 
O 2187 475 20.96 1.87 3.32 ‘60 is 
F 17.39 473 17.4 1.04 a5 26 2.70 
9.63 4.28 20.53 17.47 1.18 241 10 103 
Le P 21.61 5.38 22.11 20.31 341* 1.04 195 7.23% 
} 20.96 3.20 19.54 11.55 5.43% 15.90% 38 “00 
* p < .05
** p < .01
age distribution within each age decade was kept as
rectangular as possible. The mean ages for the four
groups of Ss were 24.8, 34.5, 44.2, and 54.0, while
the standard deviations of ages within each group
were 4.4, 2.9, 3.0, and 2.5. The Ss had been referred
to the agency by local business firms for psychological
testing for employment, promotion, or company
personnel survey purposes and had been given
a large battery of aptitude, interest, and temperament
tests including the GZTS.

Method. The raw scores from each of the 10 GZTS
scales were subjected to a one-criterion (between-within)
analysis of variance with the age of the Ss
(four decades) being the independent variable. The
between-age groups sums of squares were further
analyzed into linear, quadratic, and cubic components
by the method of orthogonal polynomials
(Grant, 1956) and tested for significance. The F
ratio test of the between-groups sums of squares
tests for the presence of significant differences among
the GZTS score means of the age groups, but does
not specify the form of the functional relationship
between age and the scale means. The further breakdown
of the sums of squares tests whether the relationship
is a straight line (linear), simple curvilinear
(quadratic), or more complex (cubic) change
in means with increasing age.

The scores within each age group were intercorrelated
by the usual product-moment method, and
each of the four matrices of correlations was factored
by the complete centroid method, with five
factors being extracted from each matrix. Each successive
factor was tested for significance using Humphrey’s
rule (Fruchter, 1954, pp. 79-80), and the apparently
significant factors were rotated to orthogonal
simple structure using the normalized varimax
criterion (Kaiser, 1958).2

2 The author's appreciation is extended to Gary
Lotto and William B. Kehl of the University of Pittsburgh
Computation and Data Processing Center for
making their facilities available for this phase of the
statistical analysis.


RESULTS


A summary of the analyses of variance of 
the GZTS scales can be found in Table 1 
which gives the means and standard devia- 
tions of the age groups on each scale, the 
within-age groups (error) mean square from 
the analyses of variance, and the F ratio tests 
of significance of the between-age groups 
mean square and its linear, quadratic, and 
cubic components. The age differences in 
mean scores were significant at the .01 level 
for four scales (R, A, S, and M), and one 
additional scale (P) was significant at the 
.05 level. The linear trend in the means was
significant at the .01 level for four scales (R, 
A, S, and M) and at the .05 level for the G 
scale. The cubic trend was significant at the 
.01 level for the P scale. None of the quadratic
components was significant even at the
.10 level. The means of the A, S, and M scales
show a linear (straight line) decline with in- 
creasing age, while the G scale is essentially 
level until age 50 and shows a sharp drop be- 
tween the 40-49 and 50-59 age groups. The 
R scale mean scores increase with age with 
some deceleration in the rate of increase. The 
P scale means tend to increase with age, but 
the increase is irregular, with the largest P 
scale mean found for the 30-39 age group. 
Scales E, O, F, and T show no systematic 
variation with age. 
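
The trend breakdown described under Method can be sketched in a few lines. This is a minimal illustration, not the author's computation: it assumes four equally spaced groups of 100 Ss each, uses the standard orthogonal polynomial coefficients for four levels, and takes the Restraint (R) means as they can be read from the damaged Table 1, so the numbers should be treated as illustrative.

```python
# Orthogonal-polynomial trend decomposition for four equally spaced,
# equal-sized groups. The R-scale means below are read from the scanned
# Table 1 and are used only as an illustration.

def trend_components(means, n_per_group):
    """Split the between-groups sum of squares into linear, quadratic,
    and cubic parts using the standard coefficients for four levels."""
    coefficients = {
        "linear": (-3, -1, 1, 3),
        "quadratic": (1, -1, -1, 1),
        "cubic": (-1, 3, -3, 1),
    }
    components = {}
    for name, coefs in coefficients.items():
        contrast = sum(c * m for c, m in zip(coefs, means))
        components[name] = n_per_group * contrast ** 2 / sum(c * c for c in coefs)
    return components


def between_groups_ss(means, n_per_group):
    """Between-groups sum of squares for equal group sizes."""
    grand_mean = sum(means) / len(means)
    return n_per_group * sum((m - grand_mean) ** 2 for m in means)


r_means = [18.87, 20.18, 20.64, 20.66]  # ages 20-29, 30-39, 40-49, 50-59
parts = trend_components(r_means, n_per_group=100)
total = between_groups_ss(r_means, n_per_group=100)

for name, ss in parts.items():
    print(f"{name:9s} SS = {ss:8.2f}")
print(f"between   SS = {total:8.2f}")
```

Each component carries one degree of freedom, so dividing it by the within-groups (error) mean square gives the corresponding F ratio. The three components add up to the between-groups sum of squares, which is what makes the decomposition orthogonal.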

The first two centroid factors were signifi- 
cant at the .01 or .05 levels within all four 
age groups, while the third factor was significant
at the .01 level for the 20-29 and
30-39 groups but was not significant for the 
two older groups of Ss. Neither the fourth 
nor the fifth factor approached significance at 
the .05 level within any age group. For com- 
parability between age groups, the first three 
factors were retained in the subsequent nor- 
malized varimax rotations with the remaining 
two factors being dropped. The percentages of 
variance accounted for by the first three or- 
thogonal factors within each group were 54.1, 
51.3, 45.1, and 44.3. 
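
For orthogonal factors, the percentage of variance attributed to a factor is the mean squared loading across the scales, and a scale's communality is the sum of its squared loadings across the factors. The sketch below applies these two standard definitions to values read from the scanned Table 2 (the Fr column for the 40-49 group and the Sociability row for the 20-29 group); the function names are illustrative, and the readings are an assumption because the scan is damaged.

```python
# Loadings are written as in Table 2, with decimal points omitted.
# Fr-factor loadings for the 40-49 group, in scale order
# G, R, A, S, E, O, F, T, P, M (as read from the scanned table).
fr_loadings_40_49 = [-9, -7, 20, 10, 37, 73, 55, -12, 74, 34]


def percent_variance(loadings_hundredths):
    """Percentage of total variance carried by one orthogonal factor."""
    squared = [(x / 100) ** 2 for x in loadings_hundredths]
    return 100 * sum(squared) / len(squared)


def communality(loadings_hundredths):
    """Sum of squared loadings of one scale across the retained factors."""
    return sum((x / 100) ** 2 for x in loadings_hundredths)


pct_fr = percent_variance(fr_loadings_40_49)
h2_sociability_20_29 = communality([10, 85, -11])  # Fr, SA, EI loadings of Scale S

print(f"Fr factor, ages 40-49: {pct_fr:.1f}% of variance")
print(f"Scale S, ages 20-29: communality = {h2_sociability_20_29:.2f}")
```

Both results agree with the corresponding Table 2 entries (17.1 and 74), which gives a simple internal check on the legible parts of the table.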

The rotated factor loadings and communali- 
ties for each GZTS scale can be found in 
Table 2. The approximation to simple struc- 
ture was moderately successful with slightly 
over half of the loadings having absolute 
values less than .20. The first factor is char- 
acterized by consistently high loadings on the 
O, P, and F scales for all age groups and high 
to moderate loadings on the E and M scales 
only for the younger groups. The second fac- 
tor shows consistently high loadings on Scales 
S and A and moderate loadings on Scale G for 
all age groups and gradually increasing load- 
ings on Scale E as the age of the Ss increases. 
The third factor loads Scales R and T only
with no consistent loadings on any other 
scale. The O scale has a small but consistent 
loading on the second factor, and Scales T 
and R have slight negative (T) and positive 
(R) loadings on the first factor. The similarity
between comparable factors appearing
in the four analyses was quantified by the 
method suggested by Wrigley and Neuhaus 
(1955). The indices of factor similarity for 
the first rotated factor ranged from .98 to 
.83, with the median of the six indices being 
.93. Indices for the second factor ranged from 
.96 to .68 with the median also being .93, 
while the range for the third factor was .96 
to .41 with a median of .85. The indices were
consistently lowest when factors were compared
between the two groups at the extremes
of the age continuum, 20-29 and 50-59, a result
that is probably attributable to the changing factor
composition of Scales E and M.3
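
The index associated with Wrigley and Neuhaus is commonly given as the coefficient of congruence: the sum of cross-products of two loading vectors divided by the square root of the product of their sums of squares. The sketch below assumes that this is the index used here; the loading vectors are hypothetical and are not taken from Table 2. It also shows why four age groups give six indices per factor.

```python
import math
from itertools import combinations


def congruence(a, b):
    """Coefficient of congruence between two factor-loading vectors."""
    if len(a) != len(b):
        raise ValueError("loading vectors must have the same length")
    numerator = sum(x * y for x, y in zip(a, b))
    denominator = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return numerator / denominator


# Hypothetical loadings of four scales on "the same" factor in two samples.
younger = [0.10, 0.80, 0.75, 0.30]
older = [0.05, 0.70, 0.72, 0.60]
similarity = congruence(younger, older)

# Four age groups yield six pairwise comparisons for each factor.
age_groups = ["20-29", "30-39", "40-49", "50-59"]
pairs = list(combinations(age_groups, 2))

print(f"congruence = {similarity:.2f}")
print(f"{len(pairs)} pairwise comparisons: {pairs}")
```

Unlike a product-moment correlation, the index does not subtract the mean loading, so two factors with uniformly high loadings count as similar even when their profiles are flat.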


DISCUSSION


The first factor found in the factor analyses 
of the GZTS scales included the scales of Ob- 
jectivity, Friendliness, and Personal Relations. 
These three scales are GZTS versions of the 
Objectivity, Agreeableness, and Cooperative- 
ness scales developed by Guilford and Mar- 
tin (1943b) from a factor analysis of items 


3 Tables giving the original correlation matrices,
unrotated factor loadings, transformation matrices, 
tests of the significance of the factors, and indices 
of factor similarity have been deposited with the 
American Documentation Institute. Order Document 
No. 6120, remitting $1.25 for microfilm or $1.25 for 
photocopies. 


TABLE 2

ROTATED ORTHOGONAL FACTOR LOADINGS (DECIMAL POINTS OMITTED) OF THE GUILFORD-ZIMMERMAN
TEMPERAMENT SURVEY SCALES FOR FOUR AGE GROUPS OF MEN
(N = 100)

GZTS           Age 20-29             Age 30-39             Age 40-49             Age 50-59
Scale        Fr   SA   EI   h²     Fr   SA   EI   h²     Fr   SA   EI   h²     Fr   SA   EI   h²
G           -03   54   06   30     ??   55  -20   36    -09   38  -13   17    -07   43   00   19
R            22   03   68   52     11  -07   65   44    -07  -11   37   15     31  -03   52   36
A            16   77   20   67     15   84   09   73     20   82   16   75     12   73  -07   55
S            10   85  -11   74     11   73   00   54     10   85   08   74     17   69  -32   60
E            77   32  -06   70     70   48   29   81     37   46  -32   45     35   70   15   64
O            90   19  -02   84     78   34  -01   72     73   19  -41   75     72   30  -04   62
F            57  -12   13   36     64  -14   34   55     55  -34  -31   51     77  -02   15   62
T           -18   09   69   51     34   05   49   35    -12   19   56   36    -06  -04   44   20
P            62   26  -04   46     51   20  -12   31     74   02  -05   55     74   08   03   55
M            53  -13   02   30     53   10  -16   32     34   07  -02   12     22   22  -14   12
Percentage
of variance 25.1 18.5 10.2 53.8   22.2 19.6  9.5 51.3   17.1 19.6  8.7 45.4   19.8 18.3  6.4 44.5




Factor Structure of GZTS 


reflecting “paranoid disposition.” Thurstone 
(1951) reanalyzed the intercorrelations be- 
tween 13 Guilford scales (including Objec- 
tivity, Agreeableness, and Cooperativeness) 
reported by Lovell (1945) and found nine 
second-order factors, one of which (Friendli- 
ness) had loadings only on the three Guil- 
ford-Martin scales. Guilford and Zimmerman 
(1956) reanalyzed Lovell’s basic data, sepa- 
rating the items on the 13 scales into 70 
short subscales, and factor analyzed the matrix 
of intercorrelations among the 70 variables. 
Three of the rotated orthogonal factors could 
be clearly identified as Objectivity, Agreeable- 
ness, and Cooperativeness. Although these 
factors were orthogonal, of the 26 subscales 
that were significantly loaded for at least one 
of these factors, 35% also had significant 
loadings on at least one of the other factors 
in this trio. Klugh and Bendig (1959) cor- 
related the 10 GZTS scales with the ACE 
test and found that the O, F, and P scales 
were the only scales significantly correlated 
with the measure of general verbal intelli- 
gence. The evidence strongly indicates that 
the O, F, and P scales consistently define a 
personality factor that may be called Friendliness
(Fr).

The second GZTS factor was defined by 
the Sociability, Ascendance, and (to a lesser 
extent) General Activity scales. Thurstone 
(1951) also found a second-order factor
loaded only on the S and A scales (Social
Leadership) while Guilford and Zimmerman
(1956) found four subscales loaded on both
S and A factors. Again S and A appear to define
a stable second-order GZTS factor which
also is linked for the younger Ss with the G
scale. Because of this linkage with G and
inadequate evidence that Leadership is reflected
in the factor, it appears more parsimonious
to call this second-order GZTS factor
Social Activity (SA).

GZTS Restraint and Thoughtfulness scales
consistently defined the third factor. It is
obvious that this factor [...] Thurstone (1951)
and is the reverse of the [...] Extraversion factor
discussed at length ([...], pp. 103-128). To
distinguish this factor from Guilford’s first-order
Rhathymia factor, we will use the name
Extraversion-Introversion (EI).

The Emotional Stability scale appears to
have loadings on all three second-order fac- 
tors. It is probable that the E scale is measur- 
ing a second-order factor rather than a first- 
order factor, since Guilford and Zimmerman 
(1949) combined first-order factor scales of 
Depression and Cycloid Disposition in de- 
veloping the GZTS E scale. Since there is 
only one scale on the GZTS that measures 
this second-order factor, a separate (fourth) 
factor of Emotionality (Em) could not ap- 
pear in the analyses. The E scale divided its 
common factor variance primarily between 
the Fr and SA factors with considerable in- 
tersample fluctuation in the size of its load- 
ings on these second-order factors. 

The evidence seems rather strong that 
scores from the 10 GZTS scales can be or- 
ganized into at least three, and possibly four, 
independent (orthogonal) factors: Friendli- 
ness (Fr), Social Activity (SA), Extraver- 
sion—Introversion (EI), and Emotionality 
(Em). The relative independence of the first 
three factors is confirmed by the correlations 
reported by Thurstone (1951) between his 
second-order oblique factors of Friendliness, 
Social Leadership, and Reflectiveness. Al- 
though rotation of oblique factors permits 
large intercorrelations among the factors 
(while orthogonal rotation permits only zero 
correlations among factors), Thurstone found 
the correlations to be .01, —.11, and .07. 
However, the results with the GZTS E scale 
suggest that the hypothetical Em second-or- 
der factor may be correlated with the Fr and 
SA factors. 

Age differences are evident on two of the 
three major factors. Of the three scales de- 
fining the Fr factor, mean scores of O and F 
remained relatively constant between the age 
groups, and while the P scale scores tended 
to increase with age, they increased irregu- 
larly, reaching their peak in the 30-39 years 
age group. The first-order factor of Personal 
Relations or Cooperativeness may vary with 
age, but the broader factor of Friendliness re- 
mains relatively stable. Masculinity (GZTS 
Scale M) tends to be associated with Fr 
among the younger Ss, but as the mean scale 
M scores decline with age (contrary to the 
stability of the other scales defining Fr), the 
loading of this scale on Fr decreases to insignificance.
Among the more generally masculine
groups of younger male Ss, individual
differences in Masculinity are positively as- 
sociated with differences in Friendliness, while 
within the generally more feminine groups of 
older Ss the correlation between Masculinity 
and Friendliness disappears. 

All three of the scales loading on the fac- 
tor of Social Activity show decreasing mean 
scores with increasing age. The loading of the 
GZTS G (General Activity) scale on factor 
SA slightly decreases with age with the size 
of the loading tending to parallel the size of 
the G scale mean. The older male Ss appear 
on the GZTS to withdraw more from vigorous 
social activities and to be more submissive in 
the social world. This does not imply a with- 
drawal from society but, rather, a reorienta- 
tion of their role in society toward a less com- 
petitive position that is more free of the de- 
mands of social leadership. 

The factor of Extraversion—Introversion 
shows some changes with age, primarily in 
the direction of greater introversion among 
older Ss. This trend was statistically signifi- 
cant only for the Restraint component of EI, 
with Thoughtfulness not increasing as mark- 
edly among the older groups. Scores on the 
two scales loaded for the EI factor increase 
up to the 40-49 age group, and no change in 
mean R and T scores is apparent between 
the 40-49 and 50-59 age groups. 

Obviously this cross-sectional study does
not imply that male Ss show the personality
changes with age that are discussed above,
but only that these personality differences existed
among groups of Ss who were born in the
years (approximately) 1928, 1918, 1909,
and 1899. Whether the trait differences apparent
in 1953 reflect age changes in the Ss
or differences in the environmental factors
influencing personality formation in childhood
during these early years cannot be assessed
from this type of study.


SUMMARY 


Scores on the 10 Guilford-Zimmerman Tem- 
perament Survey scales were obtained for four 
groups of male Ss (N = 100) in different age 
decades: 20-29, 30-39, 40-49, and 50-59 
years. Scores on the General Activity, Ascendance,
Sociability, and Masculinity scales significantly
and linearly decreased with increasing
age, while the Restraint scores linearly
increased with age and scores from the Personal
Relations scale showed an irregular increase.
Factor analyses of the scale intercorrelations
resulted in three major orthogonal
factors: Friendliness (Fr), Social Activity 
(SA), and Extraversion-Introversion (EI). 
Scales of Objectivity, Friendliness, and Per- 
sonal Relations consistently loaded on the Fr 
“super” factor, Ascendance and Sociability 
loaded on the SA factor, while Thoughtful- 
ness and Restraint loaded for the EI factor. 
The loading of Masculinity on the Fr factor 
showed a progressive decline with increasing 
age. Emotional Stability inconsistently loaded 
on both Fr and SA factors and it was sug- 
gested that the E scale may represent a fourth 
second-order factor (Emotionality) present in 
the GZTS inventory. 


Koffka’s (1935) discussion of the constancies
gave a controversial status to the
question concerning progressive development 
of size constancy in childhood. Of recent 
years, empirical evidence has accumulated to 
resolve this controversy. Lambercier (1946a, 
1946b), Piaget and Lambercier (1943, 1951, 
1956), and Zeigler and Leibowitz (1957) all 
agree as to the existence of a developmental 
trend in objective size-distance judgments, 
which reaches near constancy or overconstancy
in adulthood.

The present study focuses upon effects of
the type of impaired development which is
due to organic brain damage. It was expected
that organic mental defectives, due to their
impaired organizing and integrating abilities,
would make judgments which fall short of
size constancy, relative to the judgments of
normal persons of equivalent age. The defective
individuals would thus occupy a place on
the developmental curve which is normally
marked by a younger age group.

Since an obtained difference in mean size-matches
between organic mental defectives and
normals could be attributed to the [...]
between these groups, it was
planned to use a familial mentally retarded
group as a control. The familials would have
mental ages similar to those of the organics,
but would lack the factor of brain damage
which the present hypothesis specifies as the
cause of judgments atypical for a given


1 This study was conducted when the authors were
at the Training School at Vineland. Thanks are due
to the superintendent and staff of the Vineland
State School, whose cooperation made possible collection
of some of the data.


chronological age. Success in judging the 
physical size of a distant object is consid- 
ered to be an ability developed through nor- 
mal visual experience and not primarily de- 
pendent on intelligence. It was therefore ex- 
pected that the familial group would make 
size-distance judgments which are closer to
those of the normal group than to those of 
the organic group. 

A supplementary aim of the investigation 
was to study the trends of the size-matches 
when the comparison object was presented 
at three different distances. Previous work by 
Jenkin (1957, 1959) has shown that with 
normal adult Ss, objective size-matches of a 
standard at a constant distance become pro- 
gressively reduced as the comparison object 
recedes from the point of observation. This 
tendency, which has been interpreted as an 
increase of apparent size with increasing dis- 
tance, was thought to be probably a product 
of visual learning. If this is so, an interaction 
of groups by distances would be found when 
the organics are compared with other groups. 
The expected direction of the relationship 
was towards organics showing less difference 
between distances than either normals or
familials.


METHOD


Subjects. Four groups of Ss were used, each group 
numbering 16 cases and including both sexes. Cases 
of gross visual defect were excluded from each group. 
The organic group consisted of mentally retarded 
residents of the Training School at Vineland and 
the Vineland State School. Each member of this 
group had been classified in the records of the respective
institution as brain damaged from causes
occurring prenatally or early in life. The median age 
of this group was 18 yr. 6 mo., and the median mental
age (Binet, or equivalent) was 7 yr. 10 mo. The
familial group was obtained from the same institutions.
Its members were classified in the records as
mentally retarded cases of familial etiology. The
median age was 15 yr. 9 mo., and the median mental
age (Binet, or equivalent) was identical with that
of the organic group, 7 yr. 10 mo. The normal adolescent
group consisted of students from a nearby
junior high school who agreed to serve as Ss for 
monetary compensation. The median age was 14 yr. 
No mental ages were available, but it was ascer- 
tained that the students were all achieving at school 
at a rate normal for their age. The normal adult 
group consisted of employees of the Training School 
at Vineland, most of whom were young professional 
personnel. The median age was 23 yr. 

Apparatus. The apparatus consisted of white card- 
board squares. The standard card, 4 in. square, was 
attached by a steel wire to a ringstand and was 
placed directly in front of S at a distance of 320 in. 
from his eyes. The comparison series was a set of 
squares ranging from 21 in. to 6 in. by steps of 4 in. 
They were individually displayed at the same height 
as the standard upon a platform which was situated 
on S’s right. Concealed pins and a slight depression, 
invisible to S, served to maintain the comparison 
cards in vertical position in the middle of the plat- 
form. The angular separation between the standard 
and the comparison was 87°. Homogeneous fields 
against which both stimuli were viewed were pro- 
vided by wooden screens painted a dull green. Little 
daylight entered the room, and the stimuli were 
illuminated by 75-w. floodlights placed so as to give 
an illumination reading of 11 foot candles (measured 
by a Macbeth Illuminometer) from a correctly po- 
sitioned square. 

Design. Each S judged the size of the standard 
with the comparison object placed at three different 
distances: 20 in., 80 in., and 160 in. Two trials were
conducted at each distance, one an ascending and
the other a descending series. Each group of Ss (organic,
familial, adolescent, adult) was divided into
two subgroups, A and B. Subgroup A experienced 
the different distances in the order 80 in., 160 in., 
20 in.; and Subgroup B experienced the order 20 in., 
160 in., 80 in. Half the members of each subgroup 
began with an ascending series and the other half 
began with a descending series. 
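
The counterbalancing just described can be laid out as a small enumeration. This is a sketch under one assumption not stated in the text: that Subgroups A and B were of equal size (8 Ss each), which gives 4 Ss in every group-by-order-by-starting-series cell. The constant and function names are illustrative.

```python
from collections import Counter
from itertools import product

GROUPS = ("organic", "familial", "adolescent", "adult")
# Order in which each subgroup met the comparison distances (inches).
DISTANCE_ORDERS = {"A": (80, 160, 20), "B": (20, 160, 80)}
FIRST_SERIES = ("ascending", "descending")
SS_PER_GROUP = 16


def build_assignments():
    """Return one (group, subgroup, first_series) tuple per subject,
    assuming the 16 Ss of a group are spread evenly over the four cells."""
    cells = list(product(DISTANCE_ORDERS, FIRST_SERIES))
    per_cell = SS_PER_GROUP // len(cells)
    assignments = []
    for group in GROUPS:
        for subgroup, first in cells:
            assignments.extend([(group, subgroup, first)] * per_cell)
    return assignments


assignments = build_assignments()
cell_counts = Counter(assignments)

print(f"{len(assignments)} Ss in {len(cell_counts)} cells")
for (group, subgroup, first), n in sorted(cell_counts.items()):
    print(f"{group:10s} {subgroup} {DISTANCE_ORDERS[subgroup]} {first:10s} n={n}")
```

With 64 Ss and 8 group-by-order cells, the between-Ss error term has 64 - 8 = 56 degrees of freedom, which agrees with the Ss line of the analysis of variance table.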

Procedure. The height of the stimulus objects and 
S's eye level were properly matched. The platform 
was placed at the distance appropriate for viewing 
the comparison object on the first trial. S was then 
instructed about the nature of the judgment required.
He was asked to respond to each comparison
card with an answer of “smaller,” “the same,” or
“larger.” E explained that “the same” meant
that the comparison card would exactly match the
standard in physical dimensions. The concepts of
“smaller” and “larger” were illustrated with demonstration
cards of such size as to lie outside the range
of comparison cards used in the experiment. The
concept of “equality” was illustrated with a pair of
cards 3 in. square, held in E’s hands, then superimposed
and rotated before S’s eyes. Two practice
trials were then conducted, one ascending and the 
other descending. A pilot study had indicated that, 
with retarded Ss, at least this much experience was 
necessary for the thresholds to attain reasonable sta- 
bility. Data from the practice trials were therefore 
excluded from the analysis. S was required to look 
first at the standard and then at the comparison, 
keeping his head erect and in the same position for 
each judgment. He was not prevented from repeat- 
edly looking back and forth if he wished to do so. 
Within predetermined limits, E varied the size of 
the card with which each series began, and also ran- 
domly switched from one set of comparison cards 
to a second identical set. 


RESULTS 


Table 1 shows the mean size-matches at 
each of the three distances for the four groups 
of Ss. It will be seen that the grand mean for 
the organic group is less than 4 in., which 
latter value would represent perfect constancy. 
The grand mean for every nonorganic group 
is higher than 4 in. 
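
The grand means quoted above can be checked against the per-distance means in Table 1. The sketch assumes that each grand mean is the unweighted average of the three distance means and reads the adult grand mean as 4.407; the variable names are illustrative.

```python
STANDARD_IN = 4.0  # physical size of the standard; a 4-in. match is perfect constancy

# Mean size-matches (inches) at comparison distances of 20, 80, and 160 in.
table_1 = {
    "organic defectives": (3.777, 3.805, 3.746),
    "familial defectives": (4.178, 4.250, 3.982),
    "normal adolescents": (4.246, 4.195, 4.113),
    "normal adults": (4.605, 4.355, 4.262),
}
reported_grand_means = {
    "organic defectives": 3.776,
    "familial defectives": 4.137,
    "normal adolescents": 4.185,
    "normal adults": 4.407,
}

grand_means = {group: sum(m) / len(m) for group, m in table_1.items()}

for group, mean in grand_means.items():
    side = "below" if mean < STANDARD_IN else "above"
    print(f"{group:20s} {mean:.3f} ({side} the 4-in. standard)")
```

Only the organic group falls below the 4-in. standard; the three nonorganic groups lie above it, in the direction of overconstancy.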

The results of an analysis of variance of 
the data are shown in Table 2. Variability 
among the four groups is statistically signifi- 
cant (P < .01). The three distances also rep- 
resent a significant source of variance. The 
specific hypotheses of the experiment are answered
by t tests of the significance of the
difference between pairs of groups. With 30
df, the value obtained for the organics vs.
familials comparison was t = 2.32, p < .05.
A test of the organics vs. adolescents comparison
yielded t = 2.63, p < .02, and between
the organics and adults, t reached a
value of 4.06, p < .001. All comparisons of
nonorganic pairs of groups, as expected,


TABLE 1

MEAN SIZE-MATCHES, IN INCHES, OF A 4-IN. STANDARDᵃ
BY 4 GROUPS OF SUBJECTS

                          Distance of Comparison Object
                                                           Grand
Group                   20 In.    80 In.    160 In.        Mean
Organic defectives      3.777     3.805     3.746          3.776
Familial defectives     4.178     4.250     3.982          4.137
Normal adolescents      4.246     4.195     4.113          4.185
Normal adults           4.605     4.355     4.262          4.407

ᵃ The standard was 320 in. from the point of observation.
TABLE 2

ANALYSIS OF VARIANCE OF SIZE-MATCHES

Source            df      MS               F
Groups (G)         3    210.25           5.66*
Order (O)          1    103.55           2.79
G X O              3     49.15           1.32
Ss                56     37.16
Distances (D)      2     33.[illegible]  [illegible]**
D X G              6      7.96           1.85
D X O              2      5.51           1.28
D X G X O          6      3.57           0.83
Residual         112      4.30

* P < .01.
** P < .001.
showed nonsignificant differences, p being
greater than .05 in every case.

It is noteworthy that the organic group
made a mean size-match significantly smaller
than that of the familial group, while the
size-match of the latter group did not differ
significantly from those of either of the nor-
mal groups. The atypical performance of the
organics cannot be attributed to low meas-
ured intelligence, since the two retarded
groups were equated on this variable.²

The effort to match the ages of the or-
ganics, familials, and normal adolescents was
not entirely successful. The unavoidable error
of having an organic group with too high a
median age is, however, in a direction which
weakens the chances of a positive finding.
Both the familial and the normal adolescent
groups are, on the average, younger than the
organic group and, on the developmental hy-
pothesis, this would tend to minimize any
differences which may be due to organicity.
Nevertheless, our hypothesis concerning the
relationship of the organics to the other
groups is strongly supported by the data.
An examination of Table 1 shows a
trend of mean size-matches among the three
different distances of the comparison object
as expected for the normal groups, i.e., to-
wards smaller matches as distance increases.
Also as expected, there is less variability be-
tween distances in the case of the organic
group. Table 2 indicates, however, that the
differences among these trends are not great
enough to produce a significant groups X dis-
tances interaction.

² A further test of the possible effect of mental age
was made by computing the rank correlations be-
tween mental ages and mean size-matches. Nonsig-
nificant values of -.22 for the organic group and
-.05 for the familial group were obtained.
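
The F ratios of Table 2 can be checked against the mean squares, as in the following sketch. It assumes the usual mixed-design convention, in which between-subject effects are tested against the Ss mean square and within-subject effects against the residual; the text does not state the error terms explicitly, and the names used are illustrative. The Distances row is omitted because its mean square is not legible in the table.

```python
# Minimal sketch: recompute the legible F ratios of Table 2 from the mean
# squares, assuming Ss and Residual are the respective error terms.

BETWEEN_ERROR_MS = 37.16  # Ss (subjects within groups and orders)
WITHIN_ERROR_MS = 4.30    # Residual

BETWEEN_EFFECTS = {"Groups (G)": 210.25, "Order (O)": 103.55, "G X O": 49.15}
WITHIN_EFFECTS = {"D X G": 7.96, "D X O": 5.51, "D X G X O": 3.57}


def f_ratio(effect_ms, error_ms):
    """F is the effect mean square divided by its error mean square."""
    return effect_ms / error_ms


if __name__ == "__main__":
    for name, ms in BETWEEN_EFFECTS.items():
        print(f"{name:12s} F = {f_ratio(ms, BETWEEN_ERROR_MS):.2f}")
    for name, ms in WITHIN_EFFECTS.items():
        print(f"{name:12s} F = {f_ratio(ms, WITHIN_ERROR_MS):.2f}")
```

Under that assumption the recomputed values agree with the tabled F ratios to two decimals, which supports the reading of the table given here.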


DISCUSSION 


The present study has demonstrated that 
organic mental defectives tend to underesti- 
mate the size of a distant object. Normal per- 
sons tending to be younger than the defec- 
tives are inclined to overestimate the size of 
a distant object, and so are normals who tend 
to be older than the defectives. The difference 
between the performances of organics and 
normals cannot be attributed to the differ- 
ence in Binet mental age, as shown by the 
performance of a control group of familial 
etiology. 

These results, which support the main hy- 
pothesis of the investigation, may be usefully 
compared with data from a study of normal 
children and adults by Piaget and Lambercier 
(1951). A group of children, aged from 7-8 
yr., made a mean objective match which fell 
short of size constancy. The next group, aged 
from 8-10 yr., slightly exceeded size con- 
stancy, and the three remaining groups re- 
spectively gave average values which showed 
a progressive overestimation as age increased. 
The direction and extent of the difference 
between Piaget and Lambercier’s adolescent 
(12-14 yr.) and adult groups resemble what 
was found from the comparable groups of the 
present study. Our organic group, however, 
behaved like the young children (7-8 yr.) of 
Piaget and Lambercier, even though our cases 
were very much older. 

It appears reasonable, therefore, to inter- 
pret the behavior of our organic group as in- 
dicating immaturity of size-distance judg- 
ment, due to brain damage occurring pre- 
natally or early in life. A firm conclusion of 
this type is not at present possible, since the 
present authors did not have the opportunity 
to study a group of individuals who had suf- 
fered recent brain damage. A future experi- 
ment might determine whether or not such 
persons behave like normals or like young 
children in the size-distance judging situa-
tion, and the role of learning would thus be
further clarified.

In the meantime, the authors consider that
the present results are generally consistent 
with other work on perception in organic 
mental defectives, summarized by Strauss and 
Kephart (1955) and by Gallagher (1957), 
and might usefully be interpreted along 
broadly similar lines. This is to the effect 
that prenatal or early brain damage impairs 
the development of integrative cerebral mecha- 
nisms which are necessary for normal percep- 
tual functioning (Strauss & Kephart, 1955, 
pp. 57-58). Organics thus tend to show fea- 
tures such as a fragmentary response to a 
complex pattern of stimulation, inadequate 
perceptual discrimination, and a failure of 
synthesis in a situation where several sources 
of information are normally combined. 

It is the latter feature which in particular 
seems to have been demonstrated in the pres- 
ent study. It cannot be pretended that a fully 
satisfactory theory of size constancy exists, 
but it is known that its manifestation de- 
pends upon the apprehension of various depth 
cues and the integration of these with other 
sources of information. The proximal stimu-
lus size and also identification and categori-
zation of the object displayed are probably
included among these sources. The chroni-
cally impaired ability to integrate several
sources of information and the consequent 
weakness of the information which is given 
by experience are factors probably respon- 
sible for the tendency of the organic defective 
to rely more heavily than other groups upon 
the proximal stimulus size. 

It is of interest to note in this connection 
that “normal” perception is not necessarily 
the most veridical perception. Table 1 indi- 
cates that the organic group was, on the av- 
erage, more successful than the adult group 
in matching the objective size, and the same 
statement may be made about the young chil- 
dren in Piaget and Lambercier’s study. As 
pointed out by Jenkin and West (1958), in 
connection with the size-weight illusion, the
gross degree of error measured in some per- 
ceptual situations is not an index of percep- 
tual inefficiency; rather, it is the opposite. 
This point of view leads us to interpret the 
behavior of the organics as tending to be 
“stimulus bound,” whereas the normals, by 
virtue of their superior organizing ability and
more adequate fund of experience, tended
more to respond to the total situation, in-
cluding its illusory aspects.


SUMMARY 


Binocular objective size-matches of a stand- 
ard 4-in. square situated 320 in. from the 
observer were obtained at three different 
distances of the comparison object. Under 
conditions of strong depth cues and ample 
illumination, judgments were made by four 
groups of Ss (N = 16 per group): (a) fa- 
milial type mental defectives, (b) brain dam- 
aged mental defectives, (c) normal adoles- 
cents, (d) normal adults. As hypothesized, 
the organics made matches significantly closer 
to the visual angle than those of the other 
groups. There was no significant difference 
between the familials and the normals. As ex- 
pected, variation among the different dis- 
tances was smaller in the case of the organic 
group, but this trend was not statistically 
significant. The data were discussed in terms 
of perceptual learning, and it was suggested 
that the performance of the brain damaged 
cases is due to chronically impaired ability 
to integrate several sources of information. 
IMPULSE REPRESSION AND EMOTIONAL ADJUSTMENT

Theories of personality development have
emphasized what behavior the individual con-
siders acceptable or right and what behavior
he considers unacceptable or wrong. These
judgments of rightness or wrongness form im-
portant parts of psychological concepts such
as value systems, super ego, ego-ideal, and
self concept. The basic premise is that while
very young the child learns that certain types
of behavior result in parental approval,
while other forms of behavior bring disap-
proval. Since the young child’s security and
sense of well being are so closely bound to
parental evaluation, he soon learns to avoid
the disapproved forms of behavior and even-
tually to repress impulses to such acts. The
child experiences anxiety if he behaves in a
nonacceptable fashion or even if impulses to
such behavior are experienced.

Since the above position places such strong
emphasis on conflicts between impulses to be-
have and [illegible] inhibit behavior,
[illegible] anxiety, [illegible] impulse
repression theory [illegible].
The experimental evidence concerning the
validity of the impulse repression position is
contradictory. Stone and Landis (1953) found
that high school seniors who were raised in a
democratic family structure experienced fewer
conflicts with their families than seniors
raised in an autocratic family. Watson (1934)
studied the effects of permissive vs. strict
home discipline and found that subjects raised
under strict home discipline reported more in-
dications of emotional conflict; however, Wat-
son (1957) in a later study found no differ-
ence in the emotional adjustment of children
raised in strict and permissive families. Brown
(1942) found no relationship between the
attitudes of parents toward the child’s behav-
ior and the child’s adjustment. Lasko (1952)
reported that children raised by laissez faire
parents gave indications of emotional malad-
justment.

1 This paper is based upon a doctoral dissertation
completed at Pennsylvania State University. The
writer should like to acknowledge the advice and
suggestions of William U. Snyder under whose direc-
tion this study was done.

Two factors limit the effectiveness with 
which one can use the above empirical data 
to evaluate the impulse repression theory of 
anxiety. First, as previously indicated, the 
above studies are contradictory. Second, they 
have tended to emphasize the democratic- 
autocratic continuum. It is quite possible for 
a family to have an autocratic structure and 
yet allow the child relatively free expression 
of certain basic impulses. Thus, determining 
that an autocratic family structure results in 
greater maladjustment does not indicate that 
this maladjustment resulted from the repres- 
sion of basic impulses. 

Since the impulse repression theory already
presented depicts anxiety as resulting from
conflicts between impulses and repression, the
child who has been taught a large number of
behavior restrictions by the significant people
in his life should experience greater anxiety
than the child brought up under more liberal
conditions. The above should be true since there
would be more opportunity for the child with
a high set of behavior standards to experience
conflicts between impulses and repression, as
well as between behavior and standards. If
the child with the high set of behavior stand-
ards experiences more conflict, he should have
also developed more symptoms of these con-
flicts, that is, more neurotic symptoms.

This research was designed to explore the
following questions which are related to the
impulse repression theory of anxiety already
presented. Can subjects who report a larger
number of neurotic symptoms be distinguished
from subjects who report a smaller number
of neurotic symptoms on the following fac-
tors:


(a) The number of moral and ethical
    standards of behavior held by the sub-
    ject?

(b) The frequency with which they en-
    gaged in behavior involving a moral
    or ethical judgment?

(c) The number of moral and ethical be-
    havior standards held by one of the
    significant adults in the subject’s life
    as perceived by the subject?

(d) The number of moral and ethical be-
    havior standards held by the signifi-
    cant adult in the subject’s life?

(e) The frequency with which the subject
    engaged in behavior which he consid-
    ered wrong, perceived a significant
    adult in his life as considering wrong,
    or is considered wrong by a significant
    adult in his life?

METHODS AND PROCEDURES

Single, white, Protestant university females were
selected as the population to be used in this research.
Mothers were selected as the significant adult in the
subject’s life.

The Minnesota Multiphasic Personality Inventory
was chosen as the instrument to measure emotional
adjustment. Since the experimental questions related
to neurotic symptoms, the three scales of the
neurotic triad, hypochondriasis (Hs), depression (D),
and hysteria (Hy), were selected as measures of
neurotic symptomatology.

In order to obtain items for a scale of moral and ethical
standards, a questionnaire was sent to [illegible] college
females. They were asked to list the moral
or ethical decisions that females of their age are
called upon to make. A list of items was com-
piled from the returns to this questionnaire.
These items were used to form three scales. Each
item was contained on all scales, but the subjects
were asked to respond in a different way on each. On
the first scale, they indicated whether they felt
the behavior described would be morally or ethically
right or wrong for all females of their age. On the
second scale, they were asked to indicate whether
they believed their mother would consider the behavior
right or wrong for females of their age. On the third
scale, they indicated whether or not they had engaged
in the behavior described. These scales were called the
Daughter Wrong, Perceived Mother Wrong, and Not Done
scales, respectively.

These three scales were administered to
100 college females. The scales were scored by
counting the number of items checked wrong on
the Daughter Wrong and Perceived Mother Wrong 
scales, and by counting the items marked as not done 
on the Not Done scales. The results were used in an 
item analysis. Responses of subjects falling in the 
top and bottom quarters were compared for each 
of the three scales. Phi coefficients were calculated 
graphically for each item. Those items which dis- 
criminated between the top and bottom quarters on 
one or more scales at the 5% level of confidence 
were retained. A total of 62 items were included on 
the basis of this criterion. 
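
The item analysis described above can be sketched as follows. The original phi coefficients were read off graphically; the version below computes phi directly from a 2 x 2 table and tests it through the identity chi-square = N x phi squared with 1 df, which is a standard substitute and not necessarily the procedure actually followed. The counts in the example are hypothetical.

```python
# Minimal sketch of a top-quarter vs. bottom-quarter item analysis.
import math

CHI_SQUARE_CRITICAL_05 = 3.841  # chi-square, 1 df, 5% level


def phi_coefficient(top_marked, top_unmarked, bottom_marked, bottom_unmarked):
    """Phi for a 2 x 2 table of quarter (top/bottom) by response."""
    a, b, c, d = top_marked, top_unmarked, bottom_marked, bottom_unmarked
    denominator = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denominator == 0:
        return 0.0  # a degenerate table carries no information
    return (a * d - b * c) / denominator


def discriminates(top_marked, top_unmarked, bottom_marked, bottom_unmarked):
    """True if the item separates the quarters at the 5% level."""
    n = top_marked + top_unmarked + bottom_marked + bottom_unmarked
    phi = phi_coefficient(top_marked, top_unmarked,
                          bottom_marked, bottom_unmarked)
    return n * phi ** 2 > CHI_SQUARE_CRITICAL_05


if __name__ == "__main__":
    # Hypothetical item: 20 of 25 top-quarter subjects mark it "wrong",
    # against 10 of 25 bottom-quarter subjects.
    print(phi_coefficient(20, 5, 10, 15), discriminates(20, 5, 10, 15))
```

An item would be retained if `discriminates` holds on at least one of the three scales, mirroring the "one or more scales" criterion in the text.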

The items included on the final scale measured be- 
havior or attitudes involving hostility, heterosexuality, 
religion, college regulations, and some items less 
easily described which involve some of the social be- 
havior expected of university females. The following 
are examples of each of the above types of items: 


(a) being critical of your parents 

(b) having sexual relations prior to marriage if en- 
gaged 

(c) changing religious beliefs because of what is 
taught in a course 

(d) breaking living unit regulations regarding
hours

(e) social drinking

Split half reliabilities were calculated for each 
scale. These corrected reliabilities were as follows: 
Daughter Wrong, .87; Perceived Mother Wrong, .84;
Not Done, .88.
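
The text does not name the correction applied to the split-half values. A common choice is the Spearman-Brown formula, sketched below on that assumption; the function names are illustrative, and the uncorrected half-test correlations are not reported in the text.

```python
# Minimal sketch, assuming the Spearman-Brown correction was the one used.


def spearman_brown(half_test_r):
    """Estimated full-length reliability from a half-test correlation."""
    return 2 * half_test_r / (1 + half_test_r)


def implied_half_test_r(corrected_r):
    """Inverse of the correction: the half-test r implied by a corrected value."""
    return corrected_r / (2 - corrected_r)


if __name__ == "__main__":
    for scale, corrected in (("Daughter Wrong", 0.87),
                             ("Perceived Mother Wrong", 0.84),
                             ("Not Done", 0.88)):
        print(f"{scale}: implied half-test r = "
              f"{implied_half_test_r(corrected):.2f}")
```

If the assumption holds, the reported values would correspond to half-test correlations somewhat lower than the corrected figures, as the inverse function shows.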

The MMPI and the Moral Ethical Values Scales 
were administered to a sample of 136 college females. 
The sequence in which the Moral Ethical Value 
Scales were administered was alternated in order to 
reduce the influence of any particular sequence. All 
of the subjects included in the population were sin- 
gle, white, Protestant, of average college age who, 
except for the time spent at college, were living with 
their mothers at the time of testing. All were volun- 
teers from second or third year courses in psychol- 
ogy or education. They were not asked to put their 
names on the test. Instead they signed their names 
to a sign-up sheet and code numbers were used to 
match tests and subjects. The volunteers did not 
know the matchings could be made. 

The items on the Moral Ethical Values scales were 
sent to the mothers of these 136 subjects. They were 
asked to indicate whether they felt that each item 
described behavior which would be morally or ethi- 
cally right or wrong for a female of college age. 102 
mothers responded and 95 of the responses were 
usable. The others were discarded for failure to fol- 
low directions accurately. The mothers were not 
asked to sign their names to these scales and the 
mothers’ and daughters’ scales were matched by using
code numbers.

The daughters of the 95 mothers whose responses
were used in the study were compared with the
daughters of the mothers who were not included.
The scores received on the neurotic triad of the
MMPI and on the three Moral Ethical Values scales
were used for these comparisons. Means and critical
ratios were calculated to determine if there were sig-
nificant differences between the means of these two
groups. No significant differences were found be-
tween these two groups on the scores they received
on either the neurotic triad of the MMPI or on the
three Moral Ethical Values scales.

The subjects received the following scores on the 
Moral Ethical Values scales: 


(a) Daughter Wrong (DW). The number of items 
on this scale which the daughter considered 
morally or ethically wrong for a female of 
her age. 

(b) Perceived Mother Wrong (PMW). The num- 
ber of items the daughter marked indicating 
that she believed her mother would consider 
the behavior being described as morally or 
ethically wrong for a college female. 

(c) Not Done (ND). The number of items on the 
scale which described behavior in which the 
subject had not engaged. 

(d) Mother Wrong (MW). The number of items 
on the scale which the mother indicated she 
would consider morally or ethically wrong for 
a female of college age. 

(e) Daughter Behavior Conflict (DBC). The fre- 
quency with which the daughter had engaged 
in behavior which she indicated was morally 
or ethically wrong. 

(f) Perceived Mother Behavior Conflict (PMBC). The
frequency with which the daughter had en- 
gaged in behavior which she perceived her 
mother as considering morally or ethically 
wrong. 

(g) Mother Behavior Conflict (MBC). The fre- 
quency with which the daughter had engaged 
in behavior which the mother considered mor- 
ally or ethically wrong. 
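
The seven scores listed above can be sketched as a single scoring routine over dichotomous item responses. The data layout (one boolean per item) and all names are assumptions made for illustration, and the conflict scores are expressed as proportions, in the form used for Table 3, not as raw frequencies.

```python
# Minimal sketch of the scoring, assuming one True/False response per item.


def conflict_proportion(marked_wrong, engaged):
    """Share of the items marked wrong that the daughter had nevertheless done."""
    total_wrong = sum(marked_wrong)
    if total_wrong == 0:
        return 0.0  # no items marked wrong, so no conflict is possible
    both = sum(1 for wrong, done in zip(marked_wrong, engaged) if wrong and done)
    return both / total_wrong


def score_subject(daughter_wrong, perceived_mother_wrong, not_done, mother_wrong):
    """Return the DW, PMW, ND, MW counts and the three conflict proportions."""
    engaged = [not item for item in not_done]
    return {
        "DW": sum(daughter_wrong),
        "PMW": sum(perceived_mother_wrong),
        "ND": sum(not_done),
        "MW": sum(mother_wrong),
        "DBC": conflict_proportion(daughter_wrong, engaged),
        "PMBC": conflict_proportion(perceived_mother_wrong, engaged),
        "MBC": conflict_proportion(mother_wrong, engaged),
    }


if __name__ == "__main__":
    # Hypothetical four-item protocol.
    scores = score_subject(
        daughter_wrong=[True, True, False, False],
        perceived_mother_wrong=[True, True, True, False],
        not_done=[True, False, False, True],
        mother_wrong=[True, False, True, True],
    )
    print(scores)
```

Dividing by the number of items marked wrong keeps a subject with many standards from appearing more conflicted merely because she had more opportunities for conflict.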


The subjects were divided into groups on the basis
of the scores they received on the MMPI. All of the
subjects rated in the top third of the sample on the
Hs scale were placed in one group called the Hs
group. Subjects who fell into the top third on the
D scale were placed in a D group, and subjects who
rated in the top third on the Hy scale were placed in
a Hy group. It was possible for subjects to be in-
cluded in more than one neurotic group if they had
received high ratings on more than one scale. The
remaining subjects, none of whom had been in-
cluded in a neurotic group, were formed into a non-
neurotic group.

Table 1 shows the mean T scores and T score
ranges received by the neurotic and nonneurotic
groups on the MMPI.

The division into top third and bottom two thirds could
not be exact because subjects received scores which
were at the cut-off point between the upper third
and bottom two thirds. [illegible]ing all subjects who
ti[illegible]

Harry A. Grater 


TABLE 1

MEAN T SCORES (ROUNDED) AND T SCORE RANGES RE-
CEIVED BY THE NONNEUROTIC AND NEUROTIC
GROUPS ON THE NEUROTIC TRIAD
OF THE MMPI

Group                Mean T Score    T Score Range
Hs scale
  Nonneurotic             46
  Hypochondriacal         59
D scale
  Nonneurotic             43            32-51
  Depressive              60            52-76
Hy scale
  Nonneurotic             50            40-57
  Hysterical              65            58-80


of each of the above groups was reduced when the 
MW scores were used. The number in each group
was reduced to: Hs, 24; D, 24; Hy, 31; nonneu-
rotic, 48.

Means for these four groups were calculated for 
the scores received on the Moral Ethical Values 
scales. Each mean score received by the neurotic 
groups was compared with the means received on
the same scale by the nonneurotic group. Critical 
ratios were calculated to determine whether the dif- 
ferences were significant. 
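
A critical ratio in the usual sense is the difference between two means divided by the standard error of that difference, referred to the normal curve. The sketch below uses that conventional definition; the text gives no formula, and since no standard deviations are reported, the values in the example are hypothetical.

```python
# Minimal sketch of a critical ratio for two independent group means.
import math
from statistics import NormalDist


def critical_ratio(mean_1, mean_2, sd_1, sd_2, n_1, n_2):
    """Difference between means over the standard error of the difference."""
    standard_error = math.sqrt(sd_1 ** 2 / n_1 + sd_2 ** 2 / n_2)
    return (mean_1 - mean_2) / standard_error


def two_tailed_p(ratio):
    """Two-tailed probability of a ratio this large under the normal curve."""
    return 2 * (1 - NormalDist().cdf(abs(ratio)))


if __name__ == "__main__":
    # Hypothetical standard deviations; the group sizes are those reported
    # for the nonneurotic (48) and Hy (31) groups when MW scores were used.
    ratio = critical_ratio(51.48, 47.00, 8.0, 8.0, 48, 31)
    print(f"CR = {ratio:.2f}, p = {two_tailed_p(ratio):.3f}")
```

With this definition a ratio of about 1.96 corresponds to the 5% level and about 2.58 to the 1% level, two-tailed.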


RESULTS 


Table 2 shows the mean scores received by
the neurotic and nonneurotic groups on the Moral
Ethical Values scales. Critical ratios have
been calculated for the difference between
these means in order to evaluate the signifi-
cance of the differences.


TABLE 2

MEANS AND THE CRITICAL RATIOS OF THE DIFFERENCES
BETWEEN THESE MEANS OF THE SCORES RECEIVED
BY THE NONNEUROTIC AND NEUROTIC GROUPS
ON THE MORAL-ETHICAL VALUES SCALES

Group                 DW       ND       PMW      MW
Nonneurotic          37.04    28.60    47.81    51.48
Hypochondriacal      36.01    28.58    44.72    47.81
  Critical ratio       .53      .00     1.82     1.94
Nonneurotic          37.04    28.60    47.81    51.48
Depressive           35.31    28.15    45.43    47.87
  Critical ratio       .97      .21     1.23     1.82
Nonneurotic          37.04    28.60    47.81    51.48
Hysterical           34.75    28.66    44.16    47.00
  Critical ratio      1.29      .00     2.50*    2.73**

* Significant at 5% level of confidence.
** Significant at 1% level of confidence.

Table 2 shows that there are no significant
differences between the DW scores of neu-
rotic and nonneurotic groups. The largest
critical ratio, 1.29 for the hysterical group,
is not large enough to be significant.
Behavior standards, then, did not distinguish
the neurotic from the nonneurotic group. The
slight differences which were found were in
the direction of the neurotic groups having
lower scores on the DW scale.

There were no significant differences be- 
tween the ND scores of the neurotic and non- 
neurotic groups. Thus, there were no differ- 
ences in the reported behavior of these two 
populations in situations involving a moral or 
ethical judgment. 

The subjects in the Hy group received 
lower PMW scores than did the subjects 
in the nonneurotic groups. This difference 
reached the 5% level of significance. The Hs 
and D groups also perceived their mothers as 
considering fewer items wrong, but these dif- 
ferences were not large enough to be signifi- 
cant; although in the case of the Hs group, 
the difference did reach the 10% level of 
confidence. 

In all three comparisons, the mothers of
subjects in the neurotic groups considered
fewer items wrong than did mothers of the
nonneurotics. All of these differences reached
the 10% level of significance, and when the
mothers of the subjects in the Hy group were
compared with mothers of the subjects in the
nonneurotic group, the difference reached
the 1% level of significance.

Thus, both the mothers and the perceived
mothers of the hysterical group considered
significantly fewer of the items wrong
than did the mothers of the nonneurotics.
The same tendency was found when the PMW
and MW scores received by the other neu-
rotic groups were compared with the non-
neurotic group, but the differences were not
large enough to be significant at the 5%
level of confidence.

The data in Table 3 are presented in terms
of proportions instead of in actual frequencies.
This was done because all of the conflict
scores are in part dependent upon the DW,
PMW, and MW scores. For example, a sub-
ject having a high MW score would have more
opportunity to come in conflict with the ma-
ternal standards than a subject with a lower
score. Since there were differences in the DW,
PMW, and MW scores of the neurotic and
nonneurotic groups, differences between the
proportions of conflict rather than an actual
count of the number of conflicts give more
meaningful results.


TABLE 3

PROPORTION OF THE ITEMS ON THE DW, PMW, AND
MW SCALES MARKED AS MORALLY OR ETHICALLY
WRONG BUT WHICH DESCRIBED BEHAVIOR IN
WHICH THE SUBJECT HAD ENGAGED

Group                 DBC     PMBC    MBC
Nonneurotic           .35     .46     .49
Hypochondriacal       .36     .46     .47
  Critical ratio      .10     .28     .45
Nonneurotic           .35     .46     .49
Depressive            .35     .48     .51
  Critical ratio      .00     .20     .16
Nonneurotic           .35     .46     .49
Hysterical            .35     .43     .45
  Critical ratio      .00     .30     .35

Note. Critical ratios have been calculated to de-
termine if these conflict scores are significantly differ-
ent for the neurotic and nonneurotic groups.

Table 3 shows no significant differences be- 
tween the conflict scores received by the neu- 
rotic and nonneurotic groups. This indicates 
that the neurotic subjects do not tend to 
adhere more closely to their own standards, 
their perceived maternal standards, or the 
maternal standards than do the nonneurotic 
subjects. 

DISCUSSION 


These results either do not uphold, or are
directly antithetical to, the impulse repression
theory of anxiety which was previously pre-
sented. In no instance did a neurotic group
receive a higher DW, PMW, ND, or MW
score, indicating greater inhibition of behavior,
than did the nonneurotic group. When dif-
ferences that approached or reached a signifi-
cant level were found, the neurotic groups re-
ceived lower scores, indicating less behavior
restriction. Nor did the conflict scores indi-
cate that the neurotics were more controlled
by their own standards, the perceived moth-
er’s standards, or the mother’s standards than
were the nonneurotics.

The basic question raised by this research
is “Why were low PMW and MW scores
associated with neurosis?” Perhaps an ex-
planation can be found in Fenichel’s (1945)
discussion of what he refers to as the heter-
onomous super ego or in Mowrer’s (1950)
discussion of super ego repression. Both
Fenichel and Mowrer have emphasized the 
effects of the individual’s confusion about 
standards of behavior. It is possible that the 
liberal PMW and MW scores received by the 
mothers and the perceived mothers were actu- 
ally indications of confusion rather than of 
liberality, and further that confusion is re- 
lated to maladjustment. 

In order to evaluate this confusion hypothe- 
sis a confusion score was calculated and cor- 
related with the Hy score of the MMPI. The 
Hy scale was selected because it was most 
closely associated with low PMW and MW 
scores. The confusion score was calculated by 
counting the number of times the daughter 
had misperceived the mother’s standards. 
The Pearson Product-Moment Correlation be- 
tween the frequency with which the daughter 
had misperceived her mother’s standards and 
hysteria was .30. This correlation is signifi- 
cant at the 1% level of confidence. Thus the 
daughter’s confusion about maternal stand- 
ards was correlated with a measure of neu- 
roticism. 
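
The confusion score and the significance of the reported correlation can be sketched as follows. The Fisher z approximation is used here for convenience and is not claimed to be the test originally applied; the sample size of 95 is likewise an assumption, taken from the number of usable mother protocols, since the text does not give the N for this correlation.

```python
# Minimal sketch: confusion score and an approximate test of r = .30.
import math
from statistics import NormalDist


def confusion_score(perceived_mother_wrong, mother_wrong):
    """Number of items on which the daughter misjudged her mother's answer."""
    return sum(1 for perceived, actual
               in zip(perceived_mother_wrong, mother_wrong)
               if perceived != actual)


def correlation_p_value(r, n):
    """Two-tailed p for a Pearson r via the Fisher z transformation."""
    z = math.atanh(r) * math.sqrt(n - 3)
    return 2 * (1 - NormalDist().cdf(abs(z)))


if __name__ == "__main__":
    print(confusion_score([True, False, True], [True, True, False]))
    # n = 95 is an assumption, not a figure given for this analysis.
    print(f"p = {correlation_p_value(0.30, 95):.4f}")
```

Under these assumptions the probability falls below .01, in line with the significance level stated in the text.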

This research introduces questions which 
need further exploration. The first step might 
be a replication of the experiment reported 
here. If at all possible, neurotic groups should 
be selected which are more deviant than the 
subjects included in this study. Also, sub- 
jects should be allowed to respond to the 
items with a scaled response rather than with 
a dichotomous response. Such scaling would 
allow an evaluation of the confusion hypothe- 
sis which was already presented. 


SUMMARY 


The objective of this research was to test 
the impulse repression theory of anxiety. A 
scale of behavior standards called the Moral 
Ethical Value scale and the MMPI were ad-
ministered to a population of 136 single,
white, Protestant college females. On the first 
scale of the Moral Ethical Values scale, the 
subjects were asked to indicate whether they 
felt the behavior described by each item 
would be morally or ethically right or wrong 
for a college female. On the second scale they 
were asked to indicate whether they felt their 
mothers would consider the behavior de- 
scribed as morally or ethically right or wrong 
for a college female, and on the third scale 
they were asked to indicate whether they had 
engaged in the behavior being described. 
Scales were also sent to the mothers who were 
asked to indicate whether they felt the be- 
havior described would be morally or ethi- 
cally right or wrong for a college female. 

Subjects were divided into depressive, hys-
terical, hypochondriacal, and nonneurotic
groups on the basis of scores received on the 
neurotic triad of the MMPI. 

The neurotic and nonneurotic groups were 
compared on the scores they received on the 
Moral Ethical Values scales. These scales 
gave a measure of the mothers’ and daugh- 
ters’ evaluation of behavior in situations in- 
volving moral or ethical judgment. The results 
showed no significant differences between the 
neurotic and nonneurotic group in the num- 
ber of items on the scale that they indicated 
were morally or ethically wrong, nor was the 
behavior of the neurotic group different from 
the behavior of the nonneurotic group in 
situations involving a moral or ethical judg- 
ment. The neurotic daughters tended to per- 
ceive their mothers as considering fewer items 
wrong than did the nonneurotics. Also the 
mothers of the neurotics tended to consider 
fewer items wrong than did the mothers of 
the nonneurotics. Both the perceived mothers
and the mothers of the Hy subjects consid-
ered significantly fewer items wrong than did
the mothers of the nonneurotic subjects.
There was no difference in the proportion of
time the neurotics and nonneurotics engaged
in behavior which conflicted with their own,
the perceived mothers’, or the mothers’ be-
havior standards.

The results of this research are antithetical
to the impulse repression theory of anxiety.
From this theory one would predict greater
restriction of behavior to characterize the
neurotic groups. In some comparisons the di-
rect opposite of these predicted results was
found.

The hypothesis was presented that liberality 
was actually a reflection of confusion about 
moral and ethical standards. In order to 
evaluate this hypothesis, the frequency with 
which the daughter misperceived her mother’s 
standards was correlated with the hysteria 
scale of the MMPI. A significant correlation 
of .30 was calculated. This tended to support
the confusion hypothesis.
CRITICAL FLICKER FREQUENCY AND THE SPIRAL
AFTEREFFECT WITH PROCESS AND REACTIVE 
SCHIZOPHRENICS * 


JOSEPH M. 
Michigan S 


Considerable theory and some research sug- 
gest that at least certain schizophrenics suffer 
from a disease that is primarily organic in na- 
ture, perhaps of a central nervous system 
locus. Traditionally it has been the tacit as- 
sumption of many workers in the area that 
schizophrenia has an organic etiology, most 
likely involving a disturbance of the brain it-
self (Bleuler, 1950; Noyes, 1953). Published
findings that support such a hypothesis
are far outnumbered by articles
which merely speculate on such a possibility 
without any attempt at experimentation or 
controlled observation. A few studies never- 
theless do offer some support for the assump- 
tion that brain damage is the important fac- 
tor in schizophrenia. Evidence from biopsies 
obtained from 70 cases of dementia praecox 
led Bateman and Papez (1951) to suggest 
that changes in nerve cells of the prefrontal 
cortex are due to pathogenic organisms. From 
an examination of 1087 records Kallmann
(1938) concluded that “the organic process
of schizophrenia in the brain . . . [illegible]
the cortex almost exclusively and usually
leaves the internal parts of the brain intact.”
[illegible] EEGs, often [illegible] of brain
lesions, significantly more often in the more severe
cases of schizophrenia, especially those resem-
bling dementia praecox. Winkelman and Book
(1948) concluded on the basis of 10 autop- 
sies of schizophrenics that gross brain ap- 
pearance did not differ from normals but that 
microscopic evidence did suggest the diagnosis. 

Widespread dissatisfaction with the tradi- 
tional Kraepelinian categorization of schizo- 
phrenia or dementia praecox has led to in- 
creased effort in recent years to refine the 
concept. One suggestion has been to view the 
disorder as including two polar types. At one 
extreme is the patient who has an early and 
insidious onset of psychosis with absence of 
precipitating stress, an inadequate prepsy- 
chotic personality with a tendency to with- 
draw from all human contacts, clinical symp- 
toms showing a flat or dull affect, and rela- 
tive absence of confusion. This syndrome has 
been called process schizophrenia. Reactive 
schizophrenia on the other hand is the term 
given to describe the case where there is typi- 
cally an abrupt and stormy onset of the psy- 
chosis usually attributable to a logical and 
significant stress situation. Here the premor- 
bid personality may be described as normal or 
neurotic, rather than schizoid, with perhaps 
some degree of outgoingness present and a 
clinical picture likely to include severe con- 
fusion and many affective components. At- 
tempts to classify schizophrenic samples into 
the two categories or to view the individual 
patient as occupying a position on a process- 
reactive continuum have increased in number 
and a certain degree of success has been 
achieved in differentiating the groups experi- 
mentally (Becker, 1956; Kantor, Wallner, & 
Winder, 1953; King, 1958). One investiga- 
tion by Brackbill and Fine (1956) sought to 
Process and Reactive Schizophrenics


relate the concepts specifically to central nerv- 
ous system pathology. Although it was impos- 
sible to differentiate the process from the or- 
ganic group on the basis of Rorschach indices 
of brain damage, it was observed that both of 
these groups had significantly more individu- 
als with five or more of the “organic signs” 
than did the reactive group. The authors 
suggested, therefore, that “the difficulty in 
differential diagnosis of some kinds of schizo- 
phrenics and organics results from the in- 
volvement of central nervous system pathol- 
ogy in process schizophrenia.” 

Two perceptual techniques of “brass instru- 
ment” psychology have been found fairly ef- 
fective in differentiating brain damaged from 
nonorganic groups: critical flicker frequency 
(CFF) and the Archimedes Spiral. Numerous 
studies have substantiated the fact that brain 
damaged patients have significantly lower 
CFF thresholds than do nonorganic patients 
(Ross & Reitan, 1955; Teuber & Bender,
1948; Werner & Thuma, 1942). In psychiatric
patients with functional disorders CFF
has not been investigated as thoroughly, and
most studies are open to methodological criti- 
cisms including the small number of patients 
involved. In two of the better controlled 
studies, however (Irvine, 1954; Ricciuti, 
1948), the preponderance of evidence indi- 
cated that schizophrenic patients have signifi- 
cantly higher CFF thresholds than do brain 
damaged patients and that they do not differ 
significantly from normal individuals. 


Visual fixation by S upon a slowly rotating
Archimedes Spiral produces the illusion of expansion
or contraction of the disc depending
on the direction of rotation. When the spiral
is brought to a sudden stop the perceiver experiences
the illusion just the reverse of that
during rotation, i.e., a negative aftereffect.
Although the extremely high [illegible]
achieved by Price and Deabler (1955) [illegible]
subsequent [illegible] significant [illegible]
groups [illegible] system pathology and that this is specifically
cortical in nature, it was hypothesized that 
process schizophrenics and brain damaged pa- 
tients would have significantly lower CFF 
thresholds and would fail to perceive the 
spiral aftereffect significantly more often than 
would reactive schizophrenics and normals.


METHOD
Apparatus 


The CFF was tested by means of an episcotister 
viewed monocularly by S. Visual angle of the target 
was 2 degrees 28 minutes with an intensity of .085 
candles per square foot. A shutter which could be 
lowered over the target aperture allowed E to termi- 
nate the stimulus immediately at the end of each 
flicker trial. Power for the episcotister was supplied 
by a Cenco 115-volt, 60-cycle motor. Rate of rota- 
tion for the disc was controlled manually by E’s 
turning a small knob on the motor. Four equal sized 
sectors were cut in the disc in a manner to produce 
a light-dark ratio of 1.0. An automobile speedometer 
was used to measure the speed of the spinning disc 
and calibration by a Strobotac proved linear. 
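
As an illustrative aside, the flicker rate delivered by an episcotister follows from the disc geometry. The sketch below assumes that each open sector yields exactly one light-dark cycle per revolution, so a four-sector disc gives four cycles per turn. The function name and the example speed are hypothetical and are not taken from the calibration described above.

```python
def flicker_cps(revs_per_minute, open_sectors=4):
    """Flicker frequency in cycles per second for a sectored disc.

    Assumes each open sector produces exactly one light-dark cycle
    per revolution (an illustrative assumption, not the authors' calibration).
    """
    if revs_per_minute < 0 or open_sectors < 1:
        raise ValueError("speed must be non-negative and sectors positive")
    return open_sectors * revs_per_minute / 60.0


# Hypothetical speed: 240 rpm with four open sectors
print(flicker_cps(240))
```

Under this assumption, flicker rates in the range of the thresholds reported here correspond to disc speeds of a few hundred revolutions per minute.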

The Archimedes Spiral was made from a cardboard
disc, off-white in color and 8 inches in diameter.
A black stripe, one-half inch wide, was
painted on the cardboard so that it made one and 
three quarter turns from the center until it tapered 
completely off the outside edge of the disc. Speed 
of rotation was kept constant at 100 rpm. When 
rotated, an illusion of the disc contracting or draw- 
ing into the center was created in the typical viewer. 
On stopping the disc suddenly the “normal” experi- 
ence was to perceive the spiral as expanding. 


Subjects 


Eighty hospitalized male veterans between 19 and 
45 years of age with good visual acuity were placed 
in four groups of 20 Ss each. The organic group 
consisted of individuals with known brain damage 
chosen on the basis of their diagnosis as determined 
by neurological findings of the medical staff, case 
history material, and presenting symptomatology.
Diagnoses included multiple sclerosis, brain trauma,
brain tumor, Huntington’s Chorea, Parkinson’s Disease,
CVA, and intracranial infection. Mean age was
35.2 years with patients averaging 9.3 years of school- 
ing. The normal group was made up of 20 veterans 
from a general medical and surgical hospital and 
included every man on a surgical ward under the 
age of 45 with no history of emotional difficulties 
necessitating hospitalization or outpatient treatment.
Diagnoses included postoperative fractures, appendectomies,
torn cartilage, hemorrhoids, and sacral
strain. This group averaged 31.1 years of age with
a mean educational level of 10.7 years.


3 Some unpublished evidence appears to indicate
that size of disc and speed of rotation affect S’s perception
of the afterimage.

The third and fourth groups were made up of diag- 
nosed schizophrenics from an NP hospital. One hun- 
dred and sixty-one schizophrenic case histories were 
examined. From these a total of 76 were selected on 
the basis of their having: a case history with suffi- 
cient material to allow rating; no evidence either be- 
haviorally or from the record of any head injury or 
long period of unconsciousness; a background free
from repeated NP hospitalization during the previous 
10 years; and definite evidence from the history of 
gross schizophrenic symptomatology, e.g., hallucina- 
tions, delusions, ideas of reference, and bizarre be- 
havior. These 76 patients were then rated on Becker’s 
(1955) revision of the Elgin Prognostic Scale by two 
clinical psychologists. Interrater reliability was indi- 
cated by a Pearson r of .89. The two numerical rat- 
ings on each patient were then averaged and the re- 
sulting 76 scores ranked on a continuum. The 20 pa- 
tients receiving the lowest point totals were selected 
as being most “reactive,” and those 20 with the high- 
est scores most “process.” The decision to use Becker's 
rating scale was based on the belief that it offered a 
greater degree of exactness in the ratings, permitted 
a grading along the entire length of the continuum 
rather than a forced choice dichotomy or the crea- 
tion of a middle or mixed group, and appeared to 
fit more easily into a theoretical framework such as 
Bellak’s (1949) where schizophrenics are seen as 
ranging from almost completely organic to almost 
completely psychogenic. 
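
The selection rule described above (average the two raters' scores, rank the averages, and take the lowest and highest scorers) can be sketched as follows. This is a minimal illustration only: the function name, the patient identifiers, and the scores are hypothetical, and breaking ties by identifier is an assumption the text does not address.

```python
def select_extremes(ratings_a, ratings_b, n_extreme):
    """Average two raters' scores per patient and pick the extremes.

    ratings_a, ratings_b: dicts mapping a patient id to each rater's score.
    Returns (reactive_ids, process_ids): the n_extreme lowest and highest
    averaged scores, following the low = reactive, high = process convention.
    """
    if ratings_a.keys() != ratings_b.keys():
        raise ValueError("both raters must score the same patients")
    averaged = {pid: (ratings_a[pid] + ratings_b[pid]) / 2 for pid in ratings_a}
    # Sort by averaged score; ties are broken by id (an assumption).
    ranked = sorted(averaged, key=lambda pid: (averaged[pid], pid))
    if 2 * n_extreme > len(ranked):
        raise ValueError("extreme groups would overlap")
    return ranked[:n_extreme], ranked[-n_extreme:]


# Hypothetical scores for six patients (not data from the study)
rater_1 = {"p1": 30, "p2": 55, "p3": 42, "p4": 70, "p5": 25, "p6": 61}
rater_2 = {"p1": 34, "p2": 51, "p3": 40, "p4": 74, "p5": 27, "p6": 59}
reactive, process = select_extremes(rater_1, rater_2, 2)
print(reactive, process)
```

With 76 rated patients and n_extreme set to 20, the same call would leave the middle 36 patients unassigned, as in the design described.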

The process group had a mean age of 26.6 and an 
educational level of 10.7 years, while the reactive 
group averaged 33.6 years of age and had 10.8 years 
of schooling. Mann-Whitney U tests revealed no sig- 
nificant differences among the four groups in terms 
of educational level, but significant differences in age 
were obtained. Process schizophrenics were significantly
younger than the other three groups (p <
.001), a characteristic consistent with the early insidious
onset associated with this group. Organics
were significantly older than normals (p < .05),
which may be partly attributable to the progressive 
nature of some of the organic disorders, allowing the
patients to remain unhospitalized until somewhat
later in life.
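
For readers unfamiliar with the Mann-Whitney U statistic used in these group comparisons, a minimal sketch of its computation is given here. It counts, over all cross-group pairs, how often a value from one sample exceeds a value from the other. The ages shown are hypothetical, and the sketch stops at the statistic itself; significance levels would come from tables or a statistics library.

```python
def mann_whitney_u(sample_x, sample_y):
    """Return (U_x, U_y) for two independent samples.

    U_x counts the pairs in which the x value exceeds the y value,
    with tied pairs counted as one half. Significance levels are not
    computed here.
    """
    u_x = 0.0
    for x in sample_x:
        for y in sample_y:
            if x > y:
                u_x += 1.0
            elif x == y:
                u_x += 0.5
    # The two U values always sum to the number of cross-group pairs.
    u_y = len(sample_x) * len(sample_y) - u_x
    return u_x, u_y


# Hypothetical ages for two small groups (not the study's data)
younger = [22, 24, 25, 27]
older = [26, 31, 33, 38]
print(mann_whitney_u(younger, older))
```

A very small U for one group, as in this example, indicates that its values lie almost entirely below those of the other group.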


Procedure 


All Ss were informed that they were to undergo
two “special eye tests.” With the Archimedes Spiral,
Ss were seated 8 ft. from the face of the disc with
the room under ordinary illumination. Instructions 
used were those of Gallese (1956) in which the pa- 
tient was questioned as to his perception of the ro- 
tating disc. Four 30-sec. trials were given at the 
end of which the disc was stopped abruptly. S was 
then asked to describe what he “saw.” A scoring sys- 
tem devised by Price and Deabler (1955) was used 
to record S’s response. If his answers suggested that 
he perceived the aftereffect, he was given a score of
1 for that trial. If he did not report the illusion of
expansion, a score of 0 was recorded. S’s final score
was based on all four trials. A point total of 4, there-
fore, indicated that S perceived the aftereffect on all
four trials, whereas a score of 0 meant that he failed
to perceive the effect on any of his trials.
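
The scoring rule just described amounts to counting the trials on which the aftereffect was reported. A minimal sketch, with a hypothetical function name and a simple true/false coding of each trial, might read:

```python
def spiral_score(trial_reports):
    """Total aftereffect score over the four trials.

    trial_reports: one boolean per trial, True when the subject's answers
    suggested that the aftereffect was perceived on that trial.
    """
    if len(trial_reports) != 4:
        raise ValueError("the procedure described uses four trials")
    return sum(1 for perceived in trial_reports if perceived)


# Hypothetical subject who missed the effect on the second trial only
print(spiral_score([True, False, True, True]))
```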

In the case of CFF, S was carefully instructed on 
how to view the patch and given the following di- 
rections: 


The spot you see there [i.e., the test patch] is
going to flicker off and on. Stare at it steadily and 
don’t blink. After a while the light will no longer 
appear to be flickering. Instead, as you stare at it, 
it will gradually appear to become a steady light. 
As soon as it looks as though it is no longer flickering,
you say, “Now,” ... and I will turn it off.


The overhead light was then extinguished and the 
only illumination came from the well-shielded 74- 
watt bulb in the rear of the apparatus which per- 
mitted dial readings. After one minute of dark adap- 
tation, the shutter was raised allowing S to view the 
test patch. E manually increased the disc speed at a
constant rate (approx. 1 mph per second) until S re- 
ported the appearance of a steady light. A total of 
10 successive ascending trials were taken with about 
30 seconds between trials. Possible auditory cues from 
the episcotister were masked by an electric fan whose 
blades struck a small piece of cardboard attached to 
the fan guard. After considering the first three trials
as practice, mean flicker threshold was determined 
for each of the 80 Ss. Patients were found to be 
quite reliable in their reporting of fusion with only 
12 Ss obtaining an intertrial SD exceeding three 
quarters of a cps and only one S in the entire sam- 
ple with an SD more than 1 cps. 
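
The threshold computation described above (ten ascending trials, the first three discarded as practice, then a mean and an intertrial SD) can be sketched as below. The readings are hypothetical, and the use of the sample standard deviation is an assumption, since the text does not say which SD formula was applied.

```python
from statistics import mean, stdev


def cff_summary(trial_readings, n_practice=3):
    """Mean threshold and intertrial SD after dropping practice trials.

    trial_readings: fusion readings (cps) from successive ascending trials.
    Uses the sample standard deviation; which SD formula the authors
    applied is not stated, so that choice is an assumption.
    """
    scored = trial_readings[n_practice:]
    if len(scored) < 2:
        raise ValueError("need at least two scored trials")
    return mean(scored), stdev(scored)


# Hypothetical readings for one subject (10 ascending trials)
readings = [15.0, 15.5, 16.5, 16.0, 16.5, 16.0, 16.5, 16.0, 16.5, 16.0]
threshold, spread = cff_summary(readings)
print(round(threshold, 2), round(spread, 2))
```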


RESULTS 
Critical Flicker Frequency 


Mean CFF scores for each of the four 
groups are summarized in Table 1. Since vari- 
ances were not homogeneous, the Kruskal- 
Wallis H Test was used. An H value sig- 
nificant at the .001 level of confidence was 
obtained indicating that the four groups dif- 
fered in mean flicker threshold beyond chance 
expectation.


TABLE 1

CRITICAL FLICKER FREQUENCIES
IN CYCLES PER SECOND

            Mean    Variance
Organic     13.20     2.30
Normal      16.38      .26
Process     16.43      .13
Reactive    16.67      .94


Mann-Whitney U Tests run between
all combinations of the groups in pairs
showed that the organic group differed very 
significantly (p <.0001) from each of the 
other three groups. A significant difference in 
CFF at the .05 level was found to exist 
also between normals and reactives. Although 
mean CFF for the process group was slightly 
lower than that of the reactive group, the 
difference was not significant. In order to de- 
termine whether a more rigorous dichotomiza- 
tion would yield differences between the two 
schizophrenic groups, the 10 Ss rated most 
reactive (i.e., lowest Elgin ratings) were com- 
pared with the 10 rated most process (i.e., 
highest ratings). Again, no significant differ- 
ences were found between process and reac- 
tive patients in terms of CFF. On the possi- 
bility that a relationship existed between a 
schizophrenic S’s flicker rank and his rank on 
the Elgin Scale, Kendall’s tau was computed. 
The resulting correlation of .048 indicated 
essentially no relationship between the magni- 
tude of the schizophrenic’s Elgin rank (or po- 
sition on the process-reactive continuum) and 
his ranked CFF score. The ability of CFF to 
discriminate brain damaged from all other 
patients tested was exceptionally good, however.
When compared with normals, organic
Ss were successfully discriminated by their
flicker threshold 95% of the time since only
one organic S obtained a higher CFF score
than that of the lowest normal.* Somewhat
less discrimination was present in comparison
of process and reactive groups with the
organics (90% and 75% respectively).
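
The discrimination percentages quoted above appear to rest on a simple cutoff: an organic S counts as discriminated when his CFF falls below that of the lowest S in the comparison group. A minimal sketch of that rule follows. The scores are hypothetical, and treating a tie with the cutoff as a failure to discriminate is an assumption.

```python
def percent_below_cutoff(organic_scores, comparison_scores):
    """Percentage of organic scores falling below the lowest comparison score.

    An organic S is counted as discriminated if his CFF lies below that
    of the lowest subject in the comparison group. Treating ties as
    failures is an assumption.
    """
    cutoff = min(comparison_scores)
    hits = sum(1 for score in organic_scores if score < cutoff)
    return 100.0 * hits / len(organic_scores)


# Hypothetical scores: 19 of 20 organics fall below the lowest normal
organics = [13.0] * 19 + [16.0]
normals = [15.5, 16.0, 16.4, 17.0]
print(percent_below_cutoff(organics, normals))
```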


Archimedes Spiral


With the exception of the 20 brain damaged
patients, all of the Ss (i.e., 20 normals,
20 process, and 20 reactives) obtained perfect
scores on the task. Although χ² revealed that
organics were significantly different from all
of the other groups (p < .01), the ability of
the spiral technique to discriminate brain
injured subjects from nonorganic individuals
was relatively poor. Of the 20 organics only
9 failed to obtain perfect scores on the
task. This resulted in a test discrimination
index of 45%. Only 5 of the 9 organics were
totally unable to perceive the effect on any of
the four trials. No relationship was observed
between organics’ scores and their neurological
diagnoses.


* This S was a frontal lobe injury.


DISCUSSION


There were no differences in CFF between 
schizophrenics classified process and reactive. 
Although the method of classification may be 
speculated on as one reason for this failure,
other studies utilizing the method have dif- 
ferentiated the two groups (Becker, 1955; 
King, 1958). Present results do not suggest 
demonstrable cortical defect as measured by 
CFF in either process or reactive schizo- 
phrenia. The fact that the reactive group had 
a higher CFF than did the normals warrants 
speculation. Heightened physiological respon- 
siveness of reactive schizophrenics noted in 
previous studies was attributed by King 
(1958) to the heightened homeostatic re- 
sponse which reflects the precipitous nature 
of their adjustment. Other investigations (Ric- 
ciuti, 1948; Wiersma, 1906) have indicated 
that manic patients have higher CFFs than 
do other groups. Similarity between manic 
or affective symptoms and characteristics of 
schizophrenics who might be classified reac- 
tive has also been noted in previous studies 
(Cameron, 1938; Funkenstein, Greenblatt, & 
Solomon, 1948). Affective components in the 
clinical picture are among the bases for clas- 
sification as reactive in the rating scale em- 
ployed in the present investigation. There- 
fore, it seems possible that some patients 
who formerly might have been diagnosed 
manic-depressive are now being diagnosed 
schizo-affective or reactive schizophrenic, and 
these manic features may account for the re- 
active group’s higher than normal CFFs. On 
the other hand, the significant difference found 
between normals and reactives may merely 
indicate a comparison of the extremes in a 
homogeneous distribution. 

A possible alternative explanation of the re- 
sults might contend that a defect in the ex- 
perimental design was responsible for lower 
flicker thresholds in the organics. This posi- 
tion would maintain that since only ascending 
series were used in the present study it is pos- 
sible that the organics were responding prior 
to their subjective impressions of complete 
fusion and that this premature reaction 
would also have been obtained in descending 
series, thus changing the results. However,
since previous investigations (Irvine, 1954; 
Teuber & Bender, 1949; Werner & Thuma, 
1942) utilizing both ascending and descend- 
ing series have yielded consistently lower 
thresholds for organics, it would seem more 
parsimonious to accept a lower CFF threshold 
on the part of organics, rather than prema- 
ture reaction, as the explanation for these 
results.

The present experiment appears to provide 
additional support for CFF as a good tech- 
nique in discriminating brain damaged per- 
sons with gross destruction from normals and 
schizophrenics. Increased utilization of the 
technique in the ordinary hospital setting 
seems to be well warranted, a possibility rec- 
ognized by other investigators. Its potential 
seems especially good when a comparison is 
made with the relatively poor discriminating 
ability shown by the usual paper and pencil 
tests of organicity. 

Although the spiral aftereffect differenti- 
ated organic Ss from all other groups, only 
45% of the brain damaged Ss were identifi- 
able through failure to perceive the effect. 
Since only known organic cases were included 
in the present study it seems unlikely that 
this technique will be of extensive diagnostic 
value in a clinical setting where less severe 
organic cases are to be classified. The fact 
that none of the schizophrenics of either type 
failed to perceive the aftereffect does not, at 
least, support the position that cortical de- 
fect is involved in process schizophrenia. 


SUMMARY 


The present experiment was designed to in- 
vestigate the possibility of organic involve- 
ment, particularly of a cortical nature, in 
process schizophrenia. Recent studies dichoto- 
mizing schizophrenics into process and reac- 
tive types have suggested cortical damage in 
the former group. It was hypothesized, there- 
fore, that patients classified as process schizo- 
phrenics would perform similarly to brain 
damaged individuals on two perceptual tasks, 
while reactive schizophrenics would more 
closely resemble normal controls. The two 
perceptual tasks used were critical flicker fre- 
quency and the Archimedes Spiral aftereffect, 
both of which had proven effective previously
in differentiating brain damaged from non- 
organic Ss. Four groups of 20 Ss were given 
both tasks: reactive and process schizophren- 
ics, organics with known brain damage, and 
normals. Results of the experiment showed 
that organic patients were significantly dif- 
ferent from all other groups in terms of CFF 
threshold and ability to perceive the spiral 
aftereffect. Process and reactive schizophren- 
ics did not differ from each other on either 
task. Reactive schizophrenics differed from 
normals on CFF in having higher thresholds. 
Attempts to account for the findings included 
the possibility that affective components of 
reactive schizophrenia contribute to the dif- 
ference between them and other schizophren- 
ics and normals, and the belief that process 
schizophrenia, while showing some physio- 
logical differences from reactive schizophrenia 
in other investigations, probably is not re- 
lated to defects of a cortical nature. 
AN ATTEMPT TO EMPLOY A SENTENCE COMPLETION TEST
FOR THE DETECTION OF PSYCHIATRIC PATIENTS
WITH SUICIDAL IDEAS 1


HERMAN Y. EFRON 
Construction of a test suitable for group
administration which could detect those who 
are morbidly preoccupied with depressive 
thoughts would be extremely useful to ward 
administrators in hospitals where intensive 
psychotherapeutic contacts are the exception 
rather than the rule. This study is an attempt 
to assess a group screening device (sentence 
completion test) to identify those patients
who may constitute suicidal risks. 

The D scale of the MMPI (Broida, 1954) 
and some aspects of responses to the Make-a- 
Picture-Story (MAPS) (Farberow, 1950) dif- 
ferentiated between suicidal and nonsuicidal 
patients. However, the few experimental studies
with the TAT yielded negative results.
Neither responses to Card 3BM (Broida, 
1954) nor characteristics of the TAT heroes 
(Shneidman & Farberow, 1958) distinguished 
suicidal from control groups. There have been 
proposed in the literature various Rorschach
signs and patterns that can be used to de- 
tect potential suicidal behavior (Hertz, 1948, 
1949; Piotrowski, 1950; White & Schreiber, 
1952); however, other studies generally fail 
to confirm their validity (Fisher, 1951; Sak- 
heim, 1955). Results indicate that Hertz’s 
(1948) Configuration Technique is probably 


1 This study was an individual hospital project of 
the Veterans Administration Psychiatric Evaluation 
Project, Richard L. Jenkins, Director. This paper 
was reviewed in the Veterans Administration and is 
published with the approval of the Chief Medical 
Director. The statements and conclusions of the author
are the result of his own study and do not
necessarily reflect the opinion or policy of the Veter- 
ans Administration. The author wishes to express his 
gratitude to Marcus P. Rosenblum for his construc- 
tive suggestions, and to Leon Cohen, Eric Layne, 
David Levine, and John Seymour for their coopera- 
tion in rating the sentence completions. 


the most useful of the prognostic scales (Sak- 
heim, 1955), but even her technique failed 
when used with schizophrenic patients (Fisher, 
1951). Even if such a test as the Rorschach 
were capable of differentiating between po- 
tential suicidal patients and nonsuicidal pa- 
tients, the time involved in arriving at these 
decisions would preclude their general use. 
The number of psychologists needed to serv- 
ice a psychiatric hospital would be far in ex- 
cess of current staffing patterns even though 
periodic retesting was restricted to those pa- 
tients whose case histories warrant some sus- 
picion of self-destructive behavior. A self-ad- 
ministering test which could be repeated at 
specified intervals, tapping both attitudes 
within and outside the patient’s awareness, 
seemed to be the most desirable instrument. 
It was felt that the sentence completion test 
might be such an instrument. 

The most difficult problem in designing an 
experiment to test the efficiency of an instru- 
ment to predict suicide is that of an appro- 
priate criterion. Because of the low incidence 
of suicide it is not practicable to rely upon 
this as a criterion measure. Rosen (1954) 
has clearly analyzed the inherent problems in 
predicting an event whose occurrence among 
psychiatric patients (as judged from statistics 
covering 1936 to 1949 at a large state hos- 
pital) is roughly 3 per 1,000 admissions 2 
(Levy & Southcombe, 1953). In his article, 
Rosen (1954) demonstrates that if a test were 


2 Using the same data, Rosen (1954) incorrectly
arrived at .0032 as a rough index of suicide among
the psychiatric population. Actually, since there are
about three to four times as many resident patients
per year as admissions per year, a closer approximation
to the actual suicide rate would be one third to
one fourth of .0032.


Detection of Suicidal Patients


devised that could correctly identify whether 
a patient was suicidal or nonsuicidal 75% of 
the time, the number of false positives (i.e., 
nonsuicidal patients being identified as sui- 
cidal) in a patient population of 12,000 would 
be close to 3,000. Recognizing this severe 
limitation it was still felt that if a group 
screening device that would approach 75% 
accuracy could be developed it would serve 
to alert the hospital personnel to investigate 
more thoroughly all patients designated as 
“suicidal.” In a hospital of 2,000 this would 
mean more vigilant study of about 500 pa- 
tients, an aim that might be feasible. Rosen 
(1954) emphasizes the need for researchers 
to formulate more precise definitions of ‘‘sui- 
cidal” patients. Studies by Farberow (1950) 
and Rosen, Hales, and Simon (1954) found 
marked group differences between the MMPI 
scores of patients who attempted suicide and 
those who threatened suicide. The group that 
made threats gave significantly higher mean 
scores on most all scales. Both studies indi- 
cate that as a group those who made suicidal 
threats were more severely disturbed (at the 
time of testing, which was after those in the 
attempted suicide group had executed their 
attempts). The present study deals with pa- 
tients who were known to have expressed con- 
spicuous suicidal thoughts, marked nihilistic 
ideas, or severe guilt with concern over pun- 
ishment or torture. A basic assumption is that 
patients with such morbid ruminations are 
greater suicidal risks than those who do not 
have such preoccupations. The only evidence 
that can be cited to support the assumption 
bears indirectly upon it and is possibly irrelevant,
viz., Shneidman and Farberow’s (1956)
report that almost all of the people they
studied who did commit suicide had given
some earlier warning of their intentions. The
hypothesis of this study is that psychologists
can, on the basis of clinical judgment, differ-
entiate between the sentence completions of
patients with suicidal ruminations and those
without such ruminations.
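
The base-rate argument attributed to Rosen above can be checked with a short calculation. The sketch below assumes that the 75% accuracy applies equally to suicidal and nonsuicidal patients and uses the .0032 index quoted in the text (which the footnote argues is itself too high); the function and variable names are hypothetical.

```python
def screening_outcomes(population, base_rate, accuracy):
    """Expected screening counts when a test is right `accuracy` of the time.

    Assumes the same accuracy applies to suicidal and nonsuicidal patients.
    Returns (true_positives, false_positives, total_flagged).
    """
    suicidal = population * base_rate
    nonsuicidal = population - suicidal
    true_positives = accuracy * suicidal
    false_positives = (1.0 - accuracy) * nonsuicidal
    return true_positives, false_positives, true_positives + false_positives


# Figures quoted in the text: 12,000 patients, 75% accuracy, index of .0032
tp, fp, flagged = screening_outcomes(12000, 0.0032, 0.75)
print(round(tp, 1), round(fp, 1), round(flagged, 1))
```

Under these assumptions the false positives come to just under 3,000 for 12,000 patients, and the same call with a population of 2,000 flags roughly 500, in line with the figures cited.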


METHOD 


Test Instrument


Some incomplete sentences were purloined from
existing tests and still others were constructed until
the finished test consisting of 54 items was evolved.3
Almost a third of the items appear to be weighted 
toward eliciting depressive thema. Included in this 
group were such items as “If there were no way out 
I would—,” “My greatest fault is—,” and “Bob's 
defeat made him—.” 


Procedure 


Subjects (Ss) were tested in small groups of 10 to
18 patients and were given one hour in which to complete
the test. Throughout the testing session the examiner
encouraged those patients who seemed to be
having difficulty in completing the sentences. When 
more than about half the items were omitted the pa- 
tient was asked to write the reason for his difficulty 
in completing the sentences. In a few instances the 
patients even refused to do this, and the examiner 
wrote the reason the patient stated verbally. 

Phase I. Of the 104 obtained completions 54 were 
given to four staff psychologists. They were told that 
18 were done by suicidal patients, 9 by assaultive pa- 
tients, 27 by patients who were neither suicidal nor 
assaultive, and they were then asked to distribute 
them into their proper categories. After every judg-
ment they were requested to state the degree of con-
fidence they had in their judgment. Coding for these
statements was: 1. I'm in doubt whether this is correct.
2. I feel fairly confident that this is correct. 3. I
feel confident that this is correct. The only identifying
data were the patient's code number and age.

Phase II. From the 54 completions used in Phase I,
all those with less than 25 responses were eliminated. 
The same four psychologists were asked to redis- 
tribute the tests which now represented 10 suicidal 
patients, 3 assaultive patients, and 25 nonsuicidal 
and nonassaultive patients. They were again asked to 
rate their confidence in each decision. 

Phase III. In order to obtain an estimate of the 
frequency of false positives, the remaining 50 com- 
pletions (which were all done by patients who were 
nonsuicidal and nonassaultive) were given to three 
of the four psychologists with the following instructions:

The following sentence completion tests were ob- 
tained from patients who may have been on sui- 
cidal observation status, on assaultive status, on 
suicidal and assaultive status, or on none of the 
above. On the accompanying Summary Sheet, will 
you please distribute the code numbers of the pa- 
tients into their appropriate categories. Remember 
the frequency in any of the above categories may
be zero.


This time there was also a “cannot rate” category 
in the event they found it impossible to reach a de- 
cision. Again they were requested to state the degree 
of confidence they had in each judgment. 

Subjects. Patients were selected from the acute in-
tensive treatment service of a large Veterans Ad-
ministration hospital. For a patient to have been con-
sidered suicidal he must have expressed serious self-
destructive thoughts, marked nihilistic ideas, or severe
guilt with concern over punishment and torture to
the ward physician, nurse, or aide within the week
prior to testing. To have been considered assaultive *
the patient, in the estimation of the ward physician,
had to present a high risk of acting out aggressively.
Those patients who, for at least two months prior to
testing, had not expressed any self-destructive or de-
pressive ideas and had no known history of suicidal
attempt were considered nonsuicidal. Nonassaultive
patients were those whom the ward physician had
no reason to suspect would act out aggressively. Of
the 104 Ss, 18 were suicidal, 9 were assaultive, and
77 were both nonsuicidal and nonassaultive. Of the
latter group, 33 were privileged (i.e., had freedom of
the hospital grounds) and 44 were nonprivileged.
Table 1 contains a further description of the groups.


3 A copy of the incomplete sentences can be ob-
tained from the author.


TABLE 1

DESCRIPTION OF SAMPLE

                              Phase I     Phase I       Phase III
Demographic Variable          Suicidal    Nonsuicidal   Nonsuicidal
N                             18          36            50
Diagnosis
  Schizophrenic               8           [illegible]   [illegible]
  Manic-depressive            2           [illegible]   [illegible]
  Chronic brain syndrome      0           1             [illegible]
  Other                       0           0             2
Current hospitalization
  Less than 6 months          7           5             16
  Six months to 2 years       2           [illegible]   16
  More than 2 years           9           8             18
All NP hospitalizations
  Less than 6 months          4           1             2
  Six months to 2 years       2           12            18
  More than 2 years           12          23            30
Age
  Mean                        35          37            36
  SD                          5           1             9


RESULTS AND DISCUSSION 


The number of correctly assigned suicidal 
patients was compared with the chance ex- 
pectancy of such an occurrence as computed 


* Assaultive patients were included in the study in 
order to approximate the clinical situation and there- 
by encourage each rater to evaluate the completions 
more critically. Also, it was intended to prevent spu- 
rious positive findings that may have resulted by 
virtue of the fact that more severely disturbed pa- 
tients have a greater likelihood of being rated suicidal. 


from the hypergeometric function. For Phase 
I only one of the four psychologists’ distribu- 
tions warranted assuming that other than 
chance factors were operating. The probabili- 
ties obtained were .82, .38, .06, and .02. When 
the tests which contained less than 25 re- 
sponses were discounted (Phase II) none de- 
parted significantly from chance expectation 
(P = .97, .53, .53, .06). The over-all percentage
of correct identification of patients with
suicidal ruminations was 43% for Phase I 
and 30% for Phase II. Since in Phase III the 
true distribution was not disclosed to the psy- 
chologists, their ratings provide us with an 
index of the false positives we might expect. 
The mean percentage of false positives was 
33%.5 Clearly these results indicate that the
test as used is woefully inadequate to provide 
practical value in the detection of suicidal 
patients. 
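
The chance expectancy mentioned above can be illustrated with the hypergeometric distribution. If a rater who is told that 18 of 54 records are suicidal sorts purely by chance, the number of truly suicidal records among his 18 "suicidal" placements is hypergeometric. The sketch below computes the upper-tail probability of a given number correct. Whether the reported probabilities were tail probabilities of exactly this kind is an assumption, and the count of 10 correct is hypothetical.

```python
from math import comb


def prob_at_least(correct, population, true_suicidal, assigned_suicidal):
    """P(X >= correct) when X is hypergeometric.

    X is the number of truly suicidal records among those a rater places
    in the suicidal category, if the rater were sorting purely by chance.
    """
    total_ways = comb(population, assigned_suicidal)
    upper = min(true_suicidal, assigned_suicidal)
    favourable = sum(
        comb(true_suicidal, k)
        * comb(population - true_suicidal, assigned_suicidal - k)
        for k in range(correct, upper + 1)
    )
    return favourable / total_ways


# Phase I layout: 54 records, 18 suicidal, 18 slots in the suicidal category.
# The count of 10 correct is hypothetical, not a rater's actual result.
print(prob_at_least(10, 54, 18, 18))
```

By chance alone a rater would be expected to place about 6 of the 18 suicidal records correctly (18 × 18 / 54), so counts well above 6 are what drive the probability down.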


A comparison was made between the num- 


5 It is possible that the set established in Phases I 
and II by virtue of the raters being told the number 
of suicidal patients to expect resulted in increasing 
the probability of false positives. However, for one 
rater Phase III preceded the others. His percentage
of false positives was 30%. 




ber of agreements among psychologists with 
the number to be expected if chance were the
only factor. The expectancy derived from 
probability theory was compared with the ob- 
tained results by means of chi square. For the 
purpose of these comparisons all the distribu- 
tions were dichotomized into the categories 
“suicidal” and “others.” The chi squares for 
Phase I and Phase II were significant beyond 
the .01 level of significance, and for Phase III
beyond the .05 level. When the assaultive pa- 
tients were grouped with the suicidal patients 
the chi squares for Phase I and II were sig- 
nificant beyond the .001 level and for Phase 
III beyond the .01 level. This latter finding 
coupled with the previous observation that 
the psychologists did somewhat better when 
records with less than 25 responses were in- 
cluded tends to indicate that their ratings 
were more in terms of severity of illness than 
focus upon specific traits that may be asso- 
ciated with suicide. Further evidence to sup- 
port this formulation is available from the 
comparison of the number of privileged and 
nonprivileged patients rated as suicidal. Al- 
though both these groups were actually non- 
assaultive and nonsuicidal, the likelihood of 
a nonprivileged patient being rated as suicidal 
was 60% greater. Since suicidal ruminations 
may be the result of varied and wholly differ- 
ent intrapsychic processes, our findings are 
not surprising. 
In analyzing the relationship between the 
degree of confidence and correctness of deci- 
sion it was found that with the exception of 
the one psychologist who did better than 
chance in Phase I there was no marked increased
accuracy when a decision was made with confidence.
The exceptional rater was correct in 11 of his 14
high confidence decisions and only 4 of his 14 low
confidence decisions. Such a difference would occur
on a chance basis less than one time in a hundred
(chi square with Yates correction). Using only the
high confidence ratings of this last rater he was
able to place correctly 79% of the suicidal
patients. It would therefore seem that such a test
as the one described may, in the hands of a
proficient rater, have some usefulness in detecting
suicides. Further research focused upon identifying
proficient raters seems warranted.


SUMMARY


One hundred and four psychiatric patients 
were given a sentence completion test con- 
structed to elicit suicidal ideation. Of the 
104 subjects, 18 were known to have been 
preoccupied with suicidal ruminations. In one 
part of the experiment, four clinical psycholo- 
gists categorized sentence completions as com- 
ing from suicidal, assaultive, and nonassault- 
ive patients after having been given the fre- 
quencies in each category. In another part of 
the experiment, three psychologists were given 
different sentence completions and asked to 
categorize them without being told the actual 
distribution. 

The over-all percentage of correct identifi- 
cation of patients with suicidal ruminations 
varied from 30% to 43%. The mean percent- 
age of false positives was 33%. Although the 
psychologists’ predictions showed no greater 
relationship to the criterion measure than 
chance expectancy, they agreed with each 
other more often than would be expected if 
chance were the only factor. This, coupled 
with other evidence, was taken to indicate 
that the psychologists’ ratings were primarily 
in terms of severity of illness rather than 
focusing on specific suicidal traits. As a group, 
the raters’ performance was not significantly 
better when they had high confidence in their 
decisions. However, the high confidence ratings
of one psychologist correctly placed 79% of the
suicidal patients. This was taken to indicate that
there was a possibility of identifying proficient
raters who could use this test
successfully.


FACTOR ANALYSES OF “ANXIETY” AND
“NEUROTICISM” INVENTORIES


A. W. BENDIG
University of Pittsburgh


In recent years a number of different questionnaires
purporting to measure “anxiety” (Cattell, 1957b;
Taylor, 1953) or “neuroticism” (Cattell, 1955;
Eysenck, 1956a; Winne, 1951) have been independently
developed by different methodologies, and subsequent
studies have shown relatively high correlations
among these scales (Bendig, 1957; Holtzman,
Calvin, & Bitterman, 1952; Windle, 1955).
These correlations suggest that the Eysenck,
Winne, and Taylor scales may all be measuring
the same objective personality dimension.
However, several attacks have been launched
against the unidimensionality of these scales.
Eysenck (1956b, 1957) argues that such
questionnaires are confounding the independent
dimensions of neuroticism and extraversion,
and his hypothesis has received some experimental
verification (Bendig, 1957, 1958; Frey & Becker,
1958). A second objection to such scales has been
that of Edwards (1957), who suggests that much of
the variance of such scales can be attributed to
social desirability, or the tendency of Ss to give
socially desirable responses to personality items
regardless of their content. The large correlations
between Edwards’ Social Desirability Scale (SDS)
and Taylor’s MAS (Edwards, 1957, p. 33) and
Cattell’s Anxiety scale (Bendig, 1959b) have
supported this hypothesis. However, 22 of the 39
items on Edwards’ scale overlap with Taylor’s MAS;
and when only the 17 non-MAS items are scored
on the Social Desirability Scale, Edwards found
the MAS correlation reduced (Edwards, 1957), and
Bendig (1959b) has reported that the Cattell
Anxiety score correlation decreased. These
findings suggest that the Edwards SDS is also
not unidimensional, but measures a combination
of social desirability, anxiety, and/or
neuroticism factors. Parenthetically it might 
be noted that the SDS shows a low positive 
correlation with the MMPI Lie scale (Ed- 
wards, 1957, p. 44), suggesting that the SDS 
scores may also involve a “lie” component. 
At a theoretical level, Spence (1958) argues 
for the construct validity of Taylor’s MAS as 
an operational definition of “manifest anx- 
iety,” now renamed “emotionality” or “emo- 
tional reactivity,” while Eysenck (1956b; 
1957), as noted above, regards anxiety as a 
combination of introversion and neuroticism.
Although Spence may be regarded as a monist 
in this area of personality theorizing, and Ey- 
senck as a dualist, Cattell (1957a) is a trini- 
tarian since he regards anxiety, neuroticism, 
and extraversion as three relatively independ- 
ent personality dimensions and has published 
inventories for the measurement of both neu- 
roticism and anxiety (Cattell, 1955, 1957b). 
So we are left with one, two, or three personality
factors with varying identifications depending on
the theoretical orientation of the scientist.

The present research was designed to investigate
the number of orthogonal factors
present in a battery of independent question- 
naires purportedly measuring the above noted 
personality dimensions and to clarify the fac- 
torial composition of each of these commonly 
used inventories. It was assumed that the in- 
tercorrelations among the anxiety and neu- 
roticism scores might be influenced by other 
contaminating factors, such as extraversion, 
social desirability, falsification, and omnipres- 
ent sex differences, and the inclusion of 
marker variables for these hypothetical fac- 
tors would help to purify the factors ob- 
tained. The central question of the study was 
whether, when the influence of other factors
was isolated by these marker variables, a single
general factor of “emotionality” (Spence, 1958) or
separate group factors of “anxiety” and
“neuroticism” (Cattell, 1957b) would appear
necessary to account for the interrelationships
among the scales.


STUDY I
Procedure 


Scales. The scales included in this study were as- 
sembled into a single booklet containing 230 items 
from which 12 scores were obtained. Only 10 scores 
were used with each of two groups of Ss, since four 
of the scores were not experimentally independent 
(see below). 

1. Manifest Anxiety Scale (Taylor, 1953) included
only the 50 dichotomous-response items that are
actually scored for this scale (McCreary & Bendig,
1954).¹

2. Winne’s (1951) Neuroticism scale: Since 8 of
the 30 dichotomous-response MMPI items on this
scale are included in the Manifest Anxiety scale, only
the 22 experimentally independent items were used.²

3. Edwards’ (1957) Social Desirability Scale: Only
the 17 dichotomous-response MMPI items that are
not included in Taylor’s MAS were scored for this
scale.³

4. Eysenck’s (1956a) Extraversion scale, which is
included in the Maudsley Personality Inventory,
contains 24 items.

5. Eysenck’s (1956a) Neuroticism scale is the second
24-item scale included in the MPI. Dichotomous
item responses (Yes-No) were used with items on
both of the Eysenck scales (Bendig, 1959a).

6. MMPI Lie scale (Hathaway & McKinley, 1951):
Since two of the standard 15 items appeared
inappropriate for our college Ss, only 13
dichotomous-response items were used.⁴

7. Cattell’s (1957b) Covert Anxiety scale included
the first 20 “subtle” trichotomous-response
(true-?-false) items of the IPAT Anxiety scale.

8. Cattell’s (1957b) Overt Anxiety scale included
the last “overt, symptomatic” items of the IPAT
Anxiety scale.

9. Cattell’s (1956b, p. 7) Difference score was the 
score for Overt Anxiety minus the score for Covert 


1 The MMPI booklet numbers can be found in
Windle (1955).

2 The MMPI booklet numbers and direction of
scoring of these items are: 5, 29, 41, 44, 47, 72, 76,
[illegible], 189, and 236 scored True; and 2, 3, 9,
[illegible], 178, and 208 scored False.

3 The MMPI booklet numbers and direction of
scoring of these items are: 54, 169, and 257 scored
True; 40, 42, 138, 148, 156, 171, 218, 245, 247, 252,
[illegible], 269, 286, and 383 scored False.

4 The MMPI booklet numbers are: 15, 30, 45, 60,
75, 90, 105, 120, 150, 165, 195, 225, and 285 scored
False.




Anxiety. Cattell suggests that the difference between
the Covert and Overt scores “is intended to give a
measure of the degree to which the patient is or is
not conscious of his anxiety, and perhaps also of any
wish consciously to emphasize it.”

10. Cattell’s Total Anxiety score was the sum of
the Covert Anxiety and Overt Anxiety scores. Since
these four scores derived from the IPAT Anxiety
scale are not all experimentally independent, the
Covert Anxiety and Overt Anxiety scores (which are
independent) were included in the analysis of the
data from one group of Ss (Group A), while
the Difference and Total Anxiety scores (which also
are independent) were used in the analysis of the
data from the second group of Ss (Group B).

11. Cattell’s Neuroticism scale contained the 34 
trichotomous-response items scored in the IPAT 
Neurotic Personality Factor questionnaire (Cattell, 
1955). The items were reversed in scoring (high 
score representing high neuroticism) for consistency 
in scoring direction with other similar scales in- 
cluded in this study. 

12. Cattell’s (1956b) Motivational Distortion scale 
contains the remaining six trichotomous-response 
items in the IPAT NPF inventory (Cattell, 1955). 
Cattell’s description of the construction of this scale 
suggests that it is measuring a factor similar to that 
assessed by the MMPI Lie scale. 

In the above list, Variables 3, 4, 6, and 12 were in- 
cluded as marker variables for possible contaminat- 
ing factors, while Variables 1, 2, 5, 7, 8, 9, 10, and 
11 provided the test of the single Emotionality fac- 
tor versus the double Anxiety-Neuroticism factor 
hypotheses. 

Subjects. The scales were administered during a
regular class session to 11 sections of introductory
psychology, with a total of 425 Ss (261 men and
164 women) being tested. The Ss are primarily
freshman and sophomore students in the School of the
Liberal Arts with smaller proportions of juniors and
seniors and of students enrolled in the School of
Business Administration and the School of
Engineering and Mines. After scoring, the answer
sheets were alphabetized by Ss’ names and the first
118 male Ss and the first 82 female Ss selected as
Group A. In a similar manner the next 118 male and
82 female Ss were selected as Group B. The mean age
of the 400 Ss was 20.8 years, with a standard
deviation of 4.5 years. As noted above, the only
procedural difference in analyzing the data from
each group was that Cattell’s Covert Anxiety and
Overt Anxiety were included as score variables for
Group A, while Cattell’s Difference and Total
Anxiety scores were used in the analysis of the
data from Group B. Ss’ sex (male scored 1, female
scored 0) was included in each analysis as the
eleventh variable.

TABLE 1

ROTATED ORTHOGONAL FACTOR LOADINGS (DECIMAL POINTS OMITTED) OF INVENTORY SCORES

                                          Group A                  Group B
Scales                                Em     F    Sx    h²     Em     F    Sx    h²
Taylor Manifest Anxiety Scale         83   -19    21    77     82   [?]   [?]   [?]
Winne Neuroticism (Abbreviated)       61   -05    05    37    [?]   [?]   [?]   [?]
Edwards Soc. Desir. (Abbreviated)    -57    39    17    51    [?]   [?]   [?]   [?]
Eysenck MPI Extraversion scale       -36   -04   -08    14    -46   -05   -03    32
Eysenck MPI Neuroticism scale         75   -33    12  [68]     67  [-50]   18    73
MMPI Lie scale (Abbreviated)         -07    66    05  [44]    -08    67    02   [?]
Cattell Covert Anx. subscale          50  [-64]   00    66
Cattell Overt Anx. subscale           60   -37    40    66
Cattell Difference score                                       47    09    37    37
Cattell Total Anxiety scale                                    61   -66    21    85
Cattell NPF Neuroticism scale       [34]   -09    66    56     42   -37    55    61
Cattell NPF Motiv. Dist.             -05    62    12    40    -05   [?]   [?]    41
Sex (Female)                         -02    03    56    31    -04    08    54    30

Note. [?] marks an entry that is illegible in the source copy; bracketed
numbers are readings of damaged entries that agree with the printed
communalities.


Method. The 10 questionnaire variables within each
group of Ss were intercorrelated by the usual
product-moment method, and these variables were
correlated with the sex dichotomy by the point
biserial formula. Each matrix of 55 intercorrelations
was factor analyzed by the complete centroid method
and four factors were extracted. Each of the
successive factors was tested for statistical
significance using both Tucker’s and Humphrey’s
criteria (Fruchter, 1954, pp. 77-80). The significant
factors in each matrix were analytically rotated to
orthogonal simple structure using the normalized
varimax method (Kaiser, 1958).⁵
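
Two details of the Method can be checked with a short sketch: 11 variables give 11 x 10 / 2 = 55 distinct intercorrelations, and the point biserial formula yields the same value as an ordinary product-moment correlation computed with the dichotomy scored 0/1. The scores and sex codes below are made-up illustrative values, not data from the study, and the function names are hypothetical.

```python
from math import sqrt


def n_intercorrelations(n_variables):
    """Number of distinct pairs among n variables."""
    return n_variables * (n_variables - 1) // 2


def pearson_r(x, y):
    """Ordinary product-moment correlation."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def point_biserial(scores, groups):
    """Point biserial r for a 0/1 dichotomy, using the population SD of scores."""
    n = len(scores)
    ones = [s for s, g in zip(scores, groups) if g == 1]
    zeros = [s for s, g in zip(scores, groups) if g == 0]
    p = len(ones) / n
    mean_all = sum(scores) / n
    sd = sqrt(sum((s - mean_all) ** 2 for s in scores) / n)
    return (sum(ones) / len(ones) - sum(zeros) / len(zeros)) / sd * sqrt(p * (1 - p))


# Illustrative values only (hypothetical scale scores; male = 1, female = 0).
scores = [12, 15, 9, 20, 14, 7]
sex = [1, 0, 1, 0, 1, 1]

if __name__ == "__main__":
    print(n_intercorrelations(11))
    print(point_biserial(scores, sex), pearson_r(scores, sex))
```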


Results 

The first four centroid factors from Groups
A and B accounted for 53.2 and 54.8% of the
variance of the 11 variables within each group
of Ss, with the Group A factors accounting
for 34.6, 9.1, 6.3, and 3.2% of the variance,
while the similar percentages from Group B
were 38.6, 8.7, 5.2, and 2.3. The application
of Humphrey’s rule in testing the significance
of the factors indicated that the first three
factors from Group A and the first two factors
from Group B were significant at the .05
level or better. Using Tucker’s procedure the
first three factors were significant within both
groups. Consequently, only three factors were
retained for the normalized varimax rotation,
with the fourth factor being discarded as not
being significant under either criterion.
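
As a quick arithmetic check on the percentages reported above, the per-factor figures for each group should sum to the stated totals. The sketch below also shows, as an assumption about how such figures are usually obtained, the percentage of variance for one factor as the sum of its squared loadings divided by the number of variables; the helper name is hypothetical.

```python
# Percent of variance for the four centroid factors, as reported in the text.
GROUP_A = [34.6, 9.1, 6.3, 3.2]
GROUP_B = [38.6, 8.7, 5.2, 2.3]


def percent_variance(loadings):
    """Percent of total variance carried by one factor (loadings as decimals)."""
    return 100 * sum(a * a for a in loadings) / len(loadings)


total_a = round(sum(GROUP_A), 1)
total_b = round(sum(GROUP_B), 1)

if __name__ == "__main__":
    print(total_a, total_b)
```

Both sums reproduce the totals given in the text (53.2 and 54.8), which supports reading the fourth Group B value as 5.2.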


5 The author’s appreciation is extended to Gary
Lotto and William B. Kehl of the University’s
Computing and Data Processing Center for providing
assistance and facilities in the factor analytic
phase of this study.

6 Tables giving the intercorrelations among the
variables within each group of Ss, the tests of
significance of the unrotated centroid factors, and
centroid loadings and transformation matrices have
been deposited with the American Documentation
Institute. Order Document No. 6313, remitting $1.25


The loadings of each variable resulting from
the normalized varimax rotation procedure are
given in Table 1. For convenience and consistency,
the signs of the loadings for the Sex
variable were reversed in preparing Table 1
as is indicated in the table. The three factors
are almost identical between both Groups A
and B, with indices of factor similarity (sum
of the cross-products of loadings of similar
factors divided by the geometric mean of the
sums of squares of the loadings) for the nine
variables common to both Groups A and B
being .99, .94, and .96 for rotated Factors Em,
F, and Sx. The only evident sampling difference
between the independent groups of Ss is
that the loadings of the scores on Factor F
tend to be higher within Group B than they
are within Group A. The relative rank orders
of the Factor F loadings are almost identical
within both groups (rho = .92), but F accounts
for only 15% of the interscale variance
within Group A, while it accounts for 19%
within Group B.

Factor Em is an Emotionality factor with
zero loadings only on the MMPI Lie, Cattell
Motivational Distortion, and Sex variables.
Taylor’s MAS is the best single measure
of Factor Em, with Eysenck’s Neuroticism,
Winne’s Neuroticism, and Cattell’s Total
Anxiety scores also having large loadings on
this factor. This Emotionality factor, as defined
by this battery of inventories, permeates
both the Anxiety and Neuroticism scales
equally strongly and shows a moderate negative
loading (-.36 and -.46) on the Extraversion
scale. Factor F is a Falsification factor
with high loadings on the MMPI Lie and
Motivational Distortion scales. Since neither 
of these two variables was loaded on the other 
two factors, they can be used to define and 
identify this factor. Most of the scales with 
high Emotionality loadings also have moder- 
ate negative loadings on this F factor with 
Cattell’s Covert Anxiety and Total Anxiety 
scales showing the largest loadings. Eysenck’s 
Extraversion scale and the Sex variable have 
zero loadings on Factor F. Factor Sx appears
to be a pure Sex factor with the reversed di- 
rection of scoring of this variable identifying 
the positive pole as the “femininity” direc- 
tion. Both Cattell’s Overt Anxiety and Neu- 
roticism scales have large positive loadings 
on the Sx factor as does Cattell’s Difference 
score (which is probably attributable to the 
inclusion of Covert Anxiety in the Difference 
score). 
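
The index of factor similarity described in the Results (sum of cross-products of loadings divided by the geometric mean of the sums of squares) and the share of variance carried by Factor F can be sketched as follows. This is an illustration under stated assumptions: only six of the nine common variables have fully legible Em loadings in both groups of Table 1, so the printed value of .99 cannot be reproduced exactly here, and the Group A entries of .34 (NPF Neuroticism, Em) and -.64 (Covert Anxiety, F) are readings of damaged table cells. The function names are hypothetical.

```python
from math import sqrt


def similarity_index(a, b):
    """Sum of cross-products over the geometric mean of the sums of squares."""
    cross = sum(x * y for x, y in zip(a, b))
    return cross / sqrt(sum(x * x for x in a) * sum(y * y for y in b))


def percent_variance(loadings):
    """Percent of total variance carried by one factor."""
    return 100 * sum(x * x for x in loadings) / len(loadings)


# Em loadings for rows legible in both groups: Extraversion, Eysenck N,
# MMPI Lie, NPF Neuroticism (Group A value assumed .34), NPF Motiv. Dist., Sex.
EM_GROUP_A = [-0.36, 0.75, -0.07, 0.34, -0.05, -0.02]
EM_GROUP_B = [-0.46, 0.67, -0.08, 0.42, -0.05, -0.04]

# Factor F loadings for all 11 Group A variables (Covert Anxiety assumed -.64).
F_GROUP_A = [-0.19, -0.05, 0.39, -0.04, -0.33, 0.66,
             -0.64, -0.37, -0.09, 0.62, 0.03]

if __name__ == "__main__":
    print(f"Em similarity (partial rows): {similarity_index(EM_GROUP_A, EM_GROUP_B):.3f}")
    print(f"F variance, Group A: {percent_variance(F_GROUP_A):.1f}%")
```

With these readings the Group A share for Factor F comes out at about 15%, in agreement with the figure stated in the text.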
Considering the factor composition of the
inventory scores included in this analysis,
the Social Desirability, Eysenck Neuroticism,
Covert Anxiety, Total Anxiety, and probably
the Manifest Anxiety and Winne Neuroticism
scales are factorially complex by being loaded
on both Factors Em and F. Overt Anxiety 
and probably Cattell’s Neuroticism scales are 
loaded with all three factors, while Cattell’s 
Difference score is loaded with both Factors 
Em and Sx. Eysenck’s Extraversion scale 
shows a moderate loading on Factor Em, and 
the MMPI Lie and Motivational Distortion 
scales are the best measures of Factor F. 


Discussion 


The absence of a significant unrotated 
fourth factor within either matrix of inven- 
tory intercorrelations and the clear identifi- 
cation of the analytically rotated three sig- 
nificant factors as Emotionality, Falsification 
and Sex factors indicate that the Anxiety
and Neuroticism dimensions measured by the
scales developed by different authors and different
methodologies can be assumed to be
operationally identical within the limits of
this study. This is not to deny that some of
these scales may be also measuring factors 
that had no opportunity to appear in these 
analyses, but the communalities of several 
scales, notably Taylor’s MAS, Eysenck’s Neu- 
roticism, Cattell’s Covert Anxiety, Overt Anx- 
iety, Total Anxiety, and Difference scores, are 
close to what might be assumed to be the 
reliabilities of these scales, leaving little re- 
liable scale variance to be attributed to hypo- 
thetical factors not appearing in these analyses. 

The Emotionality factor defined by these 
inventories includes an introversion compo- 
nent as has been suggested by Eysenck 
(1957). His further hypothesis that this 
component results from the confounding of 
an orthogonal Extraversion-Introversion (EI) 
factor with Em could not be tested with the 
present data since only one “pure” marker
variable for the hypothetical EI factor had 
been included in the analyses. Inclusion of 
additional measures of EI might permit a 
separation of Emotionality and Extraversion 
factors that may have been confounded in 
the present analyses and allow the assessment 
of the relative proportions of each factor in 
these Anxiety and Neuroticism scales. The 
presence of a Falsification or Honesty factor 
in these inventory scores, as was suggested by 
Edwards (1957) among others, was also 
strongly confirmed, but it is evident that Ed- 
wards’ Social Desirability Scale is not the 
best measure of this factor. The factor load- 
ings of the abbreviated Social Desirability 
Scale appear quite similar to those of the 
Anxiety and Neuroticism scales and indicate 
that the SDS scores are better measures of 
Factor Em than they are of Factor F. Al- 
though Eysenck (1956a) was moderately suc- 
cessful in developing an Emotionality scale 
that is independent of Extraversion (Bendig, 
1957), his scale is also negatively loaded with 
the Falsification factor. 

The study reported below is a further extension
of the first study with additional
marker variables for the Em and EI factors.
It was hoped that these additional scales
would clarify the relation of the scales to the
Em, EI, F, and Sx factors and would again
test for separate Anxiety and Neuroticism
factors.


STUDY II
Procedure


Scales. Eleven independent personality scales were
included, along with standard instructions, in a book- 
let to be distributed to the Ss with separate IBM 
answer sheets being used to collect the responses to 
the items. 

Maudsley Personality Inventory (MPI) scores: 
The most recent form of the MPI, described in de- 
tail by Jensen (1958), consists of 80 trichotomous- 
response (Yes-?-No) items including 24 Extraversion, 
24 Neuroticism, 20 Lie, and 12 Buffer items. Eysenck 
(1956a) describes the 12 Buffer items as 7 nonscored 
Extraversion and 5 nonscored Neuroticism items to 
be used for research purposes. He also indicates which 
of the 24 items on either the E or the N scales can 
be split into subscales containing 12 items each for 
establishing the split-half reliability of the E and N 
scales. Much of the content of the Lie items is simi- 
lar to that in other Lie scales such as the MMPI 
scale described below. Although the original MPI 
uses the trichotomous-response method, it has been 
shown that the reliability and intercorrelation of the 
scales are not affected if a simpler dichotomous- 
response (Yes-No) method is used (Bendig, 1959a). 

For purposes of the present research, the items of 
the MPI, using the dichotomous-response format, 
were grouped into six independent scales to obtain 
separate scores. MPI-E1 and MPI-E2 each contained 
12 of the original 24 Extraversion items with the as- 
signment of items to subscales following the split 
suggested by Eysenck (1956a). MPI-E3 included the 
7 Buffer MPI Extraversion items. MPI-N1 contained 
the 24 items in the original Neuroticism scale, while 
MPI-N2 was scored for the 5 Buffer MPI Neuroticism
items. MPI-L scores were obtained by scoring
10 of the 20 MPI Lie items, omitting 10 Lie items
that either overlapped in item content some of the
other Lie scales used in this study or appeared not
to be items which would be relevant for our Ss.⁷

7 The content of the MPI items and scoring
direction can be found in Jensen (1958, pp. 316-318).
The numbers of the items used in this study are:
MPI-E1, [illegible], 13, 14, 20, 31, 56, 57, 66, 69, 75;
MPI-E2, 2, 9, 25, [illegible], 41, 44, 46, 47, 51,
61, 73, 79; MPI-E3, 7, 21, 29, 34, 60, [illegible]
(all scored for Yes responses); MPI-N2, [illegible],
19, 33, 36, 78 (all scored Yes); and MPI-L,
[illegible], 40, 43, 49, 58, 62. The MPI-N1 scale
contained the 24 items listed by Jensen. Lie items
8, 15, 24, 28, 45, 52, 55, [illegible] were
omitted from the form given to the Ss.

Some of the scales included in the previous study
were repeated in this analysis. Taylor’s 50-item
Manifest Anxiety Scale (MAS) was included along
with the abbreviated 22-item version of Winne’s
Neuroticism (Winne N) scale and the 13-item form
of the MMPI Lie scale. The 34-item Neuroticism
(NPF-N) and 6-item Motivational Distortion
(NPF-MD) scales included in Cattell’s Neurotic
Personality Factor Questionnaire were also repeated in this
study. As in the previous study, the NPF Neuroti- 
cism items were reversed in scoring compared to the 
item key given in the NPF manual so that high 
scores here represent high Neuroticism. 

In summary, the MPI-N1, MAS, Winne N, MMPI 
Lie, NPF-N, and NPF-MD scales are identical with 
those used previously, while the MPI-E1 and MPI- 
E2 combined are equivalent to the previous Extra- 
version scale. MPI-E3, MPI-N2, and MPI-L were 
used for the first time in this study. 

Subjects. The scales were administered to a total 
of 263 students (197 men and 66 women) enrolled in 
eight sections of a one-semester course in introduc- 
tory psychology (Group C). The Ss were primarily 
freshman and sophomore students and are similar to 
those used in the previous study with the exception 
that a smaller percentage of women Ss was enrolled 
during this (spring) semester. The scales were ad- 
ministered in class during the second week of the 
semester by each class instructor, and this group of 
Ss represents the total enrollment in the course dur- 
ing this semester with the exception of a few Ss who 
were absent from class the day that the scales were
administered.

Method. The 11 scale scores were intercorrelated
by the usual product-moment method, and each score
was correlated with the sex dichotomy by the point 
biserial formula. Six factors were extracted from the 
12-variable matrix of correlations by the complete 
centroid method and each factor was tested for 
statistical significance using both Humphrey’s and 
Tucker’s criterion. The significant factors were ro- 
tated to orthogonal simple structure using the ana- 
lytic normalized varimax criterion. 


Results 

Using Humphrey’s test of the significance
of factors, the first three centroid factors were
significant at the .01 level and the fourth factor
at the .05 level (z = 2.20), while the fifth
and sixth factors did not approach significance
(z = .82 and .53). The first five factors were
significant using Tucker’s phi criterion, while
the sixth factor again was not significant.
Including the fifth factor in the rotation process
resulted in a fifth rotated factor with all
loadings less than .30 (the largest loadings were
-.28 for NPF-N and .22 for MMPI Lie).
Consequently, the fifth factor was omitted
and only the first four factors included in the
normalized varimax rotation procedure. The
four unrotated centroid factors accounted for
53.8% of the total variance with the omitted
fifth factor accounting for only an additional
2.0%. The residual correlations after the
extraction of four factors ranged in absolute
values from .00 to .08, with the median being
.015.


TABLE 2

ROTATED ORTHOGONAL FACTOR LOADINGS
(DECIMAL POINTS OMITTED) OF
INVENTORY SCORES (GROUP C)

Scales               Em     EI      F     Sx     h²
Eysenck MPI-N1       78    -07    -25     14     70
Eysenck MPI-N2     [67]    -01    -13     10     48
Taylor MAS         [85]    -23    -12     07     80
Winne N              69    -23    -04     04     53
Cattell NPF-N        46     04    -13     52     50
Eysenck MPI-E1      -22     78   0[?]    -10     66
Eysenck MPI-E2      -10     79    -02     11     64
Eysenck MPI-E3      -05     79     08     02     64
Eysenck MPI-L        02     04     62     06     39
MMPI Lie            -27    -10     56     10     40
Cattell NPF-MD      -22     11     55    -11     38
Sex (Female)         05     01     09     59     36

Note. Bracketed numbers are readings of damaged entries that agree
with the printed communalities and with the percentages given in the
Discussion; [?] marks an illegible digit.


The rotated factor loadings show a good ap- 
proximation to simple structure, with 45.8% 
of the loadings having absolute magnitudes 
of .10 or less, 60.4% being .20 or less, and 
27.1% of the loadings being significant (.30 or
greater). Only NPF-N had significant loadings on
two factors with the other 11 variables having
loadings above .30 on one factor. The factor
loadings and communalities of
the scales can be found in Table 2. The first 
factor is obviously the Emotionality (Em) 
factor with high loadings on all of the Anx- 
iety and Neuroticism scales, while the or- 
thogonal second factor is an Extraversion- 
Introversion (EI) dimension which loads 
highly on the three Extraversion subscales. 
The third rotated factor again is a Falsifica- 
tion (F) factor with high loadings on the 
three Lie scales and the fourth factor is a 
Sex (Sx) factor that loads only on the NPF-N 
and Sex (female) variables. MPI-N1 has a 
small negative loading on F, while the MAS 
and Winne N variables have small negative 
loadings on EI. MMPI Lie and NPF-MD 
scales have small negative loadings on Em in 
addition to their large loadings on F with 
MPI-L loading only on F. NPF-N loads on 
both Em and Sx with its loading on Em 
(.46) being slightly smaller than its loading 
on Sx (.52). 
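
The simple-structure percentages quoted above (45.8%, 60.4%, and 27.1% of the 48 loadings) can be recounted from Table 2. The sketch below is a check under assumptions: three damaged table entries are filled with readings consistent with the printed communalities and the Discussion (MPI-N2 Em = 67, MAS Em = 85) or with zero (MPI-E1 F), and the helper name is hypothetical.

```python
# Table 2 loadings with decimal points omitted; columns are Em, EI, F, Sx.
TABLE_2 = {
    "MPI-N1":   (78, -7, -25, 14),
    "MPI-N2":   (67, -1, -13, 10),    # Em is an assumed reading of a damaged entry
    "MAS":      (85, -23, -12, 7),    # Em is an assumed reading of a damaged entry
    "Winne N":  (69, -23, -4, 4),
    "NPF-N":    (46, 4, -13, 52),
    "MPI-E1":   (-22, 78, 0, -10),    # F second digit illegible; assumed 00
    "MPI-E2":   (-10, 79, -2, 11),
    "MPI-E3":   (-5, 79, 8, 2),
    "MPI-L":    (2, 4, 62, 6),
    "MMPI Lie": (-27, -10, 56, 10),
    "NPF-MD":   (-22, 11, 55, -11),
    "Sex":      (5, 1, 9, 59),
}


def percent_where(predicate):
    """Percent of all absolute loadings that satisfy the predicate."""
    values = [abs(v) for row in TABLE_2.values() for v in row]
    hits = sum(1 for v in values if predicate(v))
    return round(100 * hits / len(values), 1)


near_zero = percent_where(lambda v: v <= 10)   # loadings of .10 or less
small = percent_where(lambda v: v <= 20)       # loadings of .20 or less
salient = percent_where(lambda v: v >= 30)     # loadings of .30 or greater

if __name__ == "__main__":
    print(near_zero, small, salient)
```

Under these readings the counts are 22, 29, and 13 of 48 loadings, matching the three percentages in the text.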
Discussion


The results of this factor analysis have con- 
firmed and clarified the findings of the previ- 
ous analyses. The presence of an orthogonal 
EI factor is confirmed and has been separated 
from the Em factor, while the previous analy- 
ses confounded these two factors. Again only 
one major Em factor was found to load the 
Anxiety and Neuroticism scales, which con- 
firms the previous evidence that Anxiety and 
Neuroticism factors cannot be demonstrated 
as separate factors in factor analyses of ques- 
tionnaire data. The identification of the fac- 
tor was strengthened and more clearly sepa- 
rated from the EI factor. 

Using the squares of the factor loadings in 
Table 2 as percentages of scale variance, we 
can quantitatively estimate factor components 
in the scales included in this research. Of the 
Anxiety and Neuroticism scales, the MAS is 
the most valid measure of Em (72%), but is 
contaminated by a small loading on EI (5%). 
Eysenck’s MPI-N1 score is also a good measure
of Em (61%) and is free of EI, but is
slightly loaded on F (6%). Winne’s Neuroticism
scale shows a lower weighting on Em
(48%) and, like the MAS, has an EI component
(5%). Cattell’s NPF Neuroticism
scale is a better measure of Sex differences
(Sx) than it is of Emotionality (27% vs.
21%) which agrees with the previous analyses.
The greater saturation of the MAS score
with Em may be due to the greater length of 
this scale (50 items) compared to the other 
scales, assuming that a longer scale would 
have a greater proportion of reliable variance 
and, possibly, a larger proportion of common 
factor variance. 
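
The percentages in this paragraph follow from squaring the Table 2 loadings. A minimal sketch of that arithmetic is given below; the MAS loading of 85 on Em is an assumed reading of a damaged table entry (it is the value implied by the 72% quoted above), and the function name is hypothetical.

```python
def percent_of_variance(loading):
    """Squared loading as a whole-number percentage (loading given without decimal point)."""
    return round(loading * loading / 100)


# (scale, factor): loading from Table 2, decimal points omitted.
LOADINGS = {
    ("MAS", "Em"): 85,       # assumed reading; implied by the 72% in the text
    ("MAS", "EI"): -23,
    ("MPI-N1", "Em"): 78,
    ("MPI-N1", "F"): -25,
    ("Winne N", "Em"): 69,
    ("NPF-N", "Em"): 46,
    ("NPF-N", "Sx"): 52,
}

PERCENTS = {key: percent_of_variance(value) for key, value in LOADINGS.items()}

if __name__ == "__main__":
    for (scale, factor), pct in PERCENTS.items():
        print(f"{scale:8s} {factor:3s} {pct:3d}%")
```

For Cattell's NPF-N this gives 21% on Em and 27% on Sx, which is the comparison the text draws between Emotionality and Sex differences.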

Approximately 62% of Eysenck’s Extra- 
version scale variance is accounted for by the 
EI factor and 3% is attributable to Em. All 
three of the Lie scales (MPI-L, MMPI Lie, 
and NPF-MD) load moderately on the F 
factor (38, 31, and 30%), but MMPI Lie 
and NPF-MD also have Em components (7 
and 5%). MPI-L appears to be the purest 
measure of this F factor with negligible load- 
ings on Em, EI, and Sx. It is interesting to 
note that the three scales composed of MMPI
items (MAS, Winne, and MMPI Lie) all have
both Em and EI components.

In using these scales for further factor analytic
studies, MAS, MPI-N1, and MPI-N2
appear to be the best “markers” for the Em 
factor and Cattell’s and Winne’s Neuroticism 
scales can be eliminated as being too contami- 
nated by extraneous factor variance. How- 
ever, the Winne scale might be improved and 
purified through factor analysis. The MPI 
Extraversion scores can be used to define the 
EI factor, but should be improved to remove 
the F component. MPI-L can be used as a 
measure of F, but its F saturation should be 
increased, possibly by lengthening the scale, 
and at least one other F marker scale will 
have to be developed that is free of the Em 
factor. 

SUMMARY 


Various inventories measuring Anxiety, Neu- 
roticism, Extraversion, and Falsification vari- 
ables were administered to two groups of col- 
lege students, each group containing 200 Ss, 
and replicated factor analyses of the inter- 
correlations among 11 scores within each 
group were performed. Three significant fac- 
tors were found within each matrix of corre- 
lations and the factors were rotated to or- 
thogonal simple structure by the normalized 
varimax method. The rotated factors were 
identified as Emotionality, Falsification, and 
Sex with all of the Anxiety and Neuroticism 
scores having appreciable loadings on both of 
the first two factors. It was suggested that 
Anxiety and Neuroticism are both manifesta- 
tions of a more general Emotionality factor 


and are not separate dimensions in the common
inventories, and that the analyses
may have confounded two independent
factor dimensions, Emotionality and Extraversion-Introversion,
within the Emotionality
factor found in these two samples.

To test the above findings, a second study
was carried out in which 11 Anxiety, Neuroticism,
Extraversion, and Lie scales overlapping
with the first battery were administered to
[number illegible] students and a centroid factor analysis
performed on the matrix of scale intercorrelations.
Four factors were found which, after
rotation to orthogonal simple structure, were
clearly identifiable as Emotionality (Em),
Extraversion-Introversion (EI), Falsification
(F), and Sex (Sx). Separate factors of Anxiety
and Neuroticism again did not appear in 
the analysis, and the separation of Emotion- 
ality and Extraversion-Introversion as inde- 
pendent factors was confirmed. 
Theories regarding the conditions antecedent
to the development of schizophrenia still 
remain speculative in character. The points of 
view fall mainly into three categories: causa- 
tive factors are of a genetic or constitutional 
nature; causative factors are of a psychologi- 
cal character linked to some maladjustment 
in interpersonal relationships; causative fac- 
tors represent a combination of physical and 
psychological agents. Many of those who have 
attempted to account for schizophrenia in 
terms of psychological factors have stressed 
the importance of faulty mother-child rela- 
tionships in the etiology of this disorder 
(Arieti, 1955; Fromm-Reichmann, 1948; 
Rosen, 1947). One line of research bearing 
upon the mother-child hypothesis has uti- 
lized an objective measure of the child rear- 
ing attitudes of the schizophrenics’ mothers, 
comparing them to the mothers of normals. 

Mark (1953) reported that 67 of the 139 
items in an attitude survey reliably discriminated
between the mothers of male schizophrenics
and the mothers of normal males, 
with the schizophrenics’ mothers tending to 
be more “restrictive” and either “excessively 
devoted or detached.” Using Shoben’s (1949) 
Parent-Child Attitude Survey, Freeman and 
Grayson (1955) found significant or suggestive
differences on two out of three subscales 
when the mothers of schizophrenics and normals
were compared. Tendencies to be “possessive”
or “ignoring” in attitude were shown 
by the mothers of schizophrenics. However, a 
study by Zuckerman, Oltean, and Monashkin 
(1958a), which compared the responses of 
these two types of mothers on the Parent 
Attitude Research Instrument (Schaefer & 
Bell, 1958), found only one significant difference
between them on 20 scales, a finding 
which can be readily attributed to chance. 


The apparent logic behind these studies 
and numerous others (see Freeman & Gray- 
son, 1955) in which a variety of assessment 
techniques have been used to evaluate the 
maternal attitudes of schizophrenics’ mothers 
is that the best way to detect and clarify pos- 
sible malignant factors in the mother-schizo- 
phrenic child relationship is to directly evalu- 
ate the mother. However, it can be argued 
that it is the child’s perceptions of the moth- 
er’s characteristics and attitudes which are 
the important cues mediating his behavior in 
the mother-child relationship. It would fol- 
low that the study of the possible genesis of 
schizophrenia in disruptive mother-child in- 
teractions should proceed by way of the 
schizophrenic child’s perceptual schema of the 
mother and of himself in relation to her. An 
earlier study (Singer, 1954) followed a simi- 
lar approach by using Thematic Appercep- 
tion Test stories to evaluate schizophrenics’ 
perception of parental characteristics and in- 
trafamilial relationships. Singer concluded that 
schizophrenics see parental figures as cold or 
rejecting and depict the relationships with 
parents as being relatively isolated. 

The present study represents an attempt to 
measure objectively schizophrenic perceptions 
of mothers’ child rearing attitudes and to com- 
pare these perceived attitudes with those of 
normals. More specifically, the hypothesis that 
schizophrenics would perceive the mother-child
relationship as poorer than would normals
was tested by predicting that schizophrenics
would attribute more pathogenic 
attitudes to their mothers than would normals. 


METHOD 


Subjects 
Twenty-six females, admitted in 1957 to a typical 
large midwestern mental hospital and who carried uncontested
staff diagnoses of schizophrenia, were individually
administered the Parent Attitude Research 
Instrument (PARI). There was no attempt made to 
provide more homogeneous classes of schizophrenics 
by further categorizing them into the standard sub- 
groups of paranoid, catatonic, etc. One reason for 
this was the restricted number of schizophrenics 
available for the study since inclusion demanded the 
ability and cooperation of both the schizophrenic 
and her mother. The second reason was the lack of 
any evidence that membership in any subclass was 
differentially related to the experimental task. A con- 
trol group of 27 normal females (i.e., answered “No” 
when asked if they had ever utilized any mental 
health service) were personally contacted or con- 
tacted by mail and were willing to complete the 
PARI. 

The schizophrenic and normal groups did not differ 
significantly in mean age (schizophrenics, 36.6 years; 
normals, 33.4 years) or mean educational level 
(schizophrenics, 11.3 years; normals, 12.1 years). 
Hospitalization was only partially controlled (26 out 
of 27 of the schizophrenics were hospitalized at the 
time of testing while 10 of 26 normals were patients 
on medical wards of a general hospital) but there is 
no a priori basis for assuming a systematic relation- 
ship between the hospitalization factor and the test 
performance obtained in the present study. The so- 
cial-economic status variable was not controlled in 
this study except to the extent that having fairly 
equivalent mean educational levels for the daughter 
groups accomplishes this. Since social-economic status 
appears to be related to child rearing attitudes,
it seemed advisable to make a preliminary
determination of whether the mothers of the two
groups differed attitudinally. The mothers
were given the PARI and an analysis of variance
demonstrated no differences (F < 1.00) in expressed
child rearing attitudes between the mothers 
of the schizophrenic and normal daughters used in 
this study. 


Test 


The attitude survey used in the present investigation
was the Parent Attitude Research Instrument 
(PARI) developed by Schaefer and Bell (1958). It 
includes 115 statements devised to measure maternal 
attitudes judged to be related to personality development
in children. The total of 115 statements consists
of 23 sets of 5 statements, each set
intended to measure one category of maternal attitudes.
These categories are given in Table 1. The
respondent has the option of endorsing “strongly
agree,” “mildly agree,” “mildly disagree,” or “strongly
disagree” for each statement. The “strongly agree” response
is given the highest score on the PARI and
“strongly disagree” the lowest; thus a high
score always represents agreement with the attitude
in question. Further, the attitudes being measured
by 20 of the scales taken in their extreme are typically
assumed to be deviant when characteristic of
the mother. Accordingly, higher scores for these 20 
scales indicate both greater agreement and a greater 
potential disruptive influence on the child’s develop- 
ment. The remaining three “rapport” scales (Nos. 1, 
14, 21 on Table 1) were included by the test con- 
structors so that normals would have a greater op- 
portunity to agree with statements on the question- 
naire. Scores on these three scales are interpreted in 
opposite fashion so that higher scores indicate more 
adjustive attitudes. 


Procedure 


The schizophrenic daughters were individually ad- 
ministered the PARI under special instructions to 
“Answer it as you think your mother would.” The 
administration was conducted by some member of 
the state hospital staff who was previously known 
to the patient. The staff member was requested to 
be sure the nature of the task was initially clear to 
the patient and to restate the directions periodically 
if it seemed necessary. The 26 schizophrenics included 
in this study represent about 29% of the 90 female 
patients with appropriate diagnoses admitted during 
the data collection period. 

The 10 normal daughters obtained from general 
medical wards were individually administered the 
PARI by a psychologist under the same conditions 
as the schizophrenics. The remaining 17 normal 
daughters were contacted by mail and were strongly 
urged to complete the task without prior discussion 
with their mothers concerning it. The names and 
addresses of these 17 daughters were obtained from 
their mothers who were initially tested. As far as it 
is known, the daughters were not acquainted with 
one another. The mothers, in turn, were obtained 
as Ss from the Iowa City Employment Agency 
and local civic groups. The total sample of 27 daugh- 
ters represents about 42% of the 65 requested to 
participate in the study. 

The printed task instructions were identical for 
both groups, whether seen personally or contacted 
by letter. Further, the impersonal research nature of 
the task was stressed for all daughter Ss to decrease 
test response manipulation associated with public 
description of their mothers. 

Scoring of the predictions proceeded just as though 
the mother had completed the PARI under standard 
conditions with a score of 4 assigned to a “strongly 
agree” prediction for a statement, 3 to a “mildly 
agree,” 2 to a “mildly disagree,” and 1 to “strongly 
disagree.” These statement scores were then summed 
over the five statements within each scale. The scale 
prediction scores on the PARI define perceived ma- 
ternal attitudes in this study. 
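
The scoring rule just described can be sketched in a few lines. This is only an illustration of the arithmetic, assuming responses are recorded as the four verbal options; the function name and the example responses are hypothetical and are not data from the study.

```python
# Numeric score assigned to each response option, as stated in the text.
RESPONSE_SCORES = {
    "strongly agree": 4,
    "mildly agree": 3,
    "mildly disagree": 2,
    "strongly disagree": 1,
}


def score_scale(responses):
    """Sum the statement scores for the five statements of one PARI scale."""
    if len(responses) != 5:
        raise ValueError("each PARI scale consists of exactly 5 statements")
    return sum(RESPONSE_SCORES[r] for r in responses)


# Hypothetical predictions by one daughter for one scale (illustration only).
example = [
    "strongly agree",
    "mildly agree",
    "mildly agree",
    "mildly disagree",
    "strongly agree",
]
print(score_scale(example))  # 4 + 3 + 3 + 2 + 4 = 16
```

Under this rule every scale score falls between 5 and 20, which is the range within which the means in Table 1 should be read.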


RESULTS 


The data presented in Table 1 were examined
initially by a Type I analysis of variance
(Lindquist, 1953) in which the main
effect for diagnosis was a between-S effect 
and the main effect for scales was based upon 
within-S comparisons. The F value for the 
diagnosis × scales interaction was significant 
beyond the .001 level of confidence (mean 
square for interaction = [illegible]; mean square 
within treatments = [illegible]; F = [illegible] for 22, 
1122 df). Since the diagnosis × scales interaction
effect was significant, no evaluation of 
the main effects of diagnosis or scales was 
made. Rather, interest was focused upon the 
simple effects of diagnosis on each of the 
individual scales (Lindquist, 1953, p. 211). 


TABLE 1 

COMPARISONS OF MEAN MOTHERS’ ATTITUDE SCALE SCORES AS PREDICTED 
BY SCHIZOPHRENIC AND NORMAL DAUGHTERS 

                                     Primary     Schizophrenic       Normal
                                     Factor        (N = 26)         (N = 27)
Scale Number and Description         Loading(a)   Mean     SD      Mean     SD     t value
 2 Fostering Dependency                 A        14.10    4.00    11.25     --     2.85**
 3 Seclusion of Mother                  A        14.55    3.05    12.45     --      --
 4 Breaking the Will                    A        13.60    3.10    11.25     --     2.76**
 5 Martyrdom                            A        13.75    3.85    10.80     --      --
 6 Fear of Harming Baby                 A        14.80    3.40    14.00     --      --
10 Excluding Outside Influences         A        13.00    2.05    11.45     --      --
11 Deification                          A        15.35     --       --      --      --
12 Suppression of Aggression            A        13.50    2.75    11.15     --      --
15 Approval of Activity                 A        14.65    2.65    13.60     --      --
16 Avoidance of Communication           A        12.90    3.85      --      --      --
17 Inconsiderateness of Husband         A        13.85    3.55      --      --     1.19
18 Suppression of Sex                   A        13.25    3.50      --      --     2.74
19 Ascendance of Mother                 A        14.85    3.50      --      --     3.50**
20 Intrusiveness                        A        12.85    3.65      --      --     2.37*
22 Acceleration of Development          A        15.35    2.60      --      --     3.05**
23 Dependency of Mother                 A        14.20    2.60      --      --     1.59
 7 Marital Conflict                     B        14.80    2.75      --      --     1.25
 9 Irritability                         B        13.25    2.30      --      --     2.07*
13 Rejecting Homemaking Role            B        12.00    2.80      --      --      .42
 1 Encouraging Verbalization            C        16.55    2.30      --      --      --
14 Equalitarianism                      C        16.50    2.60      --      --      --
21 Comradeship and Sharing              C        17.85    1.70      --      --     1.11
 8 Strictness                                    14.45    3.45      --      --      .78

(a) From Zuckerman, Ribback, Monashkin, & Norton, 1958. 
* Significant beyond .05 level. 
** Significant beyond .01 level. 
-- Entry illegible in source. 


Table 1 presents the comparisons of predicted 
attitude scores for the individual PARI scales. 
Inspection of this table shows that in 13 of 
the 23 scale comparisons, the schizophrenics’ 
prediction scores differed from those of the 
normals. Of the 13 comparisons providing 
significant differences, 12 show the schizophrenic
daughters ascribing attitudes to their 
mothers which are more contrary to usually 
approved child rearing opinions than are those 
ascribed to their mothers by normals. The 
exception was the Irritability Scale where normal
daughters predicted that their mothers 
would score higher than did the schizophrenics. 
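
The degrees of freedom reported for the interaction test (22 and 1122) can be checked against the design. The sketch below assumes the usual partition for a mixed design with one between-S factor (diagnosis) and one within-S factor (scales), which is how the Type I analysis is described above; the variable names are illustrative.

```python
# Design constants taken from the text.
n_schizophrenic = 26
n_normal = 27
n_groups = 2      # schizophrenic vs. normal daughters
n_scales = 23     # PARI scales (the within-S factor)

# Between-S part of the design.
df_diagnosis = n_groups - 1
df_between_error = n_schizophrenic + n_normal - n_groups

# Within-S part of the design.
df_scales = n_scales - 1
df_interaction = df_diagnosis * df_scales
df_within_error = df_between_error * df_scales

print(df_interaction, df_within_error)  # 22 1122
```

The agreement with the reported 22 and 1122 df indicates that all 53 daughters and all 23 scales entered the analysis.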


DISCUSSION 


When schizophrenic daughters were asked 
to predict how they thought their mothers 
would perform on a survey of child rearing 
attitudes and these predictions were com- 
pared with those provided by normal daugh- 
ters, marked differences in these perceptions 
were obtained. The major trend was toward 
the schizophrenic’s perception of her mother 
as endorsing less approved (and presumably 
more pathogenic) child rearing attitudes when 
compared to the normal’s perception of her 
mother. This trend was observed for 12 of 
the 23 PARI attitude scales. Normal daughters
predicted that their mothers would endorse
a more pathogenic attitude on only one 
scale and on the ten remaining scales no sig- 
nificant differences were obtained. 

These findings lend support to the notion 
that the schizophrenic daughter attributes 
more pathogenic child rearing attitudes to 
her mother than does the normal daughter. It 
may be recalled that the mothers of these 
daughter groups did not differ significantly 
in their expressed child rearing attitudes on 
the same attitude scale used by the daugh- 
ters, a finding which is consistent with the 
study of Zuckerman et al. (1958a). However, 
since this study is concerned with perceptual 
differences between schizophrenics and nor- 
mals and not with accuracy of perception, this 
discrepancy between perception and endorse- 
ment is not of immediate concern. 

Factor analytic studies with the PARI 
(Schaefer & Bell, 1955; Zuckerman, Ribback, 
Monashkin, & Norton, 1958b) have found 
three stable factors which reportedly account 
for most of estimated communality variance 
on this instrument. Factor A has been de- 
fined as an “authoritarian-control” cluster; 
Factor B, “hostility-rejection”; and Factor C, 
“democratic attitudes.” An analysis of the 
findings in the present study shows that 12 
of the 16 scales with highest loadings on Fac- 
tor A (see Table 1) were characterized by a 
significantly greater perceived pathology in 
maternal attitudes on the part of schizophren- 
ics. On the other hand, Factor B included 
scales where schizophrenic-normal compari- 
sons produced a far different picture. On one 
of the scales (9), normals predicted a reliably 
higher score for their mothers; the two others 
(7, 13) produced no differences between the 
diagnostic groups. Factor C included three 
scales none of which produced a difference 
between diagnostic groups. These findings 
suggest that it is in the “authoritarian-con- 
trol” aspects of mother-child relationships 
where schizophrenics perceive their mothers 
as deviating more from accepted child rear- 
ing practices than do normals. Further the 
“hostility-rejection” dimension either does not 
distinguish schizophrenics’ and normals’ per- 
ceptions of their mothers or normals perceive 
their mothers as deviating more from stand- 
ard practices in this regard. Finally, there is 
no indication of a difference between perceptions
of these diagnostic groups with regard 
to “democratic attitudes” of the mothers. 

Thus when the child rearing attitude pat- 
terns of the mothers of schizophrenic and 
normal daughters are examined through the 
daughters’ perception of them and these per- 
ceptions are evaluated in rather molar con- 
ceptual units, it begins to appear that a ma- 
jor source of interpersonal difficulty in the 
mother-schizophrenic daughter relationship 
evolves from the perceived “authoritarian- 
control” tendencies of the mother. 

It might be contended that the findings of 
the present study are not appropriate evi- 
dence regarding the etiological relationship 
between faulty mother-child relationships 
and the development of schizophrenia. The 
argument would be that the perceived child 
rearing attitudes of the adult schizophrenic 
are retrospective in nature and might vary 
markedly from those held as a child (or at 
least prior to the onset of schizophrenia). It 
is true that what appear to be systematic 
perceptual differences between schizophrenics 
and normals in this investigation can be 
speculatively related to schizophrenia in a 
number of ways of which causal significance 
is but one. In any case, definitive evidence is 
not to be found in the present study but 
rather from studies specifically designed to 
critically evaluate various alternative hy- 
potheses. 

It is also recognized that the results of 
the present study could be accounted for by 
differential test-taking habits of the schizo- 
phrenic and normal daughters rather than 
differential perceptual habits as previously 
discussed. For example, if these groups dif- 
fered substantially along a social desirability 
response set continuum, one possibility would 
be that normals would attempt to depict their 
mothers in a more favorable light than the 
schizophrenics. Under such a condition, the 
direction of between-group differences on the 
PARI would undoubtedly be consistent with 
those found in this study. However, there are 
three reasons why accounting for the results 
in terms of differential social desirability response
set tendencies appears less parsimonious
than doing so in terms of perceptual differences: 

1. The more impersonal research approach was 
stressed rather than an interest in any given individual’s
conception of her mother. This might be 
expected to lessen the likelihood that desirability of 
response would play a major role within either 
daughter group. 

2. Previous studies comparing psychiatric and 
normal Ss (Gough, 1950; Klett, 1957; McKinley, 
Hathaway, & Meehl, 1948) indicate that these groups 
do not show differential characteristics with regard 
to desirability of test responses. 

3. It would be difficult to account for finding 
systematic differences in predicted attitudes solely 
within the “authoritarian-control” cluster by presuming
differential social desirability response sets. It 
would seem necessary to introduce a cumbersome 
triple interaction between social desirability, attitude 
cluster, and diagnosis to explain why 75% of the 
scales included in one cluster provided significant 
unidirectional differences between schizophrenics and 
normals because of the social desirability factor while 
not one scale outside of the cluster even approached 
that result. 


SUMMARY 


This study compared the perceptions of 
their mothers’ child rearing attitudes on the 
part of schizophrenic and normal daughters. 
The two daughter groups, matched for age, 
education, and partially for hospitalization, 
were asked to predict how their mothers 
would respond to items on the Parent Atti- 
tude Research Instrument and these predic- 
tions operationally defined perceived maternal 
attitudes. The mothers of the schizophrenics 
and normals independently completed the 
PARI, and there were no differences in en- 
dorsed child rearing attitudes obtained be- 
tween these mother groups. 

Analysis of the predicted scale scores revealed
an overall tendency for schizophrenic 
daughters to predict attitude scores for their 
mothers which reflected more variant child 
rearing practices than did the predicted attitude
scores for the normals. These perceived 
attitude scale scores were further analyzed in 
terms of three primary factors obtained in 
previous factor analytic studies. This analysis
provided rather convincing evidence that 
schizophrenics, more than normals, tend to 
perceive attitudes relating to the “authoritarian-control”
dimension as more abnormal 
in their mothers. However, perceptions of attitudes
relating to “hostility-rejection” or to 
“democratic attitudes” provided no substantial
basis for differentiation of these two 
classes of mothers. 
Research and test construction procedures 
designed to improve personality inventories 
can be most fruitful if they are guided by a 
clear understanding of the rationale of per- 
sonality inventories. However, disagreements 
concerning the beneficial or deleterious influ- 
ence of personality inventory item ambigu- 
ity (Allport, 1937; Anastasi, 1954; Benton, 
1935; Eisenberg, 1941; Elias, 1951; Freeman, 
1955; Hutt, 1945; Landis & Katz, 1934; 
Meehl, 1945; Rotter, 1954; Vernon, 1953) 
indicate that, at least with regard to item 
ambiguity, this clarification of rationale has 
not been fully achieved. This paper is an at- 
tempt to arrive at some further clarification 
of personality inventory rationale through a 
discussion of item ambiguity and its relation- 
ships to discriminating power. The research 
and test construction implications of these relationships
will also be discussed. 


TWO TYPES OF AMBIGUITY 


The classical position concerning the relation
of item discriminating power and am- 
biguity is given by Allport in his listing of 
freedom from ambiguity as a requisite to the 
use of personality inventories. He states this 
position as follows: “The stimulus-situation is 
assumed to be identical for each subject, and 
his response is assumed to have a constant 
significance” (Allport, 1937, p. 449). As can 
be seen from the two halves of Allport’s statement,
item ambiguity can be divided into two 
types: variation in the subject’s interpretation
of what is meant by an item and variation
in the extent to which different types of 
persons give similar responses to an item. 
These two kinds of ambiguity will be referred 
to as “interpretive ambiguity” and “response 
ambiguity,” respectively. 


Interpretive Ambiguity 


The most widely held point of view con- 
cerning interpretive ambiguity is that freedom 
from interpretive ambiguity is necessary in order
to achieve adequate discriminating power. 
This point of view was first reflected in the 
theoretical arguments of Allport (1937) and 
in the writings of earlier researchers who 
found interpretive ambiguity in inventory 
items and criticized personality inventories 
because of this ambiguity (Benton, 1935; 
Eisenberg, 1941; Landis & Katz, 1934). The 
assumption that personality inventories must 
be free of interpretive ambiguity is also stated 
by Hutt (1945), and is seen in Taylor’s 
(1953, p. 289) comments on the development 
of the Manifest Anxiety Scale where, with- 
out study of item discriminating power, the 
majority of the test items were rewritten “so 
as to minimize confusion and misinterpreta- 
tion.” This assumption is most recently re- 
flected in current texts which list interpretive 
ambiguity as one of the primary problems of 
personality inventories (Anastasi, 1955, p. 
558; Freeman, 1955, p. 499; Rotter, 1954, 
p. 257). Among the recent writers, this position
is stressed most by Rotter (1954, p. 
257), who lists similarity of subject interpretation
of questionnaire items as one of the 
three basic assumptions of personality inven- 
tories. 

Meehl (1945) was among the first to ques- 
tion the necessity for interpretively unam- 
biguous personality inventory items. He takes 
the position that discriminating power is the 
prime requirement for inventory statements, 
and if discrimination is obtained through empirical
item selection, the question of whether 
or not the discriminating items are interpretively
ambiguous is essentially an irrelevant 
question. 

Vernon (1953) also takes the point of view 
that the question of item ambiguity is not it- 
self a serious question for personality inven- 
tories. However, his rationale is different from 
Meehl’s. Vernon (1953, p. 140) sees the basic 
influence of item ambiguity as harmful but 
feels that this negative influence is in some 
way ruled out by the cancelling effects of vari- 
ous interpretive choices over a series of test 
items. This cancelling effect is based on an 
assumption that the interpretive choices are 
randomly distributed among the groups being 
discriminated. 

Elias (1951) goes even further and takes 
the point of view that interpretive ambiguity 
is useful and needed for maximal discrimina- 
tion. His point of view is that the different 
individual interpretations of inventory state- 
ments may themselves reflect relevant per- 
sonality differences and are thus sources of 
potentially discriminating variance. “An un- 
structured question permits greater leeway for 
personality projection; therefore, it should re- 
flect personality tendencies more sensitively 
than more structured items” (p. 497). 

If the primary task of personality inventories
is to classify persons into relevant homogeneous
groups, then Meehl and Elias are correct
in pointing out that amount of interpretive
ambiguity is a relevant question 
only to the extent that interpretive ambiguity 
affects discriminating power. Criticisms 
of personality inventories for interpretive ambiguity
must then rest on the demonstration 
of a negative correlation between item interpretive
ambiguity and discriminating power. 
Partial research evidence on this point is 
given by Benton (1935), who found no correlation
between item ambiguity and 
discriminating power. These data would seem 
to weaken the position of those criticizing personality 
inventories because of interpretive ambiguity. 
Benton’s data on interpretive differences 
involving “abnormal” subjects would also seem to 
negate Elias’ position that different interpretations
yield variance relevant to personality. 
However, the sample of items 
used in Benton’s study was brought into question
by Eisenberg’s (1941) study, in which interpretive
variations were found to be discriminating.
Thus, while suggesting rather 
terpretive variations were found to be dis- 
criminating. Thus, while suggesting rather 
clearly that it is not necessary to assume free- 
dom from interpretive ambiguity in order to 
obtain discrimination, the available research 
evidence does not go much further in clarify- 
ing the relations between discriminating power 
and interpretive ambiguity. The presentation 
of a simplified ambiguity model may help in 
clarifying some of the relations involved. 

An ambiguity model. Personality inventory 
item ambiguity can be most simply represented
by a situation in which an inventory 
item (S) may lead to either of two mediating
interpretations (r(s)1, r(s)2), both of 
which may then be responded to in terms of 
either Yes (R1) or No (R2). As an example, 
the various probabilities of mediating interpretations
and responses for two equal N 
groups (A and B) being differentiated by an 
item can be represented as is illustrated below. 


                           .8A, .5B
                         +---------- R1
           .6A, .5B      |
         +------- r(s)1 -+
         |               | .2A, .5B
         |               +---------- R2
    S ---+
         |                 .5A, .5B
         |               +---------- R1
         | .4A, .5B      |
         +------- r(s)2 -+
                         | .5A, .5B
                         +---------- R2


The above example assumes that 60% of 
Group A and 50% of Group B make the first 
mediating interpretation, and for the As making
this interpretation 80% respond Yes (R1). 
Multiplying interpretation probabilities by 
response probabilities we find that in this ex- 
ample 68% of the As and 50% of the Bs 
answer Yes to the item, and 32% of the As 
and 50% of the Bs respond No. Thus, if all 
who respond Yes are labeled as A, the state- 
ment correctly identifies 59%. 
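
The arithmetic of this example can be traced step by step. The sketch below uses the probabilities stated in the text; the value of .5 for Group A responding Yes under the second interpretation is not stated outright but is implied by the 68% total, and the variable names are illustrative.

```python
# Probability that each group makes the first mediating interpretation r(s)1.
p_interp1 = {"A": 0.6, "B": 0.5}

# Probability of answering Yes (R1) given each interpretation.
p_yes_interp1 = {"A": 0.8, "B": 0.5}
p_yes_interp2 = {"A": 0.5, "B": 0.5}  # A-group value implied by the 68% total


def p_yes(group: str) -> float:
    """Overall probability that a member of `group` answers Yes to the item."""
    p1 = p_interp1[group]
    return p1 * p_yes_interp1[group] + (1 - p1) * p_yes_interp2[group]


yes_a = p_yes("A")        # .6 * .8 + .4 * .5
yes_b = p_yes("B")        # .5 * .5 + .5 * .5
no_b = 1 - yes_b

# With equal group sizes, labeling every Yes as "A" (and every No as "B")
# is correct for the A members who say Yes and the B members who say No.
proportion_correct = (yes_a + no_b) / 2

print(round(yes_a, 2), round(yes_b, 2), round(proportion_correct, 2))
```

The three printed values correspond to the 68%, 50%, and 59% quoted in the paragraph above.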

Trying out various combinations of interpretation
and interpretation response probabilities,
and studying the resultant discriminating
power changes, indicates that the following
can be used as a generalized formula 
for the discriminating power of an item for 
the model presented above. This formula is 
appropriate for use with equal or unequal 
N (Meehl & Rosen, 1955) discrimination 
problems. 
\[
D = .5 + \frac{\left|\,(.5B + .5) - \left[(P P_1 + Q Q_1) + B\,(p\,p_2 + q\,q_2)\right]\right|}{B + 1}
\]


D = proportion correctly identified 

B = N for Group B/N for Group A 

P = Group A probability of r(s)1 

p = Group B probability of r(s)1 

Q = 1 - P 

q = 1 - p 
P1 = probability of R1 for the A group r(s)1 persons 
p2 = probability of R2 for the B group r(s)1 persons 
Q1 = probability of R1 for the A group r(s)2 persons 
q2 = probability of R2 for the B group r(s)2 persons 
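
As a check on the symbol definitions, the formula can be written as a short function. This is a sketch that reads the generalized formula as D = .5 + |(.5B + .5) - [(PP1 + QQ1) + B(pp2 + qq2)]| / (B + 1); the function and argument names are illustrative, and the second call simply sets every response probability to 1.0 to represent two perfectly discriminating interpretations.

```python
def discriminating_power(B, P, p, P1, Q1, p2, q2):
    """Proportion correctly identified (D) under the two-interpretation model.

    B      : N for Group B divided by N for Group A
    P, p   : probability of interpretation r(s)1 in Groups A and B
    P1, Q1 : probability of R1 (Yes) for Group A under r(s)1 and r(s)2
    p2, q2 : probability of R2 (No) for Group B under r(s)1 and r(s)2
    """
    Q = 1 - P
    q = 1 - p
    yes_a = P * P1 + Q * Q1   # proportion of Group A answering Yes
    no_b = p * p2 + q * q2    # proportion of Group B answering No
    return 0.5 + abs((0.5 * B + 0.5) - (yes_a + B * no_b)) / (B + 1)


# The worked example from the text: equal groups, P = .6, p = .5.
d_example = discriminating_power(B=1, P=0.6, p=0.5, P1=0.8, Q1=0.5, p2=0.5, q2=0.5)
print(round(d_example, 2))

# Two perfectly discriminating interpretations, at an arbitrary ambiguity level.
d_perfect = discriminating_power(B=1, P=0.3, p=0.7, P1=1.0, Q1=1.0, p2=1.0, q2=1.0)
print(round(d_perfect, 2))
```

The first call reproduces the 59% of the worked example. Because of the absolute value, D never falls below .5: an item that sorts people in the "wrong" direction is treated as equally useful once the labels are reversed.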


Conclusions concerning interpretive ambigu- 
ity and discriminating power. The general- 
ized item discriminating power formula pre- 
sented above was used to systematically study 
the influence of base rate changes and changes 
in interpretation and response probabilities 
on item discriminating power. The primary 
conclusions from this study are discussed 
below. 

1. Interpretive ambiguity in itself does not 
place a limit on item discriminating power. 
If the mediating interpretations are each per- 
fectly discriminating, the item will be per- 
fectly discriminating regardless of the level 
of ambiguity. (If P1 = Q1 = p2 = q2 = 1.0, 
D = 1.0 for all values of P and p.) This 
means that freedom from interpretive am- 
biguity is not a necessary assumption of per- 
sonality inventories, and research aimed only 
at describing the extent of interpretive am- 
biguity in personality inventories is irrelevant 
to personality inventory utility. 

2. If the mediating interpretations are 
equal in discriminating power, and the major- 
ity response is the same for both interpreta- 
tions, changes in interpretive ambiguity have 
no effect on item discriminating power. (If 
P1 = Q1 and p2 = q2, D is constant for all 
values of P and p.) However, in the special 
case where neither interpretation is highly 
discriminating and where the majority response
is different for the two interpretations 
(e.g., P1 = 1; p2 = 0; Q1 = 0; q2 = 1) an 
increase in item ambiguity which leads to 
different interpretation Probabilities for the 
two groups will lead to increased discriminat- 
ing power. 


3. When the interpretations differ in dis- 
crimination power, but the discrimination di- 
rection is the same for both interpretations 
(the direction of the relations between P₁ and
(1 − p₂) and Q₁ and (1 − q₂) are the same),
then: 

(a) Increasing item clarity by increasing 
the probability of the most discriminating in- 
terpretation (for the group most discrimi- 
nated by that interpretation) yields a linear 
increase in item discriminating power. Table 1 
interpretation probability values increasing 
from .5 through 1.0 give examples of these 
increases in item discriminating power for 
several levels of interpretation discriminating 
power. 

(b) Increasing item clarity by decreasing 
the probability of the most discriminating in- 
terpretation (for the group most discriminated 
by that interpretation) yields a linear de- 
crease in discriminating power. Table 1 in- 
terpretation probability values decreasing 
from .5 through .0 provide examples. 

These computations suggest an explanation 
of Elias’ data showing ambiguous items as 
more discriminating than unambiguous items 
(1951). It is hypothesized that the superi- 
ority of Elias’ ambiguous items was a func- 
tion of the increased probability of a more 
discriminating interpretation rather than a re- 
sult of the ambiguity itself. 

TABLE 1

CHANGE IN ITEM DISCRIMINATING POWER FOR VARIOUS
LEVELS OF INTERPRETATION DISCRIMINATING POWER
AND INTERPRETATION PROBABILITY
(Q₁ = p₂ = q₂ = .5; B = 1)

                      Item Discriminating Powerᵃ
Interpretation
Probability (P)     P₁ = .6     P₁ = .8     P₁ = 1.0
      .0              .50         .50         .50
      .2              .51         .53         .55
      .4              .52         .56         .60
      .5              .525        .575        .625
      .6              .53         .59         .65
      .8              .54         .62         .70
     1.0              .55         .65         .75

ᵃ If P₁ = .5, item discriminating power will equal .5 for all
values of P.


4. The discriminating power of the most
differentiating interpretation is the maximum
potential discriminating power of an item.
This maximum discrimination level is approached
as item wording changes increase the probability
of this most discriminating interpretation. This
conclusion suggests a change in the usual inventory
construction procedure of empirical selection of
statements from an a priori item pool. Increasing
discriminating power would be possible by empirical
selection among the various statement wordings
suggested by interpretations reported by respondents.
(The exception to this general rule is the special
case under 2 above.)

5. When mediating interpretations differ in
discrimination direction, changes in the probability
of an interpretation can cause a shift in the
discrimination direction of the item. For example,
consider the MMPI item “I know who is responsible
for most of my troubles.” This statement has two
common interpretations. One is to include one’s self
as a causal factor. The other is to interpret the
item as referring only to others as causing one’s
troubles. Let us assume, as seems reasonable, that
the first interpretation (including one’s self)
yields a greater percentage of Yes responses in
normals than among paranoids. (Assume P₁ = .5,
p₂ = .2.) Because of the paranoid’s greater tendency
to project causation, it is likely that the second
interpretation (excluding one’s self) yields a
greater percentage of Yes responses among paranoids.
(Assume Q₁ = .8, q₂ = .5.) MMPI respondent reports
suggest that as the item is now worded, the
interpretation excluding one’s self as a causal
factor has the highest probability. In an equal base
rate problem, a Yes respondent should then be classed
as abnormal. (This is what is done in practice. A Yes
respondent to this item is given an additional point
on the Pa scale.) However, if a clinician is disturbed
by the interpretations reported and “clarifies” the
item by changing the wording to “I am responsible for
my troubles” because of a belief that this is the
variable the test constructor was trying to get at,
the discrimination direction will shift. After this
wording change, Yes respondents should be labeled as
normal. The variation of discrimination power with
interpretive ambiguity for the interpretation values
used in this example is illustrated in Fig. 1.

[Figure 1: per cent correctly identified (vertical
axis) plotted against values of P and p from .2 to
1.0 (horizontal axis), with one curve for Yes response
labeled “B” and one for Yes response labeled “A”.]

Fig. 1. Shift in discrimination direction occurring
as a function of ambiguity lessening changes in item
wording. (B = 1, P₁ = .5, Q₁ = .8, p₂ = .2, q₂ = .5;
P and p values of 1.0 indicate an unambiguous item.)

School and other institutional administrators 
occasionally suggest item wording changes be- 
fore granting approval to use subjects for re- 
search projects. Although the changes sug- 
gested usually seem minor, the shifts in dis- 
crimination direction, which can be caused 
even by “clarifying” wording changes, make 
such changes a very dubious research procedure.

Taylor’s MAS wording changes (1953) are 
another example of clarifying items without 
adequate study of resultant shifts in item 
discriminating power and direction. Taylor 
tested the comparability of the old and re- 
vised MAS forms by studying their correla- 
tion in a college population. Taylor states the 
rationale for the use of this population as 


follows: 


A sample was selected from the college population
for this purpose since it was thought that this group
would show the least confusion in interpreting the
original versions of the difficult items and hence
better demonstrate the comparability of the two
forms than less verbally sophisticated individuals
(Taylor, 1953, p. 288).


Wording changes designed to elicit a greater
probability of the high intelligence group
interpretation of the original items (judges
selected these revised items for nearness to their
interpretation of the meaning of the original
item) were thus tested for comparability to the 
older items on a group explicitly selected for a 
high probability of the same interpretation of 
both item forms. Such a method certainly does 
not provide an adequate test of the compara- 
bility of original and revised test forms if the 
test is to be used in more general popula- 
tion samples. This item revision and revision 
testing method has not influenced research 
conclusions based on the MAS because the 
great bulk of research has been done using 
only the revised form. However, the same type 
of revision and revision testing done on a per- 
sonality inventory with an interpretive base 
grounded on research with the initial form, 
would be likely to lead to incorrect conclu- 
sions if the old form interpretations were car- 
ried over to the revised form. These comments 
apply to demonstrations of the comparability 
between any two test forms. Whenever there 
is a possibility of interpretive ambiguity, and
interpretation choices are not assumed to be
random, the comparability of two forms of 
the same test must be demonstrated on a 
population with relevant base rates compa- 
rable to the base rates in the population in 
which the tests are to be used. 
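
The shift in labeling described for the item used in Fig. 1 can be traced numerically. The sketch below is illustrative rather than the author's procedure: it assumes that Yes corresponds to R₁, uses the Fig. 1 caption values (B = 1, P₁ = .5, Q₁ = .8, p₂ = .2, q₂ = .5), sets P = p, and the helper names are hypothetical.

```python
def proportion_correct_yes_as_a(x, P1=0.5, Q1=0.8, p2=0.2, q2=0.5, B=1.0):
    """Proportion correct when Yes is labeled "A" and No is labeled "B".

    x is the probability of the first interpretation, taken equal in both
    groups (P = p), so x = 1.0 corresponds to an unambiguous item.
    """
    hits = (x * P1 + (1 - x) * Q1) + B * (x * p2 + (1 - x) * q2)
    return hits / (B + 1.0)


def best_label_for_yes(x, **values):
    """Return the group a Yes respondent should be assigned to, and the resulting D."""
    c = proportion_correct_yes_as_a(x, **values)
    if abs(c - 0.5) < 1e-9:
        return "none", 0.5          # the item does not discriminate
    return ("A", c) if c > 0.5 else ("B", 1.0 - c)


if __name__ == "__main__":
    for x in (0.2, 0.4, 0.5, 0.6, 0.8, 1.0):
        print(x, best_label_for_yes(x))
```

With these assumed values the preferred label for a Yes response changes from A to B as x passes .5, so a wording change that moves the item toward the first interpretation reverses the scoring direction.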

6. Use of personality inventories in populations
with different base rates can change item
discrimination direction and power. For example,
at P and p levels of .5 for the item used in Fig. 1,
55% correct identification can be obtained with a
2/3 A group base rate by labeling Yes respondents
as A. However, with a 2/3 B group base rate, Yes
respondents should be labeled as B, and with a .5
base rate for each group, the item is nondiscriminating.

These considerations are especially important
because of the almost invariable base rate
changes between test development populations
and populations in which the test is used
(Broen, 1957). Also, unless the interpretation
probabilities (P, p) and interpretation
response probabilities (P₁, p₂, Q₁, q₂) are
known for each group, the effects of known
base rate changes on item discrimination
direction and power cannot be computed. Three
alternatives are then open to test constructors
and validators: (a) accept introspective
reports of item interpretations as valid
self-reports and compute interpretation and
interpretation response probabilities from these
introspective reports; (b) study item dis- 
criminating power in populations with rele- 
vant base rates comparable to those in the 
population in which the test is to be used; 
(c) use unambiguous (single interpretation) 
items. The interpretation response probabili- 
ties for the group to be discriminated can, of 
course, be easily computed for unambiguous 
items. The utility of using respondent reports 
as ambiguity measures (in c) and interpreta- 
tion discriminators (in a) need not be only 
an assumption. The utility of the respondent 
reports can be demonstrated by research deal- 
ing with the accuracy of discriminating power 
predictions using the probability values ob- 
tained through the respondent reports for 
populations with various base rates. Such 
research would also yield data relevant to 
Vernon’s (1953, p. 140) assumption that 
the interpretations are distributed randomly 
throughout various diagnostic classifications.
The greater ease of (c) over (a) and the 
greater applicability to all populations with 
known base rates of (c) over (b) may be the 
strongest argument for freedom from ambiguity
as a criterion in item selection.
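
The base rate example in conclusion 6 can be checked with a short computation. This is a sketch under stated assumptions: the base rates in the example are read as two-thirds, the item values are those of the Fig. 1 caption with P = p = .5, Yes is taken to be R₁, and the function names are hypothetical.

```python
def fig1_item_at_half():
    """Response probabilities for the Fig. 1 item when P = p = .5 (assumed values)."""
    P1, Q1, p2, q2 = 0.5, 0.8, 0.2, 0.5
    x = 0.5
    yes_given_a = x * P1 + (1 - x) * Q1   # probability of Yes in group A
    no_given_b = x * p2 + (1 - x) * q2    # probability of No in group B
    return yes_given_a, no_given_b


def best_rule(base_rate_a):
    """Best labeling of Yes respondents for a given proportion of group A.

    base_rate_a equals 1 / (1 + B), where B is N for group B over N for group A.
    """
    yes_given_a, no_given_b = fig1_item_at_half()
    # Proportion correct when Yes is labeled "A" and No is labeled "B".
    c = base_rate_a * yes_given_a + (1.0 - base_rate_a) * no_given_b
    if abs(c - 0.5) < 1e-9:
        return "nondiscriminating", 0.5
    return ("label Yes as A", c) if c > 0.5 else ("label Yes as B", 1.0 - c)


if __name__ == "__main__":
    for rate in (2 / 3, 0.5, 1 / 3):
        print(round(rate, 3), best_rule(rate))
```

On this reading the same item calls for opposite scoring keys in the two populations and is useless when the groups are equal in size.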


Response Ambiguity 


The second type of item ambiguity—re- 
sponse ambiguity—has been defined above as 
the extent to which groups to be discriminated 
give similar responses and is, therefore, the 
inverse of item discriminating power. Because 
of this direct relation to item validity,
personality inventory criticisms based on response
ambiguity (Allport, 1937; Anastasi, 1954;
Eisenberg, 1941; Freeman, 1955; Rotter, 1954)
would seem to be more directly relevant to
personality inventory utility than criticisms
based on interpretive ambiguity. However, methods
for combining items of relatively low
discriminating power to make highly discriminating
scales are available. These methods involve using
item configurations (Meehl, 1950); using
nondiscriminating items as suppressors of
irrelevant variance (Horst, 1941); or combining
items which have somewhat higher correlations with
the criterion than with each other. An example of
the latter type of scale is provided by the MMPI Hy
scale which combines two relatively uncorrelated
sets of items, each of which provides increased
discrimination to the total scale by differentiating
on separate dimensions (McKinley & Hathaway, 1944).
These methods which enable the use of items of low
criterion discriminating power to construct highly
discriminating scales indicate that, like freedom
from interpretive ambiguity, freedom from response
ambiguity is not a necessary assumption for
personality inventories.

SUMMARY 


An ambiguity probability model was intro- 
duced to clarify the relation of personality in- 
ventory interpretive ambiguity to discriminat- 
ing power. Item discriminating power was 
found to be a function of interpretation prob- 
ability, interpretation discriminating power, 
and population base rates. Research and test 
construction implications of these findings 
were discussed, and several methods of using 
relatively nondiscriminating test items to increase
scale discriminating power were cited.


SEMANTIC STRUCTURES

The consistently poor showing of clinicians
in large scale global assessment studies (Kelly 
& Fiske, 1951; Holtzman & Sells, 1954) has 
led to increased concern over the bias intro- 
duced by constant processes in the assessor, 
his reliance upon the connotative meaning of 
key terms and his idiosyncratic use of con- 
cepts. 

In his analysis of certain judgmental vari- 
ables, Cronbach (1955) has shown that if it 
is a judge’s expectation that certain traits 
covary and others not, he will customarily 
report subjects to be high (or low) on all of 
a set of traits. Such a tendency, suggested to 
reflect an assessor’s implicit personality theo- 
ries, has been demonstrated in different types 
of judges in diverse settings (Jones, 1954; 
Soskin, 1954; Steiner, 1954). The psycholo- 
gist, psychiatrist, or social worker who first 
observes, tests, or interviews and then ap- 
plies labels or makes decisions is especially 
deserving of such attention to his semantic 
space since many of the concepts he uses are 
characterized by the looseness and ambiguity 
of their definitions (Grayson & Tolman, 1950) 
and their lack of integration in a testable 
nomological network (Feigl, 1949). More- 
over, since the multidisciplinary “team ap- 
proach” to the patient has found wide accept- 
ance, the lack of semantic clarity and pre- 
cision has been aggravated by problems of 
interdisciplinary communication. By virtue of
their different training and interests, and
because of the partly distinct role they assume,
it would seem reasonable to expect psychiatrists,
social workers, and psychologists to base their
inferences on distinct types of implicit
personality theories.

¹ This paper, presented at the meeting of the
Southwestern Psychological Association [place
illegible], 1959, represents a portion of a
dissertation submitted to the University of
Minnesota in partial fulfillment of the requirements
for the degree of Doctor of Philosophy. The author
is greatly indebted to his advisor, William
Schofield, and to others whose assistance aided in
the completion of this study.

In this study, the clinician’s semantic struc- 
ture is used to reflect that system of inter- 
relationship of meanings that forms one im- 
portant aspect of his implicit personality 
theories. This model, based on Osgood and 
Suci’s D measure (1952), represents similari- 
ties in connotative meaning between concepts 
as discriminated by an appropriate form of 
the semantic differential. As applied to groups, 
this technique has excellent reliability (Jen- 
kins, Russell, & Suci, 1957) and the model 
which is based upon it can be described as 
having good construct validity (Lazowick, 
1955; Mowrer, 1953; Osgood, 1952). 

This investigation will attempt to deal 
separately with interstructure differences (be- 
tween disciplines) and intrastructure differ- 
ences (among concepts). Two questions will 
be asked: 1. How similar are the implicit per- 
sonality theories of psychiatrists, psycholo- 
gists, and social workers, and with respect to 
what dimensions do they differ? 2. Does an 
intensive examination of such a network of 
associations suggest the existence of meaning- 
ful covariations or clusterings of concepts? 


METHOD 
Subjects 


The subjects (Ss) were 10 psychiatrists, 10
psychiatric social workers, 10 clinical psychologist
trainees, and 10 PhD psychologists. The first three
groups were chosen from among advanced graduate
students or psychiatric residents who were near
achieving that arbitrary level of training and
experience which their respective disciplines demand
of them before they are accorded full professional
status. The fourth group, composed of senior clinical
psychologists who held the PhD degree for an average
of 5.80 years, was added in order to further test the
sensitivity of the techniques employed and to increase
the number of psychologists to a more stable N for the
investigation of that group’s semantic structure.


TABLE 1

CONCEPTS AND SCALES USED IN THIS ANALYSIS

Concepts (listed in the order of their frequency)

  Diagnostic                               Therapeutic
  Anxiety Reaction                         EST (Electroshock Therapy)
  Depressive Reaction                      Psychotherapy
  Schizoid Personality                     Supportive Psychotherapy
  Schizophrenic Reaction                   Tranquilizing Drugs
  Schizophrenic Reaction, Paranoid Type    Reassurance
  Somatization Reaction                    Commitment
  Hysteria                                 Group Psychotherapy
  Psychopathic Personality                 Directive Psychotherapy
  Paranoid Personality                     Uncovering Psychotherapy
  Obsessive-Compulsive Reaction
  Passive-Dependent Personality

Scales

  Evaluation                 Potency               Activity
  Good-Bad                   Hard-Soft             Active-Passive
  Kind-Cruel                 Masculine-Feminine    Excitable-Calm
  Successful-Unsuccessful    Strong-Weak           Fast-Slow


Concepts 

The terms used in this study represent a sample
from an empirically derived pool of 95 terms based
on an examination of 80 medical folders from four
Minneapolis Psychiatric In- and Out-Patient
Services. Of the 31 terms or phrases relating to
therapy which were thus recorded, 9 were selected
from among the 13 most frequently used ones, by
discarding 4 somewhat ambiguous (e.g., Active
Psychotherapy) or less widely used concepts (e.g.,
Regressive EST). The 64 diagnostic terms in this pool
were likewise ranked in the order of their total
frequency of occurrence. The 10 most frequently used
diagnostic terms were included in this study,
together with Passive-Dependent Personality which was
used in spite of its lower frequency count because
of its contrast to the other terms. The concepts and
scales used in this investigation are reported in
Table 1.


Scales 


Of the many scales available from the various
factorial studies, none have been found to be
representative of one factor only, and it has been
suggested that the factor loadings of the scales
vary with the concepts being rated (Osgood, 1952).
Since the Ss were busy professionals, it was decided
to restrict the number of scales to nine to be chosen
from the pool of 20 scales that had been set up for
the Minnesota Atlas study (Jenkins, Russell, & Suci,
1957) and which are believed to be representative of
the major dimensions of meaning. A pilot study
determined which of these scales best differentiated
this type of concept from one another. The 9 scales
chosen for this study were, with one exception, the
scales having the largest variance. Although
Colorful-Colorless was a more widely used scale than
Masculine-Feminine, it was decided to include the
latter instead in order to give the three major
factors, evaluation, potency, and activity, equal
representation in terms of number of scales per
factor.


Procedure 

After the order of the concepts and of the scales 
had been randomized, and the poles of five of the 
nine scales reversed so that scales with the same
factor loadings were systematically counterbalanced, the
40 Ss were instructed to rate each concept on the
nine scales using the entire continuum in reporting 
their first impression of each term. 


RESULTS 
Intergroup Differences 


Each group’s semantic structure was deter- 
mined as follows. Each group’s mean scale 
profile for each concept was determined and 
as a measure of the similarity in connotation 
of meaning of any pair of concepts, Osgood 
and Suci’s D was obtained. D is defined as 
the square root of the sum of the squared
differences between scores on each of the scales
for any two concepts. Each group’s semantic 
structure is represented by the matrix of Ds 
reflecting the distances between the concepts 
used. Since the use of Pearson product-mo- 
ment correlations to gauge comparative simi- 
larity in semantic structures has been shown 
to be a sensitive measure by Osgood and 
Luria (1954) the correlations between each 
group’s set of D values and that of every 
other group were obtained and are reported 
in Table 2. As can be seen, none of the inter- 
correlations between group semantic struc- 
tures falls below .60. All correlations are sig- 
nificantly different from zero (p < .001). 
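
The two steps described in this paragraph, computing D for every pair of concepts and then correlating two groups' sets of D values, can be sketched in Python as follows. This is a minimal illustration, not the original computation: the function names are hypothetical, and any profile values passed in would be mean scale scores supplied by the user.

```python
from itertools import combinations
from math import sqrt


def d_matrix(profiles):
    """Osgood and Suci's D for every pair of concepts.

    profiles maps a concept name to its list of mean scale scores,
    with the scales in the same order for every concept.
    """
    return {
        (a, b): sqrt(sum((x - y) ** 2 for x, y in zip(profiles[a], profiles[b])))
        for a, b in combinations(sorted(profiles), 2)
    }


def pearson_r(xs, ys):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def structure_similarity(profiles_1, profiles_2):
    """Correlate two groups' D values over the same concept pairs."""
    d1, d2 = d_matrix(profiles_1), d_matrix(profiles_2)
    pairs = sorted(d1)
    return pearson_r([d1[k] for k in pairs], [d2[k] for k in pairs])
```

A correlation near 1.0 from `structure_similarity` would indicate that two groups order the concept pairs by distance in much the same way, which is the sense in which Table 2 compares the groups.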

In testing the significance of the differences
between product moment correlations reported
in Table 2, it was found that the semantic
structures of the psychology trainees and
the PhD psychologists are significantly closer
than (a) the psychology trainees and the
social workers (p < .01), (b) the psychology
trainees and the psychiatrists (p < .03) and
(c) the PhD psychologists and the social
workers (p < .01). Moreover, the semantic
structures of the PhD psychologists and of
the psychiatrists are significantly closer to
one another than the semantic structures of
the PhD psychologists and the social workers
(p < .02).


TABLE 2

INTERCORRELATIONS OF GROUP D MATRICES

                       PhD              Psychology
                       Psychologists    Trainees      Psychiatrists
Psychology trainees    .78
Psychiatrists          [illegible]      .69
Social workers         [illegible]      .60           .73


TABLE 3

NUMBER OF Ds DIFFERENTIATING THE FOUR GROUPS
AT THE .05 LEVEL OF SIGNIFICANCE¹

                       PhD              Psychology
                       Psychologists    Trainees      Psychiatrists
Psychology trainees    1 (1,0)
Psychiatrists          1 (0,1)          1 (0,1)
Social workers         7 (5,2)          10 (5,5)      3 (2,1)

¹ The number of Ds involving diagnostic and therapeutic
terms, respectively, is given in the parentheses.

In an effort to explicate the semantic differences
between social workers and the other disciplines,
the four groups were contrasted by means of Mann
Whitney U Tests on each of the 91 Ds or “links”
constituting their semantic structure. As can be
seen from Table 3, 23 Ds significantly differentiated
the four groups from one another at the .05 level of
significance. Twenty of these differences are between
social workers and the other groups; in contrast, the
psychologists and psychiatrists differ significantly
on only 3 Ds. Twelve of the 20 Ds (60%) involve
diagnostic categories while the remaining 8 concern
therapeutic terms. Since exactly 60% of the pool of
91 Ds relate to diagnostic terms, it can be seen that
social workers differ from the other disciplines as
much with regard to diagnostic concepts as they do
with reference to therapeutic phrases. A U Test
applied to the data of Table 3 indicates that the
number of significant differences between social
workers and the other disciplines is itself
significantly higher than the number of such
differences among the remaining three groups
(p = .05). Table 4 reports specific differences
between social workers on the one hand and
psychologists and psychiatrists on the other. It
should be noted that except for
Obsessive-Compulsive/Depressive Reaction the social
workers find these pairs of concepts to be closer in
connotative meaning than do their colleagues.


TABLE 4

PAIRS OF CONCEPTS DIFFERENTIATING THE GROUPS AT THE .05 LEVEL OF SIGNIFICANCE

Social Workers vs. Other Three Groupsᵃ

  Diagnostic
    Hysteria/Obsessive-Compulsive Reaction
    Passive-Dependent Personality/Paranoid Personality
    Obsessive-Compulsive Reaction/Psychopathic Personality
    Somatization Reaction/Paranoid Personality
    Hysteria/Somatization Reaction
    Hysteria/Psychopathic Personality
    Somatization Reaction/Psychopathic Personality
    Obsessive-Compulsive Reaction/Depressive Reaction

  Therapeutic
    Directive PRxᵇ/Tranquilizing Drugs
    Group PRx/Tranquilizing Drugs
    Uncov. PRx/Tranquilizing Drugs
    EST/Tranquilizing Drugs
    EST/Supportive PRx

PhD Psychologists vs. Psychology Trainees vs. Psychiatrists

  Diagnostic
    Hysteria/Somatization Reaction

  Therapeutic
    Reassurance/Supportive PRx
    Directive PRx/EST

ᵃ Only 13 out of 20 significant differences are reported
here since some of the Ds listed differentiated the social
workers from more than one other group.
ᵇ Psychotherapy.


The Semantic Structure of Psychologists 


Since relatively minor differences were ex- 
pected between the PhD psychologists and 
the psychology trainees, these groups were 
combined in order to add more stability to 
the results. Because the statistical study of 
relational measures such as D is cumbersome 
and requires many separate tests of signifi- 
cance for even a small number of concepts, 
only the nine therapeutic terms were used in 
this phase of the investigation. 

The 36 Ds between all pairs of the nine
therapeutic terms for each of the 20 psychologists
were computed, resulting in 20 matrices or 36 sets
of Ds each with an N of 20. Each concept can be
visualized as a point a certain distance away from
each of the other terms and the significance of the
difference in distance between different pairs of
concepts can be calculated by using each concept in
turn as a point of reference. In order to answer the
kind of question, “Is supportive therapy closer to
reassurance than it is to directive psychotherapy,”
it was necessary to compute 252 Mann Whitney U Tests
in all. Of these, 83 were significant at the .05
level, 65 being significant at the .02 level. The
presence of such a large number of significant
differences in the distance between therapeutic
concepts suggests the existence of clusters of
concepts. Examination of the statistically
significant Ds between the nine therapeutic terms
indicates the existence of three fairly distinct
clusters: (a) EST, Directive Psychotherapy, and
Commitment; (b) Uncovering Psychotherapy,
Psychotherapy, and Group Psychotherapy; (c)
Reassurance, Supportive Psychotherapy, and
Tranquilizing Drugs. Figure 1 is a two-dimensional
visualization of the interrelationships based
primarily on topological relations as defined by
the tests of significance between Ds.

[Figure 1: plotted points labeled Psychotherapy,
Uncovering Psychotherapy, Group Psychotherapy,
Commitment, EST, Directive Psychotherapy,
Reassurance, Supportive Psychotherapy, and
Tranquilizing Drugs.]

Fig. 1. A two dimensional representation of clusters
of therapeutic terms.
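
The counts of 91 links and 252 tests can be reproduced by simple enumeration. The sketch below is offered under assumptions that the text does not state outright: that Ds were computed only within the set of 11 diagnostic terms and within the set of 9 therapeutic terms, and that each of the 252 tests compares, for one reference concept, its distance to two other concepts. The function names are hypothetical, and `mann_whitney_u` returns only the U statistic, not a significance level.

```python
from itertools import combinations
from math import comb


def count_links(n_diagnostic=11, n_therapeutic=9):
    """Number of within-set concept pairs (assumed basis of the 91 links)."""
    return comb(n_diagnostic, 2) + comb(n_therapeutic, 2)


def reference_point_comparisons(concepts):
    """Yield (reference, a, b): is `reference` closer to a than it is to b?"""
    for reference in concepts:
        others = [c for c in concepts if c != reference]
        for a, b in combinations(others, 2):
            yield reference, a, b


def mann_whitney_u(xs, ys):
    """U statistic for xs: pairs (x, y) with x > y, counting ties as one half."""
    return sum(1.0 if x > y else 0.5 if x == y else 0.0 for x in xs for y in ys)


if __name__ == "__main__":
    terms = ["EST", "Psychotherapy", "Supportive Psychotherapy",
             "Tranquilizing Drugs", "Reassurance", "Commitment",
             "Group Psychotherapy", "Directive Psychotherapy",
             "Uncovering Psychotherapy"]
    print(count_links())                                   # 55 + 36
    print(len(list(reference_point_comparisons(terms))))   # 9 * 28
```

Under the first assumption, 55 of the 91 links are diagnostic, which agrees with the statement that about 60% of the pool relates to diagnostic terms.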


DISCUSSION


It is apparent from Tables 2, 3, and 4 that 
the greatest intergroup differences in implicit 
personality theories lie between psychologists 
(regardless of level of experience) and so- 
cial workers, with psychiatrists maintaining a 
mediating position between these two disci- 
plines. It must, however, be emphasized that 
the design of this study did not allow for the 
testing of the overall importance of these in- 
terdisciplinary differences; compared, for in- 
stance, to the differences that might possibly 
emerge from the contrasting of these groups 
with a group of laymen, such differences as 
were found in this study might seem minimal 
and insignificant. 

The position of the psychology trainees with 
respect to the senior psychologist is as ex- 
pected. There is a communality of interests 
and outlook between these groups made all 
the more pervasive by the fact that in this 
particular sample one group trains and
supervises the other. The relationship of both
groups of psychologists to the social workers 
is likewise as hypothesized. Of the three ma- 
jor interests of psychologists, psychodiagnos- 
tics, research, and psychotherapy, only the 
last is shared in some measure by social work- 
ers, whose major concern has traditionally 
been the social welfare of the patient. That 
psychiatrists should be as “close” to the 
senior psychologists as to the social workers 
had not been predicted. One possible explanation
would emphasize the fact that the psychiatrist
shares both the interest in diagnosis and therapy
(as well as in research to an in[illegible]) of the
psychologist, and the social worker’s concern with
the patient’s [illegible] status, etc. One important
inference can further be drawn from
these results: Since gross intergroup similari- 
ties and differences predicted on the basis of 
these groups’ interests and training are gener- 
ally reflected in interdisciplinary differences 
in semantic structures and implicit person- 
ality theories, some validity is given to the 
conclusion that semantic differential ratings 
and the model upon which the semantic struc- 
ture is based are sensitive tools and that they 
parallel extralinguistic interests and attitudes. 

In examining the pairs of concepts which 
social workers find more connotatively similar 
(Table 4), two alternative explanations offer 
themselves: (a) the two words are actually 
closer in connotative meaning for social work- 
ers or (b) social workers do not discriminate 
between the terms, as when all diagnostic 
terms are uniformly rated “somewhat bad.” 
The second alternative seems the likeliest in- 
asmuch as the pairs of concepts which are 
connotatively closer for social workers seem 
to involve constructs that are generally
thought to be quite distinct in meaning (e.g., 
Obsessive-Compulsive reaction/ Psychopathic 
personality). Since these are the only three 
such differences between PhD psychologists, 
psychology trainees and psychiatrists, it is 
difficult to decide whether they represent op- 
posite and interpretable attitudes or chance 
differences. If these are real differences, the 
content of the three pairs of concepts (e.g. 
Hysteria/Somatization reaction) would sug- 
gest arguable theoretical differences rather 
than lack of discrimination. In either case, 
the semantic structures of psychologists and 
psychiatrists are seen not to be appreciably 
dissimilar. The conclusion may, therefore, be 
drawn that insofar as these semantic struc- 
tures represent an aspect of the Ss’ implicit 
personality theories, the systems of covaria- 
tion of diagnostic and therapeutic terms used 
by psychiatrists and psychologists are much 
the same. Social workers, on the other hand, 
reveal attitudes and associations not shared 
by psychologists, although accepted to a 
larger extent by psychiatrists. 

Although the intensive analysis performed 
on the interrelationships among nine terms 
relating to therapy was based on the semantic 
structures of psychologists only, it can be seen 
from the above considerations that the results
could be generalized to the psychiatrists. The
first question that presents itself in this at- 
tempt to tap some aspects of the clinician’s 
implicit personality theories concerns the basis 
for the particular clustering that was found. 
What dimensions account for the intracluster 
similarities? If one considers the overall median
factor scores² of terms relating to therapy
separately for each of the three factors
represented in this study (evaluation, potency, 
and activity) it becomes clear that Cluster 
3 (Reassurance, Supportive Psychotherapy, 
Tranquilizing Drugs) is “valued” more highly 
than Cluster 1 (Directive Psychotherapy, 
Commitment, EST); Cluster 1, however, is 
more potent or stronger as well as more ac- 
tive. Cluster 2 (Psychotherapy, Uncovering 
Psychotherapy, Group Psychotherapy) is not 
as cohesive as the other clusters and it tends 
to share in the characteristics of the other two 
groupings. 

What attitudes and implicit beliefs relevant
to the therapies most commonly prescribed
for psychiatric patients can be inferred from
this data? It might be argued that the terms
in each cluster represent treatments of choice
under certain specifiable conditions, other
things being equal. Cluster 1, the “strong,
active” therapies, might be most applicable to
most situations which to the mind of the clinician
demanded such rapidly direct and active therapeutic
interference at the cost of being forced to
sacrifice more highly regarded or valued therapies.
Cluster 3, the “good, kind” therapies, would be
chosen when a nearly opposite situation existed:
the emphasis required would be on a “good, kind,
successful” therapy involving relatively little
direct interference with the patient’s
psychopathology. The third cluster, because of its
middle position on the three factors, could be said
to represent a compromise between more valued
therapies and more active and strong ones. Where
neither support and reassurance nor active
interference are required, Psychotherapy, Uncovering
Psychotherapy, and Group Psychotherapy would then
seem to be indicated. These terms also differ
relatively more among themselves than is the case
with the other two clusters. Clinicians seem to
attach a valuative connotation to Psychotherapy that
they find lacking in the other two terms. Is it
possible that they have in mind a type of
psychotherapy (the one they practice, perchance?)
which they prefer to the ones listed here?

² [illegible] of diagnostic and therapeutic terms
based on factor scores is available from the author.

The value of such inferences must be tested 
primarily from their agreement with other 
knowledge. Insofar as these interpretations 
clarify the decision making activities of cli- 
nicians, and insofar as clinical practice in turn 
lends credence to them, they contribute to 
our knowledge of clinicians. Unfortunately, no 
systematic data is available that reports on 
clinicians’ attitudes towards these therapies. 
One is therefore forced to fall back on such
a weak criterion as “do these conclusions make
sense in light of how clinicians are commonly
believed to feel about these therapies?” The
present writer would suggest that extralinguistic
attitudes and beliefs concerning therapy
are indeed being tapped which are in
accordance with the way in which clinicians
implicitly match a therapy with their “feeling
about the patient.” Nevertheless, it cannot
be too strongly emphasized that this is
an exploratory study in an area in great need
of more precise and systematic investigation.


SUMMARY AND CONCLUSIONS


In an effort to investigate the implicit per- 
sonality theories of clinicians, 10 clinical psy- 
chology trainees, 10 psychiatrists, 10 social 
workers, and 10 senior clinical psychologists 
were asked to rate 11 diagnostic and 9
therapeutic concepts in frequent use on nine
appropriate semantic differential scales. The
resulting semantic structures define an aspect
of the groups’ implicit personality theories. 
Major conclusions were as follows: 

1. Significant intergroup differences in se- 
mantic structures and in degree of connota- 
tive similarity assigned to pairs of concepts 
exist between psychologists (regardless of 
level of experience) and social workers, with 
psychiatrists maintaining a mediating position
between these two disciplines. It is suggested
that social workers make fewer discriminations
between the connotative meaning
of psychiatric and psychologic terms.

2. An intensive analysis of the interrelationships
between nine terms relating to therapy
for 20 psychologists suggests the existence
of three clusters of terms: (a) Directive
Psychotherapy, Commitment, and EST;
(b) Psychotherapy, Uncovering Psychotherapy,
and Group Psychotherapy; and (c) Reassurance,
Supportive Psychotherapy, and Tranquilizing
Drugs. These clusters (a) can be
said to represent the degree of equivalence and
interchangeability of the terms in the cluster
and (b) are suggestive of situations in clinical
practice in which the terms of one cluster
would be more applicable than those of another.
Such clusterings can be presented as
providing an insight into the implicit personality
theories of clinicians.


WISC IQs FOR THE MENTALLY RETARDED


Many psychologists using the Wechsler
Intelligence Scale for Children (WISC) have
been chagrined to discover that a client has 
earned an insufficient number of weighted 
score units to obtain an IQ from the tables 
provided: the lowest full scale IQ given in 
the WISC manual is 46, which requires a 
scaled score of 26 (Wechsler, 1949, p. 26). 
The primary purpose of this paper is to pro- 
vide extrapolated IQs down to 28, which is 
obtained when the scaled score is equal to one. 

This extrapolation provides IQs to within 
the Idiot range, according to Wechsler’s 
(1958, p. 57) classification system. For a 
variety of reasons, it is considered important 
by psychologists and others working with the 
mentally retarded to differentiate among them. 
Many institutional policies and state legisla- 
tion require a diagnosis of level of mental re- 
tardation or deficiency. This obviously can- 
not always be provided with the published 
WISC data. IQs often play a significant role 
in determining differential therapy, treatment, 
and training or education in these institutions.
However, the principal and most valid
reason for desiring to differentiate among
individuals with IQs below 46 is to further
research and understanding of them. The potential
positive contributions to society of these
individuals are only beginning to be recognized
(Hutt & Gibby, 1958, p. 99).

Fig. 1. WISC full scale IQs and scaled scores with
extrapolated points below norms.

In Fig. 1, the Full Scale WISC IQs have
been plotted against WISC scaled scores. This
resulted in a straight line which can be
described by the formula IQ = (scaled score)
(.727) + 27.4. The formula gives results, when
rounded to the nearest whole IQ value, which
are identical with those published in the WISC
manual for every IQ between 46 and 85 with
only one exception: with a scaled score of
69, the formula gives the calculated IQ value
of 77.56, while Wechsler’s Table XI B (1949,
p. 26) gives an IQ of 77.

Table 1 presents extrapolated WISC IQs
for scaled scores from 1 to 25.


TABLE 1
EXTRAPOLATED WISC IQs: IQ = (SCALED SCORE)
(.727) + 27.4

Scaled    Calculated    Smoothed
Scores    IQ Values     IQ Values
  25        45.58          46
  24        44.85          45
  23        44.12          44
  22        43.39          43
  21        42.67          43
  20        41.94          42
  19        41.21          41
  18        40.49          40
  17        39.76          40
  16        39.03          39
  15        38.31          38
  14        37.58          38
  13        36.85          37
  12        36.12          36
  11        35.40          35
  10        34.67          35
   9        33.94          34
   8        33.22          33
   7        32.49          32
   6        31.76          32
   5        31.04          31
   4        30.31          30
   3        29.58          30
   2        28.85          29
   1        28.13          28
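
The extrapolation can be reproduced from the formula alone. The following is a minimal sketch, not part of the original note: the function names are illustrative, and it assumes that the "smoothed" values are simply the calculated values rounded half-up to the nearest whole number, which is consistent with every row of Table 1.

```python
from decimal import Decimal, ROUND_HALF_UP

SLOPE = Decimal("0.727")     # coefficient from the formula in the text
INTERCEPT = Decimal("27.4")  # constant from the formula in the text


def calculated_iq(scaled_score: int) -> Decimal:
    """Calculated IQ to two decimals: IQ = (scaled score)(.727) + 27.4."""
    raw = SLOPE * scaled_score + INTERCEPT
    return raw.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


def smoothed_iq(scaled_score: int) -> int:
    """Calculated IQ rounded to the nearest whole IQ value (assumed rule)."""
    raw = SLOPE * scaled_score + INTERCEPT
    return int(raw.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


if __name__ == "__main__":
    # Print the extrapolated range in the same order as Table 1.
    for score in range(25, 0, -1):
        print(f"{score:>2}  {calculated_iq(score)}  {smoothed_iq(score)}")
```

Decimal arithmetic is used here so that values ending in exactly 5 in the third decimal place (scaled scores 5, 15, and 25) round the same way as the printed table; ordinary floating point may round some of them differently.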

These extrapolated IQ values may be used 
appropriately only with certain cautions or 
limitations. Firstly, although the empirical re- 
lationship is known to be linear, some rela- 
tionships that are linear “in the middle range” 
of their distributions depart from linearity at 
the extremes. 

Generally, the further the extrapolation is 
made from the empirical data, the greater the 
probability of error. In this case, as our
extrapolated IQs decrease from 46, the less
confidence we have in our extrapolation, the lower
the expected reliability of the value, and the
less its probable psychological significance.
Also attenuating the reliability of the lower
extrapolated IQs is the smaller number of
items successfully completed to establish the
scaled score.


INCIDENCE OF BENDER-GESTALT FIGURE ROTATIONS


RICHARD M. GRIFFITH AND VIVIAN H. TAYLOR

Kentucky


One striking distortion in the drawing of
the Bender Visual Motor Gestalt Test (BG)
or in the reproduction of block designs, the
rotation or angular displacement of one or
more of the figures, would be more meaningful
to the diagnostician if he had statistics on
the frequency of the phenomenon in his clinical
groups and if he had an adequate theory
to elucidate its significance. Aimed at the
second problem, a series of attempts to furnish
a theoretical underpinning through
investigating factors which produce rotations
was begun by Shapiro (1953) followed by
others (Williams, Lubin, Gieseking, & Rubenstein,
1956; Yates, 1956) who used similar
techniques and, most recently, by Hannah
(1958) who worked independently. Earlier,
Hanvik and Andersen (1950) had considered
the effect of focal brain lesions on the
production of rotations. The present note
provides actuarial statistics on the first problem:
the frequency of rotations in the BG
drawings of different clinical groups.


METHOD


Drawer by drawer, the files of folders accumulated
through eleven years of testing by the Psychology
Service of a neuropsychiatric hospital were searched
until approximately 1000 BG records (probably one
third of all those available) were examined. The
average age of a restricted sample (331 cases) was
33.5 (SD, 11.2) at the time the test was administered.
The distribution of ages was skewed by the
WWI veterans, the median age being 33.4. The IQs
of 282 of these patients averaged 94.9 (SD, 16.4).
As standard operating procedure, looking ahead to
future research possibilities, the diagnosis which had
been reached by a team composed of psychiatrist,
psychologist, and social worker had been noted in
each folder. For the present purposes this diagnosis
was accepted without further evidence. It is recognized
that the design of the study is to some extent
contaminated in that the BG, however small its influence,¹
had been one of the considerations in arriving at the
diagnosis.

The diagnoses were broadly categorized as Schizophrenia,
Neurosis, Character Disorder, Chronic Brain
Syndrome, Mental Deficiency, and Other, the latter
being a catch-pile for the less frequent diagnoses and
for those instances for which the diagnosis had
inadvertently been omitted. When it was noted that
the percentage of rotations was high for the first few
mental defectives the number of their records was
supplemented by combing other drawers of the
files; thereafter, their figures were not used in
computing the totals for the hospital population. A
rotation was defined as an angular displacement of 45
degrees or more in a figure which otherwise remained
recognizable.

TABLE 1

PERCENTAGE OF BG RECORDS WITH
ONE OR MORE ROTATIONS

                                   Percentage with
                             N     Rotation(s)
Schizophrenia               346      19.7
Neurosis                    167      19.8
Character Disorder          165      16.4
Chronic Brain Syndrome      147      40.8*
Mental Deficiency            59      55.9*
Other                       178      23.0
Total (a)                  1003      22.8

(a) Totals do not include mental defectives.
* Chronic Brain Syndrome and Mental Deficiency differ at the
.05 probability level; each differs from all other groups at the
[...] level.


¹ With some clinicians during much of the period
covered by the records the BG had been administered
primarily as a buffer to what were considered
more valid tests. That its effect on diagnoses was
never great may be concluded from the breadth of
evidence considered by the diagnostic team and from
the fact that before they were told the results of the
present study a group of 14 psychodiagnosticians
within the hospital estimated almost as high a
percentage of rotations by the schizophrenics (36)
as by the organics (43).


RESULTS


Table 1 gives the percentage of the records
of each diagnostic group in which at least one
of the BG designs had been rotated. To the 
surprise of the investigators and their col- 
leagues, the percentage for the schizophrenics 
did not differ materially from those for the 
neurotics and the character disorders (19.7, 
19.8, and 16.4, respectively). Above this 
cluster of values the rotations by the Chronic 
Brain Syndrome and the Mental Deficiency 
groups rose to 40.8 and 55.9%. The 23.0% 
for Others corresponds closely to the 22.8 for 
the total population (exclusive of the men- 
tally defective). 
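
The figures in Table 1 can be checked arithmetically. The sketch below is an illustration rather than the authors' procedure: it recomputes the N-weighted total excluding the mentally defective (which comes to about 22.85%, in line with the reported 22.8) and, as an assumed stand-in for the unspecified significance test, a pooled two-proportion z statistic for Chronic Brain Syndrome versus Mental Deficiency (about 1.97, in line with a difference at the .05 level). The function names and the choice of test are assumptions.

```python
import math

# (N, percentage of records with one or more rotations), copied from Table 1.
GROUPS = {
    "Schizophrenia": (346, 19.7),
    "Neurosis": (167, 19.8),
    "Character Disorder": (165, 16.4),
    "Chronic Brain Syndrome": (147, 40.8),
    "Mental Deficiency": (59, 55.9),
    "Other": (178, 23.0),
}


def pooled_percentage(groups, exclude=()):
    """Return (total N, N-weighted percentage) over the groups kept."""
    kept = {name: v for name, v in groups.items() if name not in exclude}
    total_n = sum(n for n, _ in kept.values())
    total_rotated = sum(n * pct / 100 for n, pct in kept.values())
    return total_n, 100 * total_rotated / total_n


def two_proportion_z(n1, pct1, n2, pct2):
    """Pooled two-proportion z statistic (assumed test, not the authors')."""
    x1 = round(n1 * pct1 / 100)  # record counts recovered from percentages
    x2 = round(n2 * pct2 / 100)
    p_pool = (x1 + x2) / (n1 + n2)
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n1 + 1 / n2))
    return (x2 / n2 - x1 / n1) / se


if __name__ == "__main__":
    print(pooled_percentage(GROUPS, exclude=("Mental Deficiency",)))
    print(two_proportion_z(147, 40.8, 59, 55.9))
```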


DISCUSSION


Hanvik and Andersen (1950), requiring less 
deviation (30 degrees vs. 45) and employing 
more stringent diagnostic standards, reported 
a somewhat larger percentage of rotations by 
organics. Of a group of 44 patients with focal 
brain lesions which had been verified either 
through surgery or by X-ray, the records of
59.1% had one or more rotations as com- 
pared with 40.8% for the present group, 
which was more heterogeneous. Their 18.9% 
for 37 normals and neurotics is not out of 
line with the results for the present NP pa- 
tients—other than organic and deficient. 

Because of the relatively small number of 
mentally defective patients, their percentage 
rotations is not considered to be as reliable 
as those for the other diagnoses. Also, in a
hospital population the diagnosis is always
suspect; mental illness may mimic deficiency. 
That a relation between intelligence and ro- 
tations, as conjectured in the literature (Wil- 
liams et al., 1956; Yates, 1956), does exist, 
is supported by comparing the IQs of 56 ro- 
tators and 226 nonrotators: the means of 84.3 
and 97.5, with standard deviations of 13.3
and 17.1, respectively, differ by the t test at
the .01 level.
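
The reported comparison can be approximated from the summary statistics alone. The sketch below assumes a pooled-variance Student's t for independent groups; the note says only "t test," so the exact variant is an assumption, and the function names are illustrative. With the reported means, standard deviations, and group sizes it gives t of roughly 5.4 on 280 degrees of freedom, and the N-weighted mean of the two groups agrees with the overall mean IQ of 94.9 given in the Method section.

```python
import math


def pooled_t(mean1, sd1, n1, mean2, sd2, n2):
    """Student's t for two independent groups from means, SDs, and Ns.

    Assumes equal population variances (pooled-variance form).
    Returns (t, degrees of freedom).
    """
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean2 - mean1) / se, df


def combined_mean(mean1, n1, mean2, n2):
    """N-weighted mean of two groups."""
    return (mean1 * n1 + mean2 * n2) / (n1 + n2)


if __name__ == "__main__":
    # Rotators: mean 84.3, SD 13.3, N 56; nonrotators: mean 97.5, SD 17.1, N 226.
    print(pooled_t(84.3, 13.3, 56, 97.5, 17.1, 226))
    print(combined_mean(84.3, 56, 97.5, 226))
```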


SUMMARY 


This study, confirming the belief of cli- 
nicians that rotations of Bender’s gestalten are 
of diagnostic significance, establishes the rela- 
tive frequency of such distortions by clinical 
groups within one neuropsychiatric hospital.


Most measures of the self-concept generate
scores which are treated as positive or negative
points on a value continuum, despite evidence
(Osgood, Suci, & Tannenbaum, 1957) which suggests
that a meaningful concept such as the
“self” comprises an aggregate of factors rather
than a single evaluative dimension. The present
study, which was undertaken in an attempt to
categorize self-ratings in such a way as to permit
a sensitive evaluation of change in psychiatric
patients during therapy, casts further light on the
factorial structure of the self-concept.

A self-rating device composed of 70 bipolar
adjectives descriptive of human personality,
including antonyms by Osgood and terms selected
from several popular self-concept measures,
was completed by a heterogeneous group of
[...] male psychiatric patients. Ratings
of each S on all 70 scales were dichotomized
and a 70 x 70 matrix of phi coefficients
was computed. Two overlapping portions
of this matrix were separately factor
analyzed and rotated by the quartimax method,
an analytic procedure (conveniently programmed
for the computer) which provides an objective
approximation of orthogonal simple structure.
Five interpretable factors were found: Factor I
[...] broad evaluative dimension
[...] by success and
satisfaction with life’s affairs. It corresponds
closely to Osgood’s evaluative factors, and
appears to be the domain implied by most writers
when they refer to the self-concept. Factor II
(anxiety-tension) is defined by feelings of
tiredness, emptiness, fear, and confusion. This
dimension, more so than Factor I, may reflect
symptoms of distress that are amenable to change
during therapy. Factor III (independence)
appears related to subjectively perceived intelligence
and leadership ability. The opposite pole may
represent passive acceptance, conformity, and
felt inadequacy and inability to lead. Factor IV
(estrangement) is characterized by feelings of
distance and artificiality versus a natural and
easy relationship with others. Factor V (body
image) is similar to Osgood’s potency dimension.
It is defined by adjectives related to body size
and physical strength or potency (e.g., small,
light, underweight versus large, strong, rugged,
heavy).

Two additional factors were obtained but not
interpreted.

The results help explain the findings of
investigators who have noted poor correspondence
between different tests of the self-concept and
low correlation between the self-concept and
external criteria of adjustment. It seems likely that
these investigators dealt with measures which may
have unwittingly confounded several self-concept
variables. While it is generally agreed that the
extremes of thinking, feeling, and behaving
described here are typically observed in psychiatric
groups, such states are not qualitatively different
from those found in normals. Thus, whether
“normals” or disturbed Ss are dealt with, tests
which treat self-ratings in simple additive fashion
may obscure attributes or dimensions of the
self-concept that should be treated independently.

¹ An extended report of this study may be
obtained without charge from [...] or for a fee from the
American Documentation Institute. Order Document
[...], $1.75 for microfilm or [...].
[...] at the Ann Arbor VA Hospital. [...]


THE EFFECTS OF CHLORPROMAZINE ON PSYCHOLOGICAL
TEST SCORES ¹


As a consequence of the widespread use of
chlorpromazine with new admissions in psychi- 
atric hospitals, patients seen for diagnostic test- 
ing have usually had drug medication for several 
weeks prior to testing. Because the medication
obviously introduces a new factor into the test- 
ing situation, it is important to evaluate the effect 
of the drug on test performance. The aim of this 
investigation is to determine the effects of chlor- 
promazine on the following tests: the Wechsler- 
Bellevue, the Rorschach, the Bender-Gestalt, the 
Porteus Maze, and the MMPI. 

Experimental and control groups were selected 
from new admissions to a state hospital. All Ss 
were administered the test battery within 4 to 10 
days after admission and before any drug medi- 
cation had been prescribed. Ss were placed on 
either chlorpromazine or placebo after comple- 
tion of the battery and retested 30 days later. 
Both drug and placebo were discontinued for 48 
hours prior to retesting to minimize the influence 
of the drug’s side effects on test performance. 
There were no significant differences between the
two groups in age or education and the modal
diagnosis in both groups was schizophrenia. Both
male and female patients were included in the
sample. There were 18 Ss in the drug group, 21
Ss in the control group. The chlorpromazine dosage
received by each experimental S varied with
[...] the drug but for most Ss it was
[...]. [...] under the conditions
[...] chlorpromazine has little or no effect
on psychological test scores. On the Wechsler-Bellevue
and Bender-Gestalt, there were no significant
differences in gains between the two
groups. The latter result differs from Winter and
Frederickson’s (1956) report that the drug impaired
Bender-Gestalt performance. However,
they attribute their finding to the side effects of
the drug and our procedure tended to reduce these
effects. On the Porteus Maze, there was some
improvement in performance for the control group
and virtually no change for the experimental
group but the difference in gains falls short of
significance (t = 1.60). The difference is in the
same direction as that reported by Porteus (1957)
who found that chlorpromazine reduced Maze
Test scores.

¹ An extended report of this study may be
obtained [...] from Abe J. Judson, [...]
University, Utica, N. Y., or [...]
Documentation Institute. [...] $1.25 for
micro[...].

The MMPI and Rorschach results are essen- 
tially negative. There were no significant differ- 
ences in gains on any of the MMPI scales. The 
Rorschach was scored for W%, D%, d,dd%,
F%, F-%, M%, FM%, FC%, CF,C%, A%,
H%, ShR%, P, R, and RT. Again, there were
no significant differences in gains on any of these
variables. Although the results are generally negative,
it is worth noting that on the F scale of the
MMPI and the F-% score of the Rorschach, both
measures of unrealistic thinking, the drug group
showed greater improvement (t = 1.83 and 1.32,
respectively). These results are not significant but
they do point to a possible effect of chlorpromazine
on test performance.


SIMULTANEOUS RECORDING OF BALES AND CHAPPLE INTERACTION
MEASURES DURING INITIAL PSYCHIATRIC INTERVIEWS ¹


The content of standardized initial psychiatric
interviews is analyzed through the use of the
Bales (1950) categories and interaction profile
and correlated with simultaneously recorded
Chapple measures of the same interviews
(Matarazzo, Saslow, & Matarazzo, 1956). Subjects
[...] referred to the outpatient psychiatry
department in a large hospital. Their average
age is 36; 17 are female and 7 male.

The characteristics of the partially standardized
interview are described in detail in the article
by Matarazzo et al. (1956). The interviewer,
an experienced psychiatrist, applies a behavior
stress in the second and fourth periods of the
interview. The second period consists of a silence
stress as the interviewer fails to respond by waiting
up to 15 seconds after the subject has finished
speaking. In the fourth period the interviewer
interrupts the subject for a total of 12 times.
Time limits are set on the length of each period.

Two scores which are comparable to the Chapple
scores of Units and Action were derived from
the Bales data by dividing the total number of
acts for a patient into two components. The
number of times a patient spoke, his Contributions,
was computed by counting the number of
[...] patient began speaking to the psychiatrist
[...] similar to the Chapple score
[...]. The total number of Bales
acts for each patient was divided by [...]
of Contributions to give [...]
the average number of acts per [...].
This Duration score based on the Bales [...]
similar to the Chapple score for Patient [...].
No measure similar to the Chapple Silence [...]
appeared to be derivable from the Bales data.

The rank order correlations between the major
Chapple variables of Units, Action, [...]
and three scores derived from the [...],
Total Acts, Contributions and Duration, are given
in Table 1.

The correlation between number of Chapple
Patient Units and Bales (Patient) [...]
(+.96) is as high as that obtained for
interobserver reliability of Chapple [...]
observers using two Interaction Chronographs
(Phillips, Matarazzo, Matarazzo, & Saslow, 1957).
The correlation between Chapple Patient Action
and Bales Patient Duration (+.92) is also high
enough to make the two measures [...]able
in the present sample.


TABLE 1
RANK ORDER CORRELATIONS BETWEEN BALES AND CHAPPLE VARIABLES
(N = 24)

Variables: 1. Units (Chapple); 2. Action (Chapple);
3. Silence (Chapple); 4. Total Acts (Bales);
5. Contributions (Bales); 6. Duration (Bales).
[Matrix entries not recoverable from the source.]


REFERENCES

Bales, R. F. Interaction process analysis. Cambridge:
Addison-Wesley, 1950.

Matarazzo, J. D., Saslow, G., & Matarazzo, Ruth
G. The interaction chronograph as an instrument
for objective measurement of interaction patterns
during interviews. J. Psychol., 1956, 41, 347-367.

Phillips, Jeanne S., Matarazzo, J. D., Matarazzo,
Ruth G., & Saslow, G. Observer reliability of
interaction patterns during interviews. J. consult.
Psychol., 1957, 21, 269-275.

¹ An extended report [...] may be obtained
without charge from A. Paul Hare, Dept. Social
Relations, Harvard University, Cambridge, Mass.,
or for a fee from the American Documentation
Institute. Order Document No. 6154, remitting $1.25 [...].


[...]ORS AND THE PSYCHIATRIC COMPLAINTS OF DISTURBED
CHILDREN ¹


This study explored relations between symptomatic
complaints made by parents about children
in outpatient treatment and social status,
sex, religion, age, family size, ordinal position,
parental age differences, parent-child age differences
and family status (presence or absence of
both parents).

Data were collected from records of 265 cases
in an urban child guidance clinic. Complaints were
[...], the same was true of neurotic
disorders, [...] relationship [...]
a chi square analysis revealed complaints of
[...] for the [...] “nonnormal” families
were significantly overrepresented in the clinic
and were significantly older than “normal” family
children.

¹ An extended report of this study may be
obtained without charge from R. G. Hunt, Psychology
Dept., Washington University, [...].

These modest results suggest several possibilities,
one of which is that social factors are less
important in shaping disorders of children than
of older groups. It is also possible, though, that
intake policies and/or cultural attitudes toward
psychiatry constitute “filters” producing spuriously
homogeneous patient populations.


Although [...] a very different rationale, both
[...] social psychologists generally
[...] process of parental [...] to
[...] factor in adult sexual adjustment.
[...] into the phenomenon of self-identification
with the parent of the same sex
is uncovering an increasing number of factors
relating to the process (Beier & Ratzenburg,
1953; Ewing, 1954; Ferguson, [...];
[...]dyce, 1953; Seward, 1945; Sopchak, 19[...];
Winch, 1951). Whether and how such identification
is related to marital interaction is
important to an understanding of the marriage
relationship and needs to be explored
empirically. A similar factor which may relate
to the dynamics of the marital relationship
is the degree to which a marriage partner
equates his spouse with his parent of the
opposite sex. Studies (Ferguson, 1938; Hamilton,
1929; Terman, 1938) previously investigating
the association have been inconclusive.
Studies in mate selection (Commins,
1932; Mangus, 1936; Strauss, 1946a, 1946b,
1947; Winch, 1958) have generally found
that a positive relationship exists between the
image one has of his parent of the opposite
sex and the spouse he selects.

This investigation has sought to determine
whether identification of self with parent of
the same sex and the equation of spouse with
parent of the opposite sex is associated with
marital satisfaction. It has investigated two
hypotheses which may be stated in the null
form: There is no difference in population
means between two groups defined as satisfactorily
and less satisfactorily married with
regard to the degree of congruence:

I. Between the subject’s self-concept
and his concept of the parent of the same
sex;

II. Of the concepts the subject holds of
his spouse and of his parent of the opposite
sex.


METHOD


The Criterion and Sample 


“Satisfactorily” and “less satisfactorily” married 
were operationally defined by 22 items found by 
Locke (1951, 1959) to discriminate between success- 
fully married and divorced subjects. These items 
were subsequently used by Wallace (1947) and by 
Swan (1952) and found valid and reliable. The 
Locke items include inquiry into the permanence of 
the marriage, adaptability, interests, consensus, af- 
fection, sex satisfaction, and general satisfaction. 
Each item is scored according to weightings deter- 
mined by Locke’s study, and a total score is de- 
termined by adding these. As a second, independent 
measure the Terman (1938) self-rating happiness 
scale was observed to substantiate the findings. The 
Terman scale is a horizontal line which runs from 
“Extremely happy” through seven indicated degrees 
of marital bliss to “Extremely unhappy.” Terman 
assigned weights to each position: 15, 12, 9, 6, 3, 
0, and 0. These weightings were used in this study. 

The original population consisted of 594 former 
students who had been enrolled in Family Life 
classes, 1948-50, at the University of Minnesota, and 
who in a 1953 survey indicated that they were mar- 
ried. These 241 men and 353 women were sent 26 
classification questions: age, sex, vocation, income, 
parental background, etc., and the Modified Marital 
Adjustment Scale as suggested by Locke (1951) plus 
the Terman self-rating happiness scale. Responses 
were received from 454 persons (76.5% of the 
group). 

The Locke items were scored and ranked. The 
distribution ranged from 118 to 52 points: median 
102, mean 101.13, and standard deviation of 10.71. 
Using points Q3, score 109, and Q1, score 96, as cutting
points, those with the highest scores, mean
111.94, and those with the lowest, mean 85.94, were 
selected to participate in the second step of the 
study. 
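
The selection step can be sketched as follows. This is an illustration only: the respondent identifiers and scores are hypothetical, and whether scores exactly at the cutting points (96 and 109) were included in the extreme groups is not stated in the text, so the inclusive comparison here is an assumption.

```python
def split_by_quartile_cuts(scores, q1_cut=96, q3_cut=109):
    """Split respondents into lower and upper groups at fixed cutting points.

    `scores` maps a respondent code to a total Locke score. The default
    cutting points are the Q1 and Q3 values reported in the text; treating
    scores equal to a cutting point as inside the group is an assumption.
    """
    lower = {code: s for code, s in scores.items() if s <= q1_cut}
    upper = {code: s for code, s in scores.items() if s >= q3_cut}
    return lower, upper


if __name__ == "__main__":
    # Hypothetical respondents, for illustration only.
    example = {"A01": 118, "A02": 109, "A03": 102, "A04": 96, "A05": 52}
    lower_group, upper_group = split_by_quartile_cuts(example)
    print(sorted(lower_group), sorted(upper_group))
```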

Requests were sent to the 116 individuals in the
upper group and 108 in the lower group soliciting
the participation of their spouses. Code numbers
were used and confidentiality assured. In the upper
group 71.5% of the spouses returned Locke questionnaires
which they had been requested to mark;
60.5% of the lower group. The fact that fewer [...]

Fig. 1. Interpersonal check list profile illustrating personality variables
according to octants and scales.

When the Locke scores were computed for all
spouses returning questionnaires, and the mean values
of the mates of the upper quarter subjects compared
with the lower quarter, the difference was significant
beyond the .01 level. From these respondents the two
final groups, S couples, satisfied, and U couples, less
satisfied, were selected after the arbitrary decision
to consider as satisfied those couples both of whose
scores were above the Q3 point, and as less satisfied
those below the Q1 point, on the original distribution.
Forty couples were thus designated for the U
group of couples and 41 for the S.

The happiness scale was observed to determine the 
degree of differentiation between the spouses of the 
S and U couples; the means of the two groups were 
tested for differences by the t test and found significant
beyond the .01 level. It can be concluded with
confidence that the two samples were significantly 
different with regard to degree of marital satisfaction 
as tested by the instruments of this study. 


Measurement of Self and Other Concepts 


The 41 spouses of the original members of the
study scoring above Q3 and the 40 of those scoring
below Q1 were mailed copies of the Interpersonal
Check List (LaForge, 1955; Leary, 1956) and each
was directed to mark it without the knowledge or
cooperation of the spouse. When this check list was
returned, one was mailed to the other spouse with
identical directions and request. In both samples
there was a 100% response and replies were returned
from both individuals.


The two groups were then examined for homo- 
“geneity by means of information gained through the 
26 items of descriptive personal data. Out of 29 tests 
of significance by means of chi square, one difference 
between the groups was found. This falls well 
within the limits of chance and the populations may 
be considered as drawn from the same population. 
The factors examined were: age, age at marriage, 
number of years’ education, number of years mar- 
ried, number of children, children’s ages, vocational 
status, income, number of working wives and wives’
income, religious affiliation, present residence, birthplace,
community of childhood, vocation of parents,
marital status and education of parents. The one
difference found was that the fathers of the husbands
of the S couples had a greater number of years
of education.

The Interpersonal Check List was chosen as the
self-concept instrument. It is a list of 128 descriptive
self-referent items selected to represent 16 interpersonal
variables, each made up of 8 items. These
variables are arranged in a circular profile and combined
into descriptive octants in such a way that
each octant is opposed by the octant across from it;
i.e., each intersecting line on the profile may be
thought of as a continuum, with the subject
being located according to the number of items
checked within that specific category. For the purpose
of this study the two octants making up
one continuum have been designated a scale, of
which there are four for each concept.
Scale 1 is made up of variables termed Managerial-Autocratic,
Octant AP, at one end of the continuum,
and Modest-Self-effacing, Octant HI, at the other;
Scale 2, Competitive-Exploitive, BC, vs. Docile-Dependent,
JK; Scale 3, Blunt-Aggressive, DE, vs.
Cooperative-Overconventional, LM; Scale 4, Skeptical-Distrustful,
FG, vs. Responsible-Overgenerous, NO
(see Figure 1).

Sample items from the check list are: “well
thought of,” “apologetic,” “somewhat snobbish,”
“clinging vine,” “outspoken,” “will confide in anyone,”
“often gloomy,” and “encourages others.”

The check list is scored by octants, the criterion
being the number of items checked. When one octant
is considered as one part of a continuum with an
opposing octant as the other part, a single score on
the continuum may be obtained by subtracting the
score of one octant from that of the other, thus locating
a point on the continuum. Four such points may be
determined for each individual concept marked on the
check list. Instructions to subjects marking the ICL solicit
conscious-level, verbal responses which tap
the subject’s perception and his willingness to express
it. Such an overt, conscious self-report is suited
to the phenomenological assumption
that the [...] field coincides with subjective
[...]. The self and other concepts
are defined for the purpose of this study
as all the statements an individual makes
about himself or his world as they are represented
by the check list items. Each subject marked four
concepts: self, spouse, mother, and father. On each
of these four concepts the four scale scores were
determined. It is these four scale scores which are 
used as a basis for comparing the degree of con- 
gruity of one personality with another: self with 
parent of the same sex; spouse with parent of the 
opposite sex. The term “identification” is thus de- 
fined operationally for this study as the measured 
degree of agreement between the self-concept and 
the concept of an “other.” The process of consciously 
ascribing similarities or differences in terms of the 
ICL to various family members is termed “equat- 
ing.” If the subject describes his wife in the same 
way that he describes his mother, he would be said 
to equate them. 

A discrepancy score between each personality on 
each scale was determined. The couples groups were 
divided on the basis of sex so that sex differences 
might be observed. Distributions of the discrepancy 
scores were made. A transformation of the data was 
then made in order to make the distributions more 
nearly normal and to equalize the variance. In this 
way the assumptions of the t test of significance
were met. The variable of discrepancy determined
from scores on the scales may be designated X. The
new variable of discrepancy was called Y,
and the transformation was effected: Y = log (X + 1).
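
As a brief illustration of the transformation, the sketch below applies Y = log (X + 1) to a list of discrepancy scores. The base of the logarithm is not stated in the text; base 10 is assumed here, and the function name `transform_discrepancy` is hypothetical.

```python
import math


def transform_discrepancy(x_scores):
    """Apply Y = log(X + 1) to raw discrepancy scores.

    Assumption: common (base 10) logarithms; the source does not give the base.
    Adding 1 keeps a zero discrepancy defined, since log(0) is undefined.
    """
    return [math.log10(x + 1) for x in x_scores]
```

Because the logarithm compresses large values more than small ones, a right-skewed distribution of discrepancies is pulled toward symmetry, which is the stated purpose of the step.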

Such an analysis provides a way of discovering 
whether pairs of concepts are more congruent in the 
satisfied groups than in the less satisfied, but it does 
not provide any way to discover the direction or 
trend of these differences in terms of descriptive 
personality traits. The two previous studies (Ham- 
ilton, 1929; Terman, 1938) have been concerned 
only with whether there were or were not similari- 
ties in concepts between spouse and parent of the 
opposite sex and what the relation was to marital 
satisfaction. Terman asked if spouse and parent of 
the opposite sex were “alike in general physical 
type.” Hamilton’s subjects were asked if their 
spouses and parents of the opposite sex were “simi- 
lar physically” and “similar mentally.” Because the 
ICL is conveniently organized in terms of eight re- 
lated personality variables, it is possible to define 
discrepancies not only by their magnitude but in 
terms of whether one concept has more or less of 
the qualities described by Scales 1, 2, 3, and 4.

In order to do this, the mean raw score of each 
ICL octant was determined for each sex, both S 
and U groups, on each concept: self, spouse, parent 
of the same sex, and parent of the opposite sex. 
Scale scores were then obtained thus: 


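\bar{X}_{\text{Scale 1}} = \bar{X}_{AP} - \bar{X}_{HI}
\bar{X}_{\text{Scale 2}} = \bar{X}_{BC} - \bar{X}_{JK}
\bar{X}_{\text{Scale 3}} = \bar{X}_{DE} - \bar{X}_{LM}
\bar{X}_{\text{Scale 4}} = \bar{X}_{FG} - \bar{X}_{NO}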
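
The scale-score arithmetic can be sketched as follows. The octant pairings follow the scale definitions given earlier; the names `SCALES`, `scale_scores`, and `discrepancy`, the dictionary layout, and the use of an absolute difference as the discrepancy between two concepts are illustrative assumptions rather than the authors' stated procedure.

```python
# Octant pairs making up each scale: first octant minus opposing octant.
SCALES = {1: ("AP", "HI"), 2: ("BC", "JK"), 3: ("DE", "LM"), 4: ("FG", "NO")}


def scale_scores(octants):
    """Return the four scale scores for one concept.

    octants: dict mapping octant label (e.g. "AP") to number of items checked.
    A positive score places the concept toward the first-named octant.
    """
    return {scale: octants[a] - octants[b] for scale, (a, b) in SCALES.items()}


def discrepancy(concept_a, concept_b):
    """Assumed discrepancy: absolute distance between two concepts per scale."""
    return {scale: abs(concept_a[scale] - concept_b[scale]) for scale in concept_a}
```

For example, a self-concept and a father-concept would each be reduced to four scale scores, and the four per-scale distances would then feed the group comparisons.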


These scale scores were then plotted on a continuum 
representing the two octants composing each scale. 
Comparisons of the concepts of self and parent of 
the same sex, and spouse and parent of the opposite 
sex, could thus be observed. The distance between
concept scores represents the magnitude of the discrepancy.
The raw score values themselves provide
the comparisons indicating the direction of differences
between the concepts in terms of the ICL
personality variables on the four scales (see Figures
2 and 3).


TABLE 1

SIGNIFICANT DIFFERENCES BETWEEN LESS SATISFIED AND SATISFIED GROUPS WHEN DISCREPANCY
BETWEEN SELF AND PARENT OF THE SAME SEX IS TESTED

Discrepancy Between Wife’s Self-Concept and Her Concept of Her Mother

ICL Scale   U mean   S mean   t      Level of Significance
1           [2135]   .585     2.24   .05
2           .539     .484     .83    accept
3           .545     .536     [?]    accept
4           .601     .599     .03    accept

Discrepancy Between Husband’s Self-Concept and His Concept of His Father

ICL Scale   U mean   S mean   t      Level of Significance
1           .731     .550     2.55   .01
2           .558     .424     2.20   .05
3           .609     .550     .82    accept
4           .670     .468     2.89   .01


RESULTS 


When Hypothesis I, which deals with the
phenomenon of identification of self and
parent of the same sex, was examined, it
was obvious that there was a sex difference.
Women in the satisfactorily married group
were not found to identify with their mothers
to a greater degree than those in less
satisfactory marriages. The null hypothesis
was accepted on three of the four scales.
On every scale the mean discrepancy scores
were found to be higher in the U group,
but were not statistically greater except
on Scale 1. Men in the S group, however,
were found to have lower mean discrepancy
scores on every scale and significantly lower
ones on Scales 1, 2, and 4. One may conclude
that men who are satisfied with their marriages
as indicated by this study perceive
themselves and their fathers as being similar
to a greater degree than do men who are
less satisfied.

The second null hypothesis, which held 
that there is no difference in population 
means between two groups defined as satis- 
factorily and less satisfactorily married with 
regard to the degree of congruence of the 
concepts the subject holds of his spouse and 
of his parent of the opposite sex, was rejected 
on six of the eight scales (see Table 2). Again 
a sex difference was found. Wives in the S 
couples were found significantly different on 
every scale from wives in the U couples. It 
can be concluded that women who rated high 
in marital satisfaction perceived their hus- 
bands and their fathers as similar to a greater 
extent than those who scored low on the 
marital satisfaction scale. 

When comparisons were made between the
husbands of the S and U couples, the findings
were not so conclusive. Husbands who
were more satisfied were found to have a significantly
greater congruence of concepts of
wife and mother on Scales 3 and 4. Although
the mean discrepancy scores on Scales 1 and
2 were lower in the S group, the difference
was not significant. The question remains
whether the husband’s perception of wife and
mother is associated with satisfaction in
marriage.


TABLE 2

SIGNIFICANT DIFFERENCES BETWEEN LESS SATISFIED AND SATISFIED GROUPS WHEN DISCREPANCY
BETWEEN SPOUSE AND PARENT OF THE OPPOSITE SEX IS TESTED

Discrepancy Between Wife’s Concept of Her Husband and Her Concept of Her Father

ICL Scale   U mean   S mean   t      Level of Significance
1           [...]    [...]    [...]  [...]
2           .509     [...]    [...]  [...]
3           .809     .577     [...]  .05
4           [...]    [...]    [...]  .01

Discrepancy Between Husband’s Concept of His Wife and His Concept of His Mother

ICL Scale   U mean   S mean   t      Level of Significance
1           .636     .521     1.46   accept
2           .608     .545     .89    accept
3           .665     .536     1.90   .05
4           .703     .534     2.56   .01


[Figure 2. Panels: Wife, self and mother; Husband, self and father. Each panel plots self and parent on Scales I to IV for the Satisfied Group and the Unsatisfied Group (broken line).]

FIG. 2. Directional trends of scores of self and parent of the same sex
when compared on four interpersonal check list scales.


Directional Differences 

When directional differences were observed 
as plotted on the continua of the four scales, 
the following conclusions could be drawn with 
regard to the comparisons made between the 
S and U groups when self and parent of the 
same sex was observed (see Figure 2). 

The directional differences which exist between
women and men when they compared
themselves with their parent of the same sex
were most obvious on Scales 3 and 4, on which
U women saw themselves as being less cooperative
and responsible than their mothers
but U men saw themselves as having more of
these qualities than their fathers. All S
subjects, men and women, viewed themselves
in relation to the parent of the same sex in
the same directions.

When wives compared themselves with
their mothers on Scale 1, both S and U wives
thought themselves less managerial and autocratic
than they thought their mothers; however,
U women thought themselves considerably
less so.

Scale 2 differences were not significant be- 
tween S and U women and the directional 
trend for both groups is the same with the 
subject seeing self as more docile and de- 
pendent than she sees her mother. 

Scale 3 differences, which were not great
enough to be statistically significant, are, however,
in different directions. S subjects perceive
themselves and mothers as very similar,
with selves being a bit more cooperative and
conventional. U women see mothers as being
much more cooperative than they see themselves.

FIG. 3. Directional trends of scores of spouse and parent of the opposite sex
when compared on four interpersonal check list scales.

On Scale 4 the difference in discrepancy 
between the concepts for the two groups of 
women was not significant. Each group per- 
ceived the mother similarly, but self-percep- 
tions were very different. U women thought 
themselves much less responsible and gener- 
ous than S women thought themselves or than 
U women thought their mothers. 

S and U women agree on all four scales 
more often on their perception of their moth- 
ers than they do on self-perceptions. This 
was not true for men subjects. When men
compared selves and father, S and U men
tended to see themselves as more like each
other than they saw their respective fathers
as alike.


On Scales 1 and 2 the direction is the same 
for both S and U groups of men, but the 
difference between self and father is signifi- 
cantly greater in the U group. U men saw 
their fathers as much more managerial and 
competitive than S men saw theirs. 

Scale 3 discrepancies between self and 
father were not significant; U men saw both 
selves and fathers as more blunt and aggressive
than S men saw themselves and fathers.

The discrepancy between self and father 
for the two groups of men was significant on 
Scale 4. S husbands saw themselves and 
fathers similarly and considered themselves 
as cooperative and generous. U men, on the 
other hand, saw themselves as being low in 
this quality and saw their fathers as being 
even much lower, so that the fathers’ score
is a positive score in the FG octant, indicating
skepticism and distrust.

Graphs indicating the directional trends in
which S and U subjects compared their 
spouses with their parents of the opposite 
sex indicate that on every scale S and U 
women saw spouses in relation to their fathers 
in the opposite direction (see Figure 3). 

Both S and U women saw husbands as 
moderately managerial and autocratic, but U 
wives saw their fathers as being more so. 

On Scale 2, U women perceived their fathers
in comparison with husbands as being
slightly more competitive; S women saw them
as being slightly less.

When the wives’ perceptions of S and U
husbands were compared on Scale 3, S husbands
were seen as more cooperative than
U husbands, and S fathers were seen as less
cooperative than S husbands. Not so with
U fathers, who were seen as more cooperative
than the husbands of U wives.

The directional trends on Scale 4 are like
those on Scale 3, only more so. S and U husbands
are seen much differently by their
wives, with S wives seeing their husbands as
responsible and generous and U wives seeing
their husbands in the opposite direction, as
skeptical and distrusting. The fathers of both
groups of women were perceived as
responsible and generous, but S wives thought their
fathers had less of this quality than their
husbands had, and U wives thought their
fathers had more of it than husbands.

When men compared wives and mothers,
discrepancies between these were less than the
discrepancies found among women subjects
when they compared husbands and fathers;
the directional trends were also not so
clear a pattern.

On Scale 1, S men saw their wives as a little less
managerial than their mothers. U
men perceived the two in the opposite
direction; wives were more “bossy.”

Very little difference exists in the two
[...] seen as [...] cooperative than U mothers.

Scale 4 differences go in opposite
directions in the S and U groups. Men who
were in satisfactory marriages saw both their
wives and mothers as being more responsible
and generous than U men saw their womenfolk.
S men thought their wives and mothers
were quite similar in this respect but mothers
were somewhat more generous; U men judged
their wives to be the more generous.


Discussion 


The answer to the question of why men 
who identify with fathers were found more 
often in satisfactory marriages than in un- 
satisfactory ones, but women who identify 
with mothers were not, may lie within the 
nature of modern American culture and the 
confused role of women. A modern wife’s 
conception of herself as similar to her mother 
as measured by this test is unrelated to her 
satisfaction in marriage. Doing things the 
way mother did, seeing self as a female of 
mother’s cut, apparently is not a factor 
closely associated with the satisfaction a mod- 
ern wife finds in marriage. Elements other 
than her allegiance to or expectation of the 
role of wife and mother as she saw it por- 
trayed in the previous generation are evi- 
dently of greater import in her marriage. 

The same conclusions cannot be drawn 
from the data with regard to men. The find- 
ings may reflect a greater stability of the 
husband-father role in our changing culture 
and suggest that those men who see them- 
selves fulfilling the husband role as they saw 
their fathers fulfilling it are more satisfied 
in their own marriages. Satisfaction in mar- 
riage for the male may be associated with 
his own feelings of masculinity which have 
been learned from the father; or may charac- 
terize men who, in terms of Freudian theory, 
have identified with the male or father-image. 

All subjects thought selves less managerial, 
autocratic, competitive, and exploitive than 
their parent of the same sex, but all S sub- 
jects thought this difference not so great. 
That parents are seen as being high in these 
dominant qualities may reflect an attitude 
that children, even adult children, hold in re- 
gard to parents—that they are pretty dicta- 
torial people. 

If the mother serves as model of self for 
the woman, one would conclude that the U 
wife might feel a sense of disappointment in 
herself because she saw herself as being less 
cooperative, generous, and responsible than
her mother. The S wife who perceived more
of these qualities in herself than in her 
mother may see herself as an improvement 
over the mother. 

If modesty, self-effacement, docility,
and dependence are considered as a lack of
masculine strength, men in the U group, when
comparing selves with fathers, might also con- 
sider themselves as lesser men than their 
fathers whom they saw as having fewer of 
these “weak” characteristics. The greatest 
difference between the S and U men, which 
is on Scale 4, indicates that U men saw 
themselves and their fathers as considerably 
more distrustful and skeptical than S men 
saw themselves and fathers. The fathers of U 
husbands were perceived as being even more 
skeptical than were the men themselves. 

The sex differences found with regard to
equating spouse and parent in the second
hypothesis are of even greater interest. Women
in satisfactory marriages saw their husbands
and fathers as significantly more alike on all
four scales than women in less satisfactory
marriages. The findings were not so definite
in regard to the association of satisfaction
and the congruence of the husband’s images
of wife and mother; two scales were significantly
and positively related, two were not.

These findings are not in agreement with
those of the Hamilton study (1929) and
Ferguson’s (1938) interpretation, but Terman’s
study (1938), which found happy wives
preferred husbands and fathers “not of the
opposite type,” would seem compatible with
the results of this study. This kind of sex
difference lends support to the assumption
that in satisfying marriages there is a relationship
between wife and husband that reflects
the relationship between parent and
child in that the wife sees her husband as
“father,” and he, too, sees himself as “father.”
Wives can continue comfortably in a dependent
role relating to the husband much as they
did to father; not so the husband who would
like to continue in a filial relationship with
his wife. This is a cultural expectation that
may account for the lack of conclusive evidence
that marital satisfaction is associated
with the husband’s equating of wife and
mother.

Observation of the directional trends when
wives compared husbands and fathers would
indicate that U wives thought their husbands 
more modest, self-effacing, docile, and de- 
pendent than their fathers (Scales 1 and 2). 
Husbands seen in this way might be consid- 
ered not so masculine as fathers. In keeping 
with this generally less favorable picture of 
their spouses, U wives thought husbands 
more blunt and distrustful than fathers. S 
wives equated husbands and fathers rather 
closely on Scales 1, 2, and 3, but thought 
husbands more responsible and generous than 
fathers on Scale 4. If women look to their 
fathers as models for husbands, the U wife 
might well feel disillusioned in her husband, 
while the S wife, who saw the two similarly 
except for increased generosity in her hus- 
band, might be especially well pleased with 
the man she had married. 

The men saw their wives and mothers sta- 
tistically more discrepant only on Scales 3 
and 4. On Scale 3 the direction in which the 
S and U husbands viewed their wives and 
mothers was similar; both thought wives a 
bit more cooperative and conventional. The 
fact that U men saw both wives and mothers 
as less responsible and generous than S men 
saw theirs may indicate that a generally less 
favorable impression of the female is held by 
men in less satisfactory marriages; mothers 
of men in the U group were seen as even less
generous than their wives. It may also indi- 
cate these U men actually have their lives 
peopled with less “helpful, big-hearted, kind, 
and reassuring” women. 

Although the premises of this study are 
based in perceptual differences and the ICL 
responses cannot claim to represent objective 
reality, one cannot escape the conjecture that 
perhaps not only the processes of identifica- 
tion and equation are associated with marital 
satisfaction, but that, indeed, the S and U 
subjects have real personality differences. In
observing Figures 2 and 3 one readily notes 
that the differences between all S and U sub- 
jects are consistently greatest on Scale 4 in 
every comparison. The U subjects, male and 
female, place themselves, spouses, and par- 
ents farther into the “Skeptical-Distrustful” 
octant than do the S subjects. The perceptions
of selves and others held by U subjects
indicate more of the quality measured by
FG items such as: “often gloomy, bitter,
resentful, complaining, jealous, stubborn.”
One may speculate that persons dissatisfied
in marriage may actually be more skeptical
and distrustful, may have married skeptical
and distrustful spouses, and have been reared
by skeptical and distrustful parents.

It also may be conjectured that the U 
women tended to see children generally as 
less socially desirable persons than parents. 
This conclusion is drawn from the similarity 
with which U women saw themselves in rela- 
tion to their mothers and husbands in relation 
to their fathers. In each case sons and daugh- 
ters were seen as generally less “forceful, 
dominating, confident, independent,” and less 
“cooperative, friendly, warm, tender, consid- 
erate,” than parents. 


SUMMARY 


Two hypotheses dealing with the relation- 
ship of parent image to marital satisfaction 
were investigated. Two groups of married 
couples, N = 41 and 40, defined as “satisfactorily”
and “less satisfactorily” married,
indicated their perception of self, spouse, par- 
ent of the same sex, and parent of the oppo- 
site sex on the Interpersonal Check List. 
Discrepancy scores were determined between 
the concepts (a) of self and parent of the 
same sex and (b) of spouse and parent of the 
opposite sex and were tested for significance 
of differences by means of the t test. 

Directional trends in the differences between
concepts were graphed on continua
representing each of the four ICL scales and
were observed in order to discover in what
ways the concepts held by the S and U
groups differed in terms of the eight personality
variables represented by the scales.

Men in satisfactory marriages were found
to identify with their fathers to a significantly
greater degree on three out of four of the
scales. On each of the four scales unsatisfactorily
married men saw their fathers as
being more dominant and less loving than
[...]. Such differences were not found
in the two women’s groups; on three of
the scales the discrepancies between self and
mother were not found statistically different
for S and U women. When directional trends
were observed, U women were found to see
themselves as less cooperative and responsible
than their mothers while S women thought
themselves to have slightly more of these
qualities than their mothers.

When the S and U groups were compared 
regarding the degree of congruence the sub- 
ject held of his spouse and of his parent of 
the opposite sex, a significant relationship was 
found on all four scales of the women’s 
scores, but was not clear with regard to men. 
Women in satisfactory marriages perceived 
their husbands as similar to their fathers to 
a greater degree than did less satisfied wives. 

When the nature of these perceptions was 
observed by individual ICL scales, dissatisfied 
women were found to perceive their husbands 
as less dominant and competitive and more 
blunt, aggressive, skeptical, and distrustful 
than they saw their fathers. The satisfied 
wives who found the opposite on each of the 
four scales might well perceive their husbands 
as being more desirable men than their 
fathers. 

Men in satisfactory marriages as con- 
trasted with those in less satisfactory mar- 
riages perceived their wives as similar to 
mothers to a significantly greater degree on 
Scales 3 and 4; on the other two scales there 
appeared to be no relationship. For this reason,
the findings are considered inconclusive.
An analysis of the nature of these differences
indicated no difference in direction on Scale
3; however, on Scale 4, U men saw spouses
as less generous than mothers, but S men saw
them as slightly more. U husbands saw both
their mothers and their wives as being reliably
more skeptical and distrustful than did
husbands in the satisfied group. 

Congruence in perception of self and par- 
ent of the same sex was found associated 
with marital satisfaction for men. Equation 
of spouse and parent of the opposite sex was 
significantly related to marital satisfaction for 
women; the findings for men were not conclusive.


NEED SATISFACTION AND EDWARDS PPS SCORES
IN MARRIED COUPLES¹


IRWIN KATZ, SAM GLUCKSBERG, AND ROBERT KRAUSS²
New York University 


What relationships exist between the per- 
sonalities of marriage partners? Winch (1958) 
found in a study of 25 married couples that 
the needs of spouses tended to be dissimilar, 
which he took as evidence that mates select 
one another on the basis of complementarity. 
His results are in conflict with the data of 
several earlier studies, in which low positive
correlations between spouses’ traits were
sometimes obtained, but never negative correlations.
For example, Burgess and Wallin
(1953) gave 31 items of the Thurstone Neurotic
Inventory to 1,000 engaged couples and
found on 14 items significantly greater similarity
than resulted from random matchings,
while no items yielded reliably negative correlations.

Winch (1958) attributed the disagreement
between his own data and the earlier results
to the relative superficiality of the inventories
that were used by previous investigators,
as compared with his own reliance on
depth interviews and the TAT. However,
Winch’s scoring method contained a contaminating
feature that casts doubt on the validity
of his findings. Each judge rated all of the
characteristics of a given subject, so that
scores on different traits cannot be regarded
as independent of one another. Of 71 significantly
negative correlations between like
traits and significantly positive correlations
between unlike traits (388 correlations were
run in all) an indeterminate number could
actually have been spurious reflections of the
raters’ implicit theories of trait organization.

¹ This investigation was supported in part by a research
grant from the National Institute of Mental Health,
Public Health Service.

² [...] the use of a shortened version [...] there is evidence to indicate
[...] to selected items isolated [...] within the context, the results
must be considered applicable [...] form of the EPPS.

To get new evidence on husband-wife personality
relationships we employed a modified
form of the Edwards Personal Preference 
Schedule (EPPS). The Edwards test is not 
highly susceptible to response faking of the 
social desirability type (Edwards, 1957) and 
provides reasonably independent scores on 
different motivational variables, as indicated 
by generally low intercorrelations (Edwards, 
1953). In addition, self-ratings and predicted 
self-ratings of need satisfaction in the mar- 
riage were obtained, so that degree of satis- 
faction could be compared with the similar- 
ity-dissimilarity of mates’ scores on the Ed- 
wards variables. 


METHOD 
Subjects 


Fifty-six volunteer couples were obtained through 
the cooperation of psychology graduate students at 
New York University. The mean length of time mar- 
ried was 5 years, with a range of 6 months to 22 
years. The mean number of children was .7, range 0 
to 3. For husbands the mean age was 29.6 years, 
range 23 to 52; and for wives the mean age was 28 
years, range 20 to 46. Every person was white, with 
a background of at least 12 years of schooling. All 
couples were middle class, as determined by the husband’s
occupation or educational status.


Testing Procedures 


All Ss filled out a shortened version of the EPPS 
in which the number of personality variables had 
been reduced from 15 to 11 for reasons of economy. 
By eliminating Order, Intraception, Change, and 
Heterosexuality, we reduced the total number of 
items from 225 to 110. Ss were given a second ques- 
tionnaire which required ratings of the extent to 
which the spouse tended to satisfy or thwart each of 
the 11 needs that were measured by the EPPS. Needs
were defined by the statements associated with them
in the EPPS. Under each definition was a seven- 
point graphic rating scale on which Ss were to re- 
spond “independently of the importance of the need 
in you; to the extent that you have this need at all, 
does your spouse’s behavior tend to satisfy it or 
thwart it or not affect it one way or the other.” Ss 
were given another copy of the rating sheet with in- 
structions to predict the spouse’s responses. 


RESULTS AND DISCUSSION 


Comparison of Mates’ EPPS Scores 


Winch found that husband-wife similarity 
on need variables was significantly lower than 
that of randomly matched men and women, 
as measured by Q type correlations. We com- 
puted rho correlations between EPPS scores 
on 11 variables for all couples. The mean cor- 
relation was .24, with a SD of .37. For 25 
randomly matched men and women the mean 
value of rho was .26, and the SD was .25. 
Thus, similarity between mates’ need profiles 
was close to random expectation. 
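
The profile comparison described above can be sketched as a rank correlation computed within each couple and then averaged. This is an illustration only: the function names are hypothetical, ties are given average ranks, and rho is computed as the product-moment correlation of the ranks, which may differ in detail from the computing formula the authors used.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rho between two need profiles (e.g. 11 EPPS scores each)."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined for a constant profile")
    return cov / (var_x * var_y) ** 0.5


def mean_rho(pairs):
    """Mean rho over (husband_profile, wife_profile) pairs."""
    rhos = [spearman_rho(h, w) for h, w in pairs]
    return sum(rhos) / len(rhos)
```

Running `mean_rho` once on actual couples and once on randomly re-paired men and women gives the two means that the comparison rests on.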

Husbands and wives were also compared on 
scores on particular needs. Winch had ob- 
tained five times as many complementary as 
noncomplementary relationships. We selected 
21 combinations of EPPS variables which 
seemed relevant to the hypothesis of com- 
plementarity, as follows: 


Husband        Wife           Predicted relationship
Abasement      Abasement      -
Abasement      Dominance      +
Abasement      Aggression     +
Achievement    Achievement    -
Affiliation    Affiliation    -
Aggression     Aggression     -
Aggression     Abasement      +
Autonomy       Autonomy       -
Autonomy       Nurturance     -
Deference      Dominance      +
Deference      Deference      -
Dominance      Deference      +
Dominance      Abasement      +
Dominance      Dominance      -
Endurance      Endurance      -
Exhibition     Exhibition     -
Nurturance     Autonomy       -
Nurturance     Nurturance     -
Nurturance     Succorance     +
Succorance     Succorance     -
Succorance     Nurturance     +


Pluses and minuses show the type of relationship
indicated by the complementarity hypothesis.
Many of these pairings had been
tested by Winch. Since we were interested
only in ascertaining whether reliably positive
or negative relationships existed, a chi square
test of association between high-low EPPS
scores for husbands and high-low EPPS
scores for wives was performed for each need
pair. The dichotomies were based upon median
splits of the distributions of need scores
in each sex. Four pairings yielded p values
< .05 and one pairing yielded a p < .10.


Four pairs of like needs were positively related:


Abasement-Abasement (χ² = 5.42, p < .02)
Affiliation-Affiliation (χ² = 2.81, p < .10)
Autonomy-Autonomy (χ² = ….07, p < .01)
Nurturance-Nurturance (χ² = 4.54, p < .05)


One pair of unlike needs was negatively related:


Husb. Succorance-Wife Nurturance (χ² = 3.87, p < .05)


Thus, in every instance the direction of asso- 
ciation was opposite to that predicted by 
Winch’s complementarity hypothesis. 


Total Need Satisfaction and Relationships be- 
tween Spouses’ EPPS Scores 


For each S, self-ratings of satisfaction on all 11 needs were
added together to yield a total satisfaction score, after which
separate distributions of scores for men and for women were set
up. Then High Satisfaction and Low Satisfaction groups were
established by selecting from each distribution the 20 highest
and the 20 lowest individuals. Association between High-Low
Satisfaction and similarity-dissimilarity of spouses’ EPPS
scores was tested via chi square for men and women separately.

In women three notable relationships were found between total
satisfaction and degree of resemblance of own and spouse’s EPPS
scores. High Satisfaction wives, as compared with Low
Satisfaction wives, showed a trend toward less similarity to
their husbands on Aggression (χ² = 2.84, p < .10). On Nurturance
and Succorance respectively, High Satisfaction wives were
significantly more similar to their husbands than were Low
Satisfaction wives (in both cases χ² = 4.86, p < .05).




Thus, for wives complementarity was not gen- 
erally related directly to total satisfaction. 

Among husbands, three significant findings 
and one trend revealed in every case greater 
complementarity in the High Satisfaction 
group than in the Low Satisfaction group. In 
the following, interspousal similarity was less 
for High Satisfaction men in the first three 
cases, and greater in the last one. 


Achievement-Achievement (χ² = 5.65, p < .02)
Succorance-Succorance (χ² = 4.26, p < .05)
Dominance-Dominance (χ² = 3.28, p < .10)
Husb. Succorance-Wife Nurturance (χ² = 4.86, p < .05)

Summarizing, it would appear that the na- 
ture of husband—wife need relationships, as 
measured by the EPPS, is different for vari- 
ous need pairings, for different degrees of to- 
tal need satisfaction, and for the sexes. 


Total Need Satisfaction and Individual EPPS
Scores

Chi square tests were used to ascertain 
whether there was any relation between total 
need satisfaction (20 highs vs. 20 lows) and 
a person’s EPPS scores (high vs. low by me- 
dian split). Among wives two significant find- 
ings indicated positive relationships between 
total satisfaction and two EPPS variables: 


Nurturance (χ² = 3.94, p < .05)
Succorance (χ² = 6.46, p < .02)


Total satisfaction of wives was positively related to two EPPS
scores of their husbands (one trend and one significant finding):


Nurturance (χ² = 2.80, p < .10)
Achievement (χ² = 6.46, p < .02)


Also, there were trends toward negative association between
wives’ total satisfaction and two EPPS scores of their husbands:


Abasement (χ² = 2.86, p < .10)
Autonomy (χ² = 3.50, p < .10)


Comparing husbands’ total satisfaction with their own EPPS
scores revealed two negative relationships, one a trend and the
other significant:


Affiliation (χ² = …, p < .01)
Dominance (χ² = …)

Total satisfaction of husbands showed four
significant relationships with EPPS scores of 
their wives. Two were positive: 


Succorance (χ² = 4.91, p < .05)
Nurturance (χ² = 6.46, p < .02)


Two were negative, as was a trend: 


Autonomy (χ² = 4.91, p < .05)
Dominance (χ² = 4.91, p < .05)
Aggression (χ² = 3.28, p < .10)


It can be seen that for both men and 
women, total need satisfaction in marriage 
was related to some of one’s own and one’s 
spouse’s EPPS scores. Interestingly, satisfac- 
tion of both wives and husbands was posi- 
tively related to wives’ scores on Nurturance 
and Succorance. 


Additional Comparisons 


The product-moment correlation between 
husbands’ and wives’ total satisfaction scores 
was .55, indicating a fair degree of agree- 
ment. Men’s predictions of their spouses’ satis- 
faction ratings were significantly lower than 
their own satisfaction ratings (mean differ- 
ence = 2.97, CR = 3.09, p < .01), and sig- 
nificantly lower than their spouses’ predic- 
tions for them (mean difference = 2.82, CR 
= 2.24, p < .02). The mean self-ratings on 
satisfaction of husbands and of wives were 
virtually the same, as were women’s predic- 
tions of their husbands’ satisfaction ratings 
and their own ratings. 

Chi square tests of association (high-low 
vs. high-low) yielded no p values below .20 
when length of marriage was compared with 
each of the following: interspousal need simi- 
larity, interspousal similarity of total satis- 
faction scores, total satisfaction scores of 
either men or women, accuracy of predicted 
satisfaction ratings by either men or women. 
Similarly, there was no tendency toward a re- 
lationship between total satisfaction of either 
sex and interspousal need similarity. 


SUMMARY 


Fifty-six married couples were administered a modified version
of the EPPS and a questionnaire for measuring degree of need
gratification afforded by the spouse. One aim of the study was
to examine the hypothesis of interspousal complementarity of
needs by comparing the EPPS scores of mates. Also, relations
between total satisfaction and EPPS scores were explored. The
findings were:

Interspousal correlations on 11 EPPS vari- 
ables were close to the random level. 

Twenty-one husband-wife pairings of like 
and unlike EPPS variables produced five re- 
lationships contradictory to the complemen- 
tarity hypothesis, and none supportive of it. 

Degree of total satisfaction of wives was 
not consistently related to interspousal need 
complementarity. But total satisfaction of 
husbands was positively associated with in- 
terspousal complementarity in four need 
pairings. 

Thus, the nature of husband—wife need re- 
lationships, as measured by the EPPS, was 
different for various need pairings, for differ- 
ent degrees of total satisfaction, and for the 
sexes. 

Among wives, total satisfaction was posi- 
tively related to own EPPS scores on Nur- 
turance and Succorance, and was related to 
certain EPPS scores of their spouses. Husbands’
total satisfaction tended to be related
to some of their own EPPS scores, and to 
some of their spouses’ scores—including posi- 
tive relationships with spouses’ Succorance 
and Nurturance scores. 

Husbands’ predictions of their spouses’ to- 
tal satisfaction scores were lower than their 
own ratings and lower than their spouses’ pre- 
dictions for them. The mean self-ratings of 
husbands and wives were virtually the same. 

There were no relationships between length 
of marriage and other variables or between 
total satisfaction and interspousal need simi- 
larity. 

OBJECTIVE MEASURES OF PERCEPTION IN
SCHIZOPHRENICS AND NORMALS* 


RUTH COOPER 
Teachers College, Columbia University 


Many views have been proposed concerning 
the nature of impairment in schizophrenia. 
Some theorists postulate disturbances in cog- 
nitive functioning as one of the primary fac- 
tors in the schizophrenic disorder (Angyal, 
1946; Bleuler, 1950; Goldstein, 1943). Others 
suggest that schizophrenia is primarily an af- 
fective disorder (Freud, 1947; Sullivan, 1953). 
Despite differences in orientation and empha- 
sis, most theories agree that impaired reality 
testing is a central phenomenon in schizo- 
phrenic illness. 

While it may be true that schizophrenic 
impairment involves cognitive difficulties, or 
emotional conflicts, or both, another possible 
hypothesis is that schizophrenia involves, more 
simply and directly, impaired perceptual func- 
tioning. 

Klein (1951) averred that perception was 
perhaps the most important area of investi- 
gation of individual organization because it 
was the individual’s point of contact with re- 
ality. Since poor reality testing is considered 
a primary symptom of schizophrenia, and per- 
ception is intimately related to how an indi- 
vidual deals with his environment, one would 
expect less accurate perception by people 
whose reality testing mechanisms are inade- 
quate. 

Recent research indicates … interest in the use of
psychophysical … as means of differentiating … groups. Studies
on autokinetic effect (Voth, 1947), size constancy (Lovinger,
1956; Rausch, 1952-53), and judgments of weights (Salzinger,
1957) support the notion that objective, psychophysical measures
may well provide useful diagnostic tools (Stevens, 1951). Their
particular attractiveness lies in the premise that effects of
more complex cognitive and emotional factors are minimized when
simple, objective techniques are used.
The present study was designed to investi- 
gate differences of response between schizo- 
phrenics and normals to three visual stimuli 
like those frequently used in psychophysical 
experiments. The general hypotheses were 
that schizophrenics are less accurate and more 
variable than normals in judging objective 
visual stimuli. Moreover, since there are vary- 
ing degrees of schizophrenic disturbance, from 
mild to severe, it was further hypothesized
that degree of perceptual accuracy and vari- 
ability are related to degree of disturbance. 


METHOD 
Subjects 


Schizophrenic group. Sixty-six (66) male veterans were selected
from psychiatric wards of Manhattan and Northport Veterans
Administration Hospitals. They had been diagnosed schizophrenic,
were without organic brain pathology, and had not received shock
therapy within a month of testing. Of this group, 46 Ss had been
hospitalized for less, and 20 for more, than 24 months. Included
were paranoid, undifferentiated, hebephrenic, catatonic, and
simple schizophrenics. Only 3 had been diagnosed as …
schizophrenic. The group was representative of present day
psychiatric veteran patients who are, for the most part,
chronic, process-type schizophrenics.

Normal group. Thirty-six (36) male veterans were selected from
general medical and surgical wards of Manhattan Veterans
Hospital. Patients with past psychiatric referral or
neurological involvement were excluded. The sample included 26
ambulatory, … postsurgery patients, and 10 long-term
tuberculosis patients, some confined to bed.

The groups were comparable with respect to age 
and educational level. Visual acuity was within nor- 
mal limits; those Ss who ordinarily wore glasses 
used them during the testing situation. 


Instruments and Administration 


S was informed that a research project was being 
conducted at the hospital to determine some of the 
factors that influence perception. He was told that 
his active cooperation would be appreciated, despite 
the fact that the findings might not be of immediate 
practical benefit to him. The purpose of such in- 
structions was to minimize any anxiety S might have 
felt if he thought test results might be used “against” 
him in any way. This approach seemed effective in 
virtually every case. Ss seemed motivated and inter- 
ested throughout the testing situation. 

Prior to actual testing, S was required to repro- 
duce an extremely simple block design, the model for 
which E constructed. The purpose was to determine
whether S could follow instructions. Only one S had 
to be excluded for inability to follow instructions. 

All three tasks were administered during one ses- 
sion, with Galton Bar judgments the first task for all 
Ss. To control for possible effects of fatigue or wan- 
ing interest, order on the succeeding tasks was varied; 
half the Ss performed Sander Parallelogram judg- 
ments next; the other half, the Kunnapas Squares 
judgments. 

Galton Bar. An adaptation of the Galton Bar was 
constructed. It consisted of a wooden board (20″
wide × 18″ tall) onto which were mounted two
wooden slide rules from which all markings had been
removed on the side facing the Ss. The entire instrument,
except for the areas behind the moveable
part of the slide rule, was painted black and kept
scrupulously free of any markings which might pro- 
vide clues or anchoring points for the Ss. The back 
of the slide rule (the moveable part) was marked off 
on a millimeter scale, and ranged from 0 to 200 mm. 
Either side (right or left) could and did serve as the 
constant or the variable. The length of the constant 
was kept at 150 mm. 

E instructed S to indicate by saying “Stop” when the variable
(which E manipulated at all times) appeared equal to the
constant. A total of 16 trials was given: 4 with the constant on
the right; 4 with constant on the left; 4 more with constant on
the right, and 4 more with constant on the left. Initial
variable settings were either much longer or much shorter than
the constant. Order for the size settings was
longer-shorter-shorter-longer, shorter-longer-longer-shorter.

Sander Parallelogram. A … model of the Sander Parallelogram was
constructed of stiff gray cardboard … mounted on a wooden frame.
Except for the diagonal, which had to be moveable, the figure
was drawn on the cardboard in black India ink. The width of all
lines, including the thin strip of metal which served as the
moveable diagonal, was 4 mm. The length of the fixed diagonal
was 150 mm. The variable covered a range from 71 mm. to 156 mm.
The back of the parallelogram was marked off by means of Polar
Grid paper, from which readings were easily converted into
millimeters.
Administration was the same as for the Galton Bar, except that
there were 8 trials instead of 16, and the
constant was always on the same side because of the
nature of the parallelogram. As with the Galton Bar, 
S was instructed to indicate by saying “Stop” when 
the variable appeared equal in length to the constant. 

Kunnapas Squares. Two sets of squares, adapted 
from a figure-ground instrument devised by Kun- 
napas (1955), were cut from white oaktag. Each set 
contained 11 cards; the first comprised squares whose 
side (like that of the constant) measured 9 cm.; the 
side of the second set measured 21 cm. A black line 
was inscribed in the center of each square. The 
range of line lengths was 11 mm., i.e., from 45 mm. 
to 55 mm. All lines were 4 mm. wide, and the stand- 
ard was 50 mm. long. 

The squares were presented on a large, light blue 
background and were always 13 cm. apart, lying flat 
on the blue background. The constant was kept on 
the right side for one ascending (45 mm. to 55 mm.) 
and one descending series of presentations, and on the 
left for the second ascending and descending series, 
This procedure was followed for both the 9 and 21 
cm. sets, providing a total of 44 judgments for each 
set. 

S was instructed to judge whether the variable line 
was “longer” or “shorter” than the constant. Since 
each set contained one square whose inscribed line 
was exactly equal to the constant, E instructed S to 
give his best guess whenever he experienced real diffi- 
culty in making a particular judgment. S was not in- 
formed that any of the lines were, in fact, equal to 
the constant. 

Montrose Rating Scale. Each psychiatric patient 
was rated for severity of illness by the ward psy- 
chiatrist. If S was not well known to the psychiatrist, 
the ward psychologist or nurse was asked to do the 
rating. The Montrose Rating Scale consists of seven 
five-point rating scales intended to measure S’s func- 
tioning in seven areas—reality testing, emotionality, 
communication, human relationships, aspirations, 
manifest overt behavior, and intellectual functioning. 
A split-half reliability check computed in the present 
study yielded an r of .87. 


Treatment of Data 


Judgments on each instrument were recorded with- 
out regard for sign, in terms of distance from objec- 
tive equality. Thus, each judgment was treated as a 
difference score (i.e., an error), and average error
and variance were computed for each S and for each 
group. 

On the Kunnapas Squares task, errors were weighted according to
distance from equality. Thus, judging a 45 mm. line as “longer”
than the 50 mm. standard was given a weight of 5, whereas
judging a 49 mm. line as “longer” carried a weight of 1. The
measure of variability was the number of reversals from
“shorter” to “longer” to “shorter.”
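
The scoring rules described under Treatment of Data can be sketched as follows. This is an illustration under stated assumptions rather than the author's procedure: the variance is taken as a population variance, a judgment on the line equal to the standard carries no weight, and a reversal is counted as any change of response between consecutive judgments in a series. All function names are hypothetical.

```python
from statistics import mean, pvariance

def error_scores(settings_mm, standard_mm):
    """Unsigned distance of each setting from objective equality, in mm."""
    return [abs(s - standard_mm) for s in settings_mm]

def summarize(errors):
    """Average error and variance for one S (population variance is assumed)."""
    return mean(errors), pvariance(errors)

def weighted_error(judgments, standard_mm=50):
    """Sum of weights for wrong 'longer'/'shorter' judgments.

    judgments is a list of (line_mm, response) tuples; a wrong judgment is
    weighted by the distance of the line from the standard.
    """
    total = 0
    for line_mm, response in judgments:
        wrong = (response == "longer" and line_mm < standard_mm) or (
            response == "shorter" and line_mm > standard_mm
        )
        if wrong:
            total += abs(line_mm - standard_mm)
    return total

def count_reversals(responses):
    """Number of changes of response between consecutive judgments."""
    return sum(1 for prev, cur in zip(responses, responses[1:]) if prev != cur)
```

For example, calling a 45 mm. line and a 49 mm. line "longer" while judging a 55 mm. line correctly yields a weighted error of 6 under these rules.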

Additional analysis. Because of the apparent simi- 
larity of the perceptual tests, the data were ana- 
lyzed to discover whether any relationship existed 
between tests with respect to accuracy and to vari- 
ability of performance. 

1. The t test was used to test for significance of differences
of scores between Normal and Schizophrenic groups.
2. The F test was computed for significance of differences in
variability between Normal and Schizophrenic groups.
3. A product moment correlation was computed between scores on
each of the three tests for the Normal group.
4. Phi coefficients were computed between:
(a) severity of illness and scores on each of the tests;
(b) severity of illness and variability on each of the tests;
(c) variability on each test for each group of Ss;
(d) scores on each test for the Schizophrenic group.
5. Split-half reliabilities were computed for each instrument
and the Spearman-Brown correction was applied.
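
The Spearman-Brown step in item 5 is the standard formula for stepping a split-half correlation up to full test length, r' = 2r / (1 + r). The sketch below is illustrative only; it assumes an odd-even split of trials, which the text does not specify, and the function names are hypothetical.

```python
def pearson_r(x, y):
    """Product moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

def spearman_brown(r_half):
    """Estimated full-length reliability from a split-half correlation."""
    return 2 * r_half / (1 + r_half)

def split_half_reliability(trials_by_subject):
    """Correlate odd-trial and even-trial totals, then apply the correction.

    trials_by_subject holds one list of trial error scores per S.
    """
    odd = [sum(trials[0::2]) for trials in trials_by_subject]
    even = [sum(trials[1::2]) for trials in trials_by_subject]
    r = pearson_r(odd, even)
    return r, spearman_brown(r)
```

As a check on the formula, an uncorrected r of .60 steps up to .75, and an r of .94 steps up to .97.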

RESULTS AND CONCLUSIONS


Results are presented for each hypothesis 
after a brief statement of the hypotheses. 


Accuracy of Judgments 

The first hypothesis stated that normals are more accurate than
schizophrenics in judging objective visual stimuli. Differences
between group means were tested with the t test and found
significant at or beyond the .005 level on each instrument. The
first hypothesis, therefore, was supported. Table 1 presents
the results.


TABLE 1

MEAN ERROR SCORES OF THE SCHIZOPHRENIC GROUP AND THE NORMAL
GROUP IN JUDGMENTS ON THE GALTON BAR, THE SANDER PARALLELOGRAM,
AND THE 9 AND 21 CM. KUNNAPAS SQUARES

                                    Group
Instrument                 Schizophrenic  Normal    t      p
Galton Bar                      8.78       4.43    3.50   .005
Sander Parallelogram           24.07      18.63    5.33   .0005
Kunnapas Squares, 9 cm.        44.26      441      178    .0005
Kunnapas Squares, 21 cm.       15.33       4.14    6.99   .0005


TABLE 2

MEAN VARIABILITY SCORES OF THE SCHIZOPHRENIC AND THE NORMAL
GROUP IN JUDGMENTS ON THE GALTON BAR, THE SANDER PARALLELOGRAM,
AND THE 9 AND 21 CM. KUNNAPAS SQUARES

                                               Kunnapas Squares
Group           Galton Bar  Sander Parallelogram   9 cm.   21 cm.
Schizophrenic     48.50           30.89            10.17    8.95
Normal            12.44           12.62             2.45    …

Note. All differences significant at or beyond .01 level.


Variability of Judgments 


The second hypothesis, that normals are 
less variable than schizophrenics when judg- 
ing objective visual stimuli, was also sup- 
ported. The F test yielded results significant 
at or beyond the .01 level. Results are sum- 
marized in Table 2. 


Relationship of Severity of Illness to Accu- 
racy and Variability of Judgments 


The data did not support the hypotheses of 
relationship between severity of illness and 
accuracy and variability of judgment. Corre- 
lations with psychiatrists’ ratings of severity 
of illness were essentially zero. 


TABLE 3

INTERCORRELATIONS OF THE SCHIZOPHRENIC Ss’ SCORES ON THE GALTON
BAR, THE SANDER PARALLELOGRAM, AND THE 9 AND 21 CM. KUNNAPAS
SQUARES

                                 Sander       Kunnapas Squares
                              Parallelogram    9 cm.   21 cm.
Galton Bar                        .36*          …       .21
Sander Parallelogram                            .21     .36*
Kunnapas Squares, 9 cm.                                 .52*

* Significant at or beyond .01 level.

TABLE 4

INTERCORRELATIONS OF THE NORMAL Ss’ SCORES ON THE GALTON BAR,
THE SANDER PARALLELOGRAM, AND THE 9 AND 21 CM. KUNNAPAS SQUARES

                                 Sander       Kunnapas Squares
                              Parallelogram    9 cm.   21 cm.
Galton Bar                        .34*         -.22    -.04
Sander Parallelogram                            .06    -.28
Kunnapas Squares, 9 cm.                                -.02

* Significant at .05 level.


TABLE 5

INTERCORRELATIONS OF THE SCHIZOPHRENIC Ss’ VARIABILITY ON THE
GALTON BAR, THE SANDER PARALLELOGRAM, AND THE 9 AND 21 CM.
KUNNAPAS SQUARES

                                 Sander       Kunnapas Squares
                              Parallelogram    9 cm.   21 cm.
Galton Bar                        .21           .23     .00
Sander Parallelogram                            .00     .03
Kunnapas Squares, 9 cm.                                 …*

* Significant at .001 level.


TABLE 6

INTERCORRELATIONS OF THE NORMAL Ss’ VARIABILITY ON THE GALTON
BAR, THE SANDER PARALLELOGRAM, AND THE 9 AND 21 CM. KUNNAPAS
SQUARES

                                 Sander       Kunnapas Squares
                              Parallelogram    9 cm.   21 cm.
Galton Bar                        .91           .65     .89
Sander Parallelogram                            .80     .89
Kunnapas Squares, 9 cm.                                 .95

Note. All correlations significant at .0001 level.


Relationship Between Performance on the 
Tests 


To test for relationship between performance with respect to
accuracy, Pearson’s r was computed for normals, and phi for
schizophrenics.

Table 3 demonstrates that correlations, significant at or beyond
the .01 level, were obtained between scores on the Galton Bar
and Sander Parallelogram, Sander Parallelogram and 21 cm.
Kunnapas Squares, and 9 and 21 cm. Kunnapas Squares by the
schizophrenics. No other correlations achieved significance.
The Sander Parallelogram accounts for two correlations in
accuracy which, though low, are significant.

For the normals, as Table 4 shows, a sig- 
nificant correlation was found in accuracy of 
responses between the Galton Bar and Sander 


Parallelogram. No other correlations achieved 
significance. 

Tables 5 and 6 indicate that with respect to 
variability no significant correlations were ob- 
tained for the schizophrenics except between 
the two subtests of the Kunnapas Squares. 
The normals, on the other hand, achieved cor- 
relations significant at the .0001 level between 
all of the tests. 


DISCUSSION 
Differences between Groups


The hypotheses that schizophrenics are less 
accurate and more variable than normals in 
judging objective visual stimuli were sup- 
ported. Since both cognitive complexity and 
emotional factors were kept to a minimum,
one might speculate that schizophrenia is 
neither merely a cognitive nor an emotional 
disturbance, but that it also involves percep- 
tual impairment which is not readily ac- 
counted for by either cognitive or emotional 
theories of schizophrenia. It is particularly 
significant that such impairment can be dem- 
onstrated in simple, objective, readily quanti- 
fiable tasks. 


Relationship of Severity of Illness to Accu- 
racy and Variability of Judgment 


The hypotheses that there are relationships between severity of
illness and accuracy and variability of judgment were not
supported. Several possibilities present themselves to account
for the results. One is that many raters were involved in the
study, and it was not possible to check interjudge reliability.
An alternative explanation is the possibility that in
schizophrenia perceptual impairment has an all-or-none quality.
One further possibility is the fact that since really regressed,
back ward schizophrenics were excluded from the study, those Ss
who were included did, in fact, vary very little in degree of
schizophrenic illness.


TABLE 7

RELIABILITY OF MEASURES FOR THE SCHIZOPHRENIC AND NORMAL GROUPS

                                Schizophrenic           Normal
                                    Spearman-Brown         Spearman-Brown
Test                           r      correction      r      correction
Montrose Rating Scale         .77       .87*
Galton Bar                    .48       .65*         .30       .46*
Sander Parallelogram          .94       .97*         .96       .98*
Kunnapas Squares, 9 cm.       .60       .75*         .05       .10
Kunnapas Squares, 21 cm.      .61       .76*         .71       .93*

* Significant at or beyond .001 level.


Intercorrelation among Measures 


Small but significant correlations in accu- 
racy scores were obtained for schizophrenics 
between the Galton Bar and Sander Paral- 
lelogram, the Sander Parallelogram and 21 cm. 
Kunnapas Squares, and the two subtests of 
the Kunnapas Squares. With respect to vari- 
ability, not only are schizophrenics more vari- 
able than normals, but they also vary more 
in their variability. 

For the normals, the only significant cor- 
relation in accuracy scores was between the 
Galton Bar and Sander Parallelogram. Probably the low
reliability of the 9 cm. and the limited range of scores of the
21 cm. Kunnapas Squares account for this finding.


Implications for Future Research


The instruments used in the present research have demonstrated a
potential for differentiating between normals and
schizophrenics. The relatively small overlap between the groups
in distribution of scores supports the notion that the Galton
Bar and Sander Parallelogram, in particular, may prove to be
worthwhile additions to test batteries designed to differentiate
between normals and schizophrenics.

The utility of simple, objective, psychophysical perceptual
tasks in differentiating between clinical groups has been
demonstrated. It is not unreasonable to expect that such tests
may also be of considerable value in differentiating among
personality types. They may, as a result, contribute some
fruitful hypotheses for investigation to personality and
perception theorists.


SUMMARY 


The present study investigated differences in accuracy and
variability of perceptual judgments between schizophrenics and
normals, using simple, objective, visual stimuli like those used
in psychophysical experiments. Significant differences both in
accuracy and in variability were obtained between groups.

No relationship was demonstrated between degree of
schizophrenic illness and performance.

Correlations in accuracy of performance were demonstrated
between some of the measures. With respect to variability, no
correlations were demonstrated for the schizophrenics, whereas
normals showed highly significant correlations between all
tests.

Implications of these findings, both for theory and for future
research, were discussed.

EGO STRENGTH AND TYPE OF DEFENSIVE BEHAVIOR*


GERALD F. KING AND MARVIN SCHILLER
Michigan State University 


In a study of the construct validity of the 
Barron (1953) Ego Strength (Es) Scale, 
Schiller (1958) failed to find a significant re- 
lationship between this scale and the amount 
of defensiveness displayed by individuals in a 
stressful, real life situation. In a postmortem 
consideration of the negative results, the con- 
cept of defensiveness was given some scrutiny. 
As employed in this and other studies, defen- 
siveness is a quantitative extension of the psy- 
choanalytic typology of defenses. It is a com- 
posite or global concept. This paper presents 
a reanalysis of Schiller’s data by exploring 
the feasibility of using specific types of defenses as a frame
of reference instead of the global concept of defensiveness.

* This paper is based on a master’s thesis by the junior author
… Grateful acknowledgment … Terrence M. Allen and Frank … and
suggestions.

1 Some of the theoretical ambiguities of psychoanalytic … to
defense mechanisms are as follows: (a) … in the criteria as to
what is classified as a … mechanism, (b) lack of specificity …
their chronological development, (c) … on whether some of the
mechanisms of childhood also occur in normal adults.

In the reanalysis, the initial task was seen as a conceptual
one, that of deriving the expected relationship between ego
strength and different defensive behavior. While such a
relationship is not explicitly outlined in psychoanalytic
theory, the parallel between ego development and the
developmental sequence of the defense mechanisms seems to offer
a basis for making predictions. Blum (1953) and others have
indicated that there are a number of ambiguous aspects in the
theoretical treatment of defense mechanisms,1 but most of these
elements can be bypassed in the present analysis. In this study,
the concern was with the more gross features of psychoanalytic
theory. For example, it is sufficient to utilize the
differentiation that denial and projection are 
more primitive defense mechanisms than ra- 
tionalization, more primitive in the sense that 
they develop and operate earlier in the indi- 
vidual’s life. The sequence of defenses that 
are employed by the child parallels the mat- 
uration of the ego, especially in its increas- 
ing awareness and adaptation to reality. Both 
denial and projection are incompatible with 
reality. As ego functions become more ex- 
tended, denial and projection become less ap- 
propriate mechanisms. Bellak (1958) states, 

By and large the earlier it emerges, the more primitive a
defense, and the more pathological in adult life. Extensive
denial and very marked projection probably vie for first place
with regard to seriousness of pathology, by virtue of their
effect on the individual’s adaptation to reality (p. 20).


Reality is given definite recognition in the 
defense mechanism of rationalization. Accord- 
ing to Fenichel (1954), this mechanism serves 
as a means of justifying certain behavior by 
making it reasonable in the light of reality. 
Thus, rationalization emerges at a higher 
level of ego development or “ego strength.” 

To complete the conceptual picture, it 
seemed reasonable to assume that the rela- 
tionship between ego maturation and the de- 
velopment of various defenses in childhood 
has relevance for adult behavior, i.e., ego 
strength is related to the developmental level 
of the defense mechanism that is employed. 
The specific hypothesis was that, in a situa- 
tion conducive to the elicitation of defensive 
behavior, level of ego strength is positively 
related to the relatively greater use of ration- 
alization than either denial or projection. 


METHOD 
Subjects and Setting 


The Ss were 60 male drivers who, due to an excessive number of
traffic violations and/or accidents, had been summoned to the
Driver and Vehicle Services, Office of the Secretary of State
(Michigan), for a re-examination interview to determine their
future driving privileges. The ages of the Ss ranged from 17 to
62 years (M = 25.67, SD = 9.04), with level of education
extending from 8 to 16 years (M = 11.82, SD = 1.99). The Ss were
aware that the re-examination interview could result in the loss
of their driving permits. Such decisions were made by the
interviewers in more than 50% of the cases. When the importance
of a driver’s license to the average individual is considered,
it can be assumed that there were strong elements of anxiety in
this real life situation, one that could be expected to elicit
defensive behavior.


Procedure 


Prior to the re-examination interview, the Ss were 
individually administered a battery of tests that in- 
cluded the Es scale, the WAIS Vocabulary Subtest, 
and the DDB Inventory (King, 1957). 


Measures of Defensive Behavior 


The DDB Inventory was designed to measure the orientations of
problem drivers toward their past driving records (violations
and accidents). In content, it consists of 40 defensive and
nondefensive statements that had been culled from the interview
remarks of several hundred problem drivers. The 28 defensive
statements were selected by judges to reflect “defensiveness
against accepting personal responsibility for one’s traffic
record.” The Ss indicated their degree of agreement with the
inventory items on a three-point scale. The sum of the weighted
responses to the defensive items gives a general defensive
score. Computed for several samples, the split-half reliability
coefficients for the general defensive score were of the order
of .80. In studies of concurrent validity, correlations of the
order of .60 were obtained between the DDB measure of
defensiveness and interview ratings of defensiveness.
To meet the purposes of the present analysis, the 28 defensive
statements were submitted to 10 judges (advanced graduate
students), who were asked to use the following categories in
classifying the statements according to the type of defense
mechanism that was represented: denial, projection,
rationalization, and indeterminate. From the items on which
there was agreement by at least 8 out of the 10 judges, 4 were
selected for each of the three defenses. [...] than four items
were available for two of the [...]

Examples of statements illustrating the different defenses are
as follows: “My record may look bad, but I real[...]”; “I think
that the p[...]”; “[...] speed limit once in a while”
(rationalization).

The [...] delineations among the defensive [...] yielded
separate “quantitative-typology” scores for denial (D),
projection (P), and rationalization (R).
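
The judge-agreement rule described above (at least 8 of 10 judges placing a statement in the same category) can be sketched as follows. This is only an illustration under stated assumptions: the votes and item labels are hypothetical, and because the text does not say how 4 items were chosen when more than 4 met the criterion, the sketch simply keeps the first 4 it encounters.

```python
from collections import Counter

def agreed_category(votes, min_agree=8):
    """Return the modal category if at least `min_agree` judges chose it, else None."""
    category, count = Counter(votes).most_common(1)[0]
    return category if count >= min_agree else None

def select_items(judgments, per_defense=4, min_agree=8):
    """Keep up to `per_defense` items per defense among items meeting the agreement rule.

    Taking the first qualifying items is an assumption; the source does not
    describe how the final 4 items per defense were picked.
    """
    selected = {"denial": [], "projection": [], "rationalization": []}
    for item_id, votes in judgments.items():
        category = agreed_category(votes, min_agree)
        # Items classified as "indeterminate" or lacking agreement are skipped.
        if category in selected and len(selected[category]) < per_defense:
            selected[category].append(item_id)
    return selected

# Hypothetical votes from 10 judges for three statements.
judgments = {
    "item_01": ["denial"] * 9 + ["indeterminate"],
    "item_02": ["projection"] * 7 + ["denial"] * 3,
    "item_03": ["rationalization"] * 8 + ["projection"] * 2,
}
result = select_items(judgments)
print(result)
```

Here `item_02` is dropped because only 7 judges agree, which falls short of the 8-of-10 criterion.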


RESULTS AND DISCUSSION 


As a preliminary step, the defensive scores (D, P, and R), along
with six levels of ego strength from the Es scale, were treated
with analysis of variance. Table 1 shows that, in addition to
the significant difference for between defenses (F = 32.81,
p < .01), the interaction between the Es levels and the
different defenses was of marginal significance (F = 1.94,
p < .05). The latter finding is compatible with the hypothesis.
Using corrections for main effects, i.e., differences between
defenses, the scores were plotted, with Es being the ordinate
and the defensive scores falling on the abscissa. In line with
the hypothesis, the expectation was that R would be represented
by an ascending diagonal, which would be crossed by the
descending diagonals of D and P. The results were moderately
good for R and P but not for D.

A correlational analysis provided the strongest support for the
hypothesis. First, it was found that Es was not significantly
correlated with the individual defensive scores: Es vs. D, -.10;
Es vs. P, -.10; Es vs. R, -.08. What might be called a measure
of the “relative use of rationalization (RUR)” was then derived
from the following simple formula: RUR = R - (D + P) + 10, the
constant being added to eliminate negative values. Support was
given to the hypothesis by the resulting significant correlation
of .27 between Es and RUR (p < .05). No noticeable departure
from rectilinearity was suggested by an inspection of the
scattergram. Further analysis revealed that the correlation
between Es and RUR was independent of intellectual level, as
measured by the WAIS vocabulary.


TABLE 1

ANALYSIS OF VARIANCE OF THE DIFFERENT DEFENSIVE SCORES
WITH SIX LEVELS OF EGO STRENGTH (Es)

Source of Variation     df    Sum of Squares      F        p
Between Es Levels        5          9.37         0.80
Between Ss              54     [illegible]
Total Between Ss        59     [illegible]
Between Defenses         2         49.24        32.81    <.01
Es X D                  10         14.55         1.94    <.05
Pooled Ss X D          108         80.77
Total Within Ss        120        144.56
Total                  179        267.64

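
The RUR index and its correlation with Es can be illustrated with a short sketch. The formula RUR = R - (D + P) + 10 is taken from the text; the five sets of scores below are invented purely for illustration and are not the study's data.

```python
import math

def rur(d, p, r, constant=10):
    """Relative use of rationalization: R - (D + P) + constant."""
    return r - (d + p) + constant

def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

# Hypothetical (Es, D, P, R) scores for five subjects.
subjects = [(38, 6, 7, 5), (42, 5, 6, 6), (45, 5, 5, 7), (50, 4, 5, 8), (47, 6, 4, 6)]

es_scores = [es for es, _, _, _ in subjects]
rur_scores = [rur(d, p, r) for _, d, p, r in subjects]
print(rur_scores, round(pearson_r(es_scores, rur_scores), 2))
```

Because D and P are subtracted together, a subject scores high on RUR only when rationalization outweighs the other two defenses combined, which is the "relative" sense the index is meant to capture.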
In deriving specific defensive scores from 
the DDB Inventory, only 12 of the 28 de- 
fensive items were used. Consideration was 
given to the possibility that the relationship 
between Es and RUR was a function of the 
12 items that were selected, that comparable 
results could be obtained with these items 
using the general concept of defensiveness. 
This possibility was not supported when Es 
was correlated with a general defensive score 
based on the 12 items, i.e., the sum of D, P, 
and R. The r of —.15 was in the direction 
predicted by Schiller (1958) but not significant.

A second sample of Ss (N = 50), tested in the same setting, was
available for a cross-validation of the obtained correlation
between Es and RUR. While the Ss in this sample underwent a
shorter test battery, the principal difference was that the DDB
Inventory was slightly modified. The inventory items were
arranged differently, and some were changed in wording. As in
the first sample, Es was not significantly correlated with the
individual defensive scores or the revised general defensive
score: Es vs. D, -.14; Es vs. P, -.17; Es vs. R, .04; Es vs.
D + P + R, -.12. The correlation between Es and RUR was again
significant (r = .31, p < .05).

For purposes of comparison, the analysis of the same data in
accordance with two different orientations can be summarized.
In the original treatment of the data, the correlations between
level of ego strength and general defensiveness in two
independent samples were -.18 and -.20, both in the predicted
direction but not significant. Guided more by the specifics of
psychoanalytic theory, the present study yielded small but
significant positive correlations (rs of .27 and .31) between
level of ego strength and the relative use of rationalization
in the two samples. While the differences produced by the two
analyses are by no means striking, the results do suggest that,
in the present context, the psychoanalytic typology of defenses
provides a more fruitful frame of reference than the general
concept of defensiveness.

3 Previous research (King & Schiller, 1958) revealed that the
response set “acquiescence” [...] in the general defensive
score of the DDB in both [...], which suggests that caution
[...] interpretation of [...] obtained with [...] attempts to
partial out [...] of acquiescence [...] RUR was not appreciably
affected by acquiescence.


SUMMARY 


Preliminary research employing the general 
concept of defensiveness failed to find a rela- 
tionship between level of ego strength, as 
measured by the Barron (1953) Ego Strength 
Scale, and the amount of defensiveness shown 
by individuals in a stressful real life situa- 
tion. In a reanalysis of the data, the present 
study explored the possibilities of a different 
orientation, that offered by the psychoanalytic 
typology of defense mechanisms. The specific 
defenses considered were denial, projection, 
and rationalization. The following hypothesis 
was suggested by psychoanalytic theory: In a
situation conducive to the elicitation of de- 
fensive behavior, level of ego strength is posi- 
tively related to the relatively greater use of 
rationalization than either denial or projec- 
tion. The results obtained from two samples 
supported the hypothesis. 


PERSONALITY CORRELATES OF PEPTIC ULCER PATIENTS*


SIMONE MARSHALL ? 


Teachers College, Columbia University 


Alexander’s theory points out two major 
aspects in the psychodynamics of peptic 
ulcer patients: (a) a strong drive for achieve- 
ment, independence, and self-sufficiency con- 
flicting with (b) persistent infantile wishes to 
be dependent and protected. The patient re- 
mains unaware of his dependency needs and 
typically describes himself as follows: “I am 
efficient, active, productive; I give to every- 
body, help people, assume responsibilities, 
like to have people depend on me, like to 
be the effective leader and the self-sufficient, 
active, or even aggressive personality” (Alex- 
ander & French, 1948, p. 110). The Achieve- 
ment drive and the related personality vari- 
ables, Efficiency, Responsibility, Dominance, 
Self-sufficiency and Aggression, are conceived 
as defensive reactions against unacceptable or 
“ego-alien” dependency needs. 

Other aspects of the defensive pattern are 
also considered by Alexander and his col- 
leagues. Therese Benedek (1946) and Cath- 
erine Bacon (1948) have noted the “severe 
self-criticism” of the ulcer patient and his 
need “to lead a very ethical life” (Bacon, 
1948, p. 135) as a reaction against his recep- 
tive trend. Such an emphasis on self-criticism 
and strict moralism with the consequent lack 
of self-acceptance are likely to be reflected in 
a large discrepancy between self-concept and 
ideal self. The guilt and shame experienced 
in relation to dependent longings are gener- 
alized and emotional responses become re- 
stricted. This “absence of give and take of 
emotional responses” as well as the “inability
[to express] and even feel emotion” (Alexander
& French, 1948, p. 189) may be termed emotional
inhibition and is associated with impoverished
interpersonal relationships. Such
a limited ability to interact with others, 
coupled with the need to maintain a rigorous 
moralistic code, is not likely to allow much 
inner life and introspection so that another 
aspect of the defensive pattern characterizing 
ulcer patients may be the inability to question 
one’s motives and others’ intentions; that is, 
there is a lack of insight. In considering the 
relationships among variables such as Strict 
Moralism, Lack of Insight, and Emotional In- 
hibition in the context of a need for Achieve- 
ment, it appears that the ulcer patient may 
attempt to attain his goals by accepting only 
the more obvious social guidelines and by 
exercising restraint. In a sense, the emphasis 
may be on adjustment to explicit cultural 
sanctions, or in a word—conformity. This 
great reliance on conventional values does not 
encourage innovations but rather tends to 
maintain the status quo, since most changes 
are experienced as threatening to a precarious 
emotional balance. Thus, resistance to change 
may also be included in the defensive behavior
pattern of peptic ulcer cases.

In sum, the personality variables of par- 
ticular significance in the defensive behavior 
pattern of peptic ulcer patients include: Efficiency,
Responsibility, Dominance, Self-sufficiency,
Achievement and Aggression, as specifically
emphasized in Alexander’s theory.
The variables of Strict Moralism, Lack of 
Self-acceptance, Emotional Inhibition, Lack 
of Insight, Conformity and Dislike for Change 
were derived from a review of the cases re- 
ported in the literature by Alexander and his 
followers. 

Studies which corroborate Alexander’s theoretical
position consist mostly of clinical observations
made in the course of psychotherapeutic interviews
(Van der Heide, 1948; Garma, 1950) and investigations
using projective techniques (Modell & Potter, [...];
Brown, Bresnahan, Chalke, Peters, [...] Tougas, 1950).
While most authors agree
with Alexander’s assumption concerning the
existence of strong dependency needs in peptic 
ulcer cases, the particular defense mechanisms 
used to prevent realization of the depend- 
ent desires are frequently questioned (Kapp, 
Rosenbaum, & Romano, 1947; Prince, 1950;
Weinberg, 1951). The present study was de- 
signed to investigate the pattern of defensive 
behavior proposed by Alexander as particu- 
larly characteristic of the peptic ulcer patient. 


HYPOTHESES 


General Hypothesis 

The general hypothesis tested was that 
peptic ulcer patients, in comparison to a nonulcer
psychosomatic group and a nonpsychosomatic
group, endorse and value achievement,
responsibility, and related personality variables.


Experimental Hypotheses 


The following research hypotheses were formulated
on the basis of the personality variables
comprising the defensive pattern proposed
by Alexander:

1. Peptic ulcer patients, in comparison to
nonulcer psychosomatic and nonpsychosomatic
patients, describe themselves as (a) more
dominant, (b) more aggressive, (c) more efficient,
(d) more responsible, (e) more self-sufficient,
(f) more strictly moralistic, (g) more emotionally
inhibited, (h) more conforming, (i) more
achievement-oriented, (j) less interested in
change, (k) less intraceptive.

2. Peptic ulcer patients, in comparison to
nonulcer psychosomatic and nonpsychosomatic
patients, show greater self-ideal discrepancy.


METHOD 


Subjects 


The experimental group, composed of 40 patients [with a
diagnosis] of ulcer, was compared to two control groups. One
control group, C1, included 20 [psychosomatic] cases other than
gastrointestinal disorders, [...] asthma and neurodermatitis.
The [second control] group, C2, was comprised of 40 patients
[without psych]osomatic or psychiatric disorders, [...]
respiratory infections and minor [...]. [...] intelligence (GT
score of Army Classification Battery, 1948) and socioeconomic
level (Hamburger Classification, 1957) as shown in Table 1.


TABLE 1

ANALYSES OF VARIANCE APPLIED TO DEMOGRAPHICAL
CHARACTERISTICS OF EXPERIMENTAL AND
CONTROL GROUPS

Sources of Variation     df    Mean Square     F      p

Age
  Between                 2        6.60       .13    ns
  Within                 97       52.20
  Total                  99
Education
  Between                 2        3.70       .94    ns
  Within                 97        3.92
  Total                  99
Socioeconomic Level
  Between                 2        3.85      1.89    ns
  Within                 97        2.05
  Total                  99
Intelligence*
  Between                 2       22.36       .09    ns
  Within                 78      256.79
  Total                  80

* Only 80% of the GT scores were available. The distribution of
nonavailable GT scores was nine in the experimental group, three
in the control group C1, and eight in the control group C2.
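
As a reading aid for Table 1, each F value is the ratio of the between-groups mean square to the within-groups mean square. The sketch below recomputes that ratio for the three rows whose printed F is legible in the source; small differences from the printed values would be expected from rounding of the mean squares. The dictionary layout and names are illustrative only.

```python
def f_ratio(ms_between, ms_within):
    """F statistic as the ratio of between-groups to within-groups mean square."""
    return ms_between / ms_within

# (mean square between, mean square within, F as printed in Table 1)
table_1 = {
    "Education": (3.70, 3.92, 0.94),
    "Socioeconomic level": (3.85, 2.05, 1.89),
    "Intelligence": (22.36, 256.79, 0.09),
}

for name, (ms_between, ms_within, reported) in table_1.items():
    computed = f_ratio(ms_between, ms_within)
    print(f"{name}: computed F = {computed:.2f}, reported F = {reported:.2f}")
```

All three ratios fall well below the values needed for significance with 2 and roughly 80 to 100 degrees of freedom, which is consistent with the "ns" entries.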


Measuring Instruments 


Two instruments were used to test the hypotheses 
of the present study: the Edwards Personal Prefer- 
ence Schedule (EPPS), and a specially designed rat- 
ing scale, the Peptic Ulcer Index, which provides for 
self and ideal self measures of eight of the variables 
considered in the defensive pattern against the aware- 
ness of dependency. Both instruments were presented 
to Ss as questionnaires included in a survey of personal
attitudes of hospitalized versus nonhospitalized
soldiers. 


Construction of the Peptic Ulcer Index 


A total of 160 statements, 20 for each of the fol- 
lowing variables, were compiled. The variables were 
defined in terms of descriptions given in cases re- 
ported by Alexander and his followers: 


Dominance: characterized by a person who likes 
to have people depend on him, likes to be the 
effective leader, manifests highly competitive 
tendencies. 

Aggression: characterized by occasional outbursts 
of anger, but more often by a subdued hostility. 

Efficiency: characterized by a person who is active,
[...] capable, and wants to get things done.

Responsibility: characterized by a person who likes
to assume responsibilities, is very conscientious 
and worries a lot about what has to be done. 

Self-sufficiency: characterized by a person who 
neither wants nor needs help, makes decisions 
by himself, is ashamed of accepting any help or 
favor. 

Strict Moralism: characterized by a person who
leads a very ethical life, is quite preoccupied by
what is right and what is wrong, is inclined to
follow well established rules.

Emotional Inhibition: characterized by a person 
who considers that to show emotion is a sign of 
weakness, is unable to express any form of 
emotion. 

Conformity: characterized by a person whose in- 
terpersonal relationships are limited by emotional 
inhibition while communication takes place on 
a formal level. Acceptance of social demands, 
conventional attitude and no taste for change. 


Many statements were constructed according to suggestions
of psychologists who were asked to compose items related to
the eight variables. Additional statements were derived from
the Manifest Hostility Scale (Siegel, 1956) and from Murray’s
questionnaire (Murray, 1938). In the final form of the items,
every attempt was made to reflect directly the behavioral
observations and self-evaluations given in cases reported in
the literature. To counter the effects of response-set, a
number of items were phrased in the negative. The wording of
the statements was not above the 5th-6th grade level of
comprehension according to the Thorndike-Lorge Word Count
(Thorndike & Lorge, 1944).

The content validity of the scale was established 
by the following method. Five clinical psychologists 
and five patients of the nonpsychosomatic popula- 
tion not included in the studied sample were asked 
to classify the 160 statements into eight categories 
according to the description of the variables. The 
agreement of 8 out of 10 judges was considered suffi- 
cient to determine the inclusion of a given item in 
the Index. The 148 statements selected by the above 
method were presented to 170 pilot Ss who were 
asked to rate themselves on a five-point scale for 
self and ideal self. Following Flanagan's technique
(Flanagan, 1939), the final selection of the items was
based on the correlation between item scores and
total score for each variable. Items with a correlation
less than .30 were eliminated. Fifteen items for
each variable were finally retained.
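
The item-selection step described above can be sketched as an item-total correlation filter. Note the assumption: the sketch uses an ordinary product-moment correlation between each item and the uncorrected total score, which stands in for, and is not a reproduction of, Flanagan's (1939) procedure. The ratings are hypothetical.

```python
import math

def pearson_r(xs, ys):
    """Product-moment correlation; returns 0.0 if either list has no variance."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)

def retain_items(responses, threshold=0.30):
    """Return indices of items whose correlation with the total score is >= threshold.

    `responses` holds one list of item ratings per subject, all for one variable.
    """
    totals = [sum(row) for row in responses]
    kept = []
    for j in range(len(responses[0])):
        item_scores = [row[j] for row in responses]
        if pearson_r(item_scores, totals) >= threshold:
            kept.append(j)
    return kept

# Hypothetical five-point ratings: 5 subjects by 3 items.
responses = [
    [5, 4, 3],
    [4, 4, 2],
    [2, 1, 4],
    [1, 2, 3],
    [3, 3, 3],
]
kept_items = retain_items(responses)
print(kept_items)
```

In this toy data the third item runs against the total score and is eliminated, while the first two are retained.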

Intervariable correlations were computed with the [data]
for the item analysis. Correlations of [...] or greater were
found between Responsibility and Efficiency (.59),
Responsibility and Moralism (.45), Conformity and Moralism
[...], and Aggression and Moralism (-.40).

The reliability coefficients, shown in Table 2, were
established for each variable by correlating randomly
selected halves and by applying the Spearman-Brown
formula. The reliability of the Index was deemed
sufficiently high to test the experimental hypotheses.

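
The split-half procedure mentioned above can be sketched as follows: items are divided at random into two halves, the half scores are correlated, and the Spearman-Brown formula r_full = 2r / (1 + r) steps the half-test correlation up to full length. The random seed, the data, and the function names are illustrative assumptions.

```python
import math
import random

def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)

def spearman_brown(r_half):
    """Estimate full-length reliability from the correlation between two halves."""
    return 2 * r_half / (1 + r_half)

def split_half_reliability(responses, seed=0):
    """Correlate randomly selected item halves, then apply Spearman-Brown."""
    n_items = len(responses[0])
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)
    half_a, half_b = indices[: n_items // 2], indices[n_items // 2:]
    scores_a = [sum(row[j] for j in half_a) for row in responses]
    scores_b = [sum(row[j] for j in half_b) for row in responses]
    return spearman_brown(pearson_r(scores_a, scores_b))

# Hypothetical ratings: 6 subjects by 4 items.
responses = [
    [5, 4, 5, 4],
    [4, 4, 3, 5],
    [3, 3, 4, 2],
    [2, 3, 2, 2],
    [1, 2, 1, 3],
    [2, 1, 2, 1],
]
print(round(split_half_reliability(responses), 2))
```

With an odd number of items, such as the 15 retained per variable, the two halves are necessarily unequal (7 and 8 items in this sketch).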

TABLE 2

COEFFICIENTS OF RELIABILITY FOR EACH VARIABLE
OF THE PEPTIC ULCER INDEX

Variable                 Coefficient
Efficiency                   .83
Aggression                   .65
Moralism                     .87
Emotional Inhibition         .80
Responsibility               .81
Dominance                    .88
Conformity                   .84
Self-sufficiency             .80


RESULTS


The general hypothesis that peptic ulcer patients, compared to
a nonulcer population, endorse and value patterns of achievement
and responsibility while denying patterns of dependency, was
tested in terms of twelve specific variables. The data
concerning each variable were treated with the t test in order
to determine the significance of differences between
experimental and control groups. The null hypothesis was
rejected at the .025 level of confidence with a one-tailed test.
Since the two control groups were found to be equivalent on all
relevant variables, the test consisted of a comparison of the
ulcer group with the combined control groups.

Self-ratings of the ulcer group, as measured by the Peptic Ulcer
Index, were not statistically different from self-ratings of the
nonulcer samples for the following variables: Dominance,
Aggression, Efficiency, Responsibility, Self-sufficiency, and
Strict Moralism. However, the ulcer patients described
themselves as more emotionally inhibited and more conforming
than the control Ss at the .02 and .01 levels of confidence
respectively, as shown in Table 3.

On the EPPS, no significant differences were found between
experimental and control groups on the variables of Aggression,
Dominance, Achievement, and Change. However, the ulcer patients
described themselves as less intraceptive than the control Ss as
indicated in Table 4.

The second main hypothesis, that the discrepancy between self
and ideal self ratings is higher for the ulcer group than for
the combined control groups, was not supported.

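
The group comparison described above (ulcer group versus the two control groups combined) can be sketched from summary statistics alone. This is a minimal illustration, assuming a pooled-variance t test and sample standard deviations computed with n - 1; the source does not state the exact computing formula, so the result need not reproduce the tabled t exactly. The Emotional Inhibition means and SDs are taken from Table 3, and the critical value of about 1.98 (one-tailed .025, df near 100) is an approximate figure from standard t tables.

```python
import math

def combine_groups(n1, m1, s1, n2, m2, s2):
    """Merge two groups' summary statistics into (n, mean, sd) for the combined sample."""
    n = n1 + n2
    mean = (n1 * m1 + n2 * m2) / n
    # Total sum of squares = within-group SS + between-group SS.
    ss = ((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2
          + n1 * (m1 - mean) ** 2 + n2 * (m2 - mean) ** 2)
    return n, mean, math.sqrt(ss / (n - 1))

def pooled_t(n1, m1, s1, n2, m2, s2):
    """Two-sample t with pooled variance; returns (t, degrees of freedom)."""
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / df
    se = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / se, df

# Emotional Inhibition: ulcer group (N = 40) vs. C1 (N = 20) and C2 (N = 40) combined.
n_c, mean_c, sd_c = combine_groups(20, 42.85, 8.38, 40, 41.93, 8.70)
t_value, df = pooled_t(40, 46.55, 9.28, n_c, mean_c, sd_c)

CRITICAL_T = 1.98  # approximate one-tailed .025 cutoff for df near 100 (assumed)
significant = t_value > CRITICAL_T
print(round(t_value, 2), df, significant)
```

A one-tailed test is appropriate here only because the direction of each difference was predicted in advance by the hypotheses.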

TABLE 3

MEANS, STANDARD DEVIATIONS, AND DIFFERENCES BETWEEN MEANS FOR
EACH VARIABLE OF THE PEPTIC ULCER INDEX

                                                                        Difference between Groups
                                                                       Peptic Ulcer
                        Peptic Ulcer     Control 1       Control 2     vs. C1 & C2     C1 vs. C2
Variables               Mean     SD      Mean     SD     Mean     SD     t      p       t      p
Dominance               46.90  10.73     48.00   8.66    46.62   9.09    .19   ns       .51   ns
Aggression              40.80   8.94     43.15  12.08    39.51   8.04    .27   ns      1.38   ns
Efficiency              58.00   9.06     56.35   6.24    57.13  10.33    .66   ns       .30   ns
Responsibility          54.80   9.40     52.15   7.98    51.30  10.55   1.53   ns       .01   ns
Self-sufficiency        56.03   7.79     52.00   8.03    52.93   9.91   1.94   ns       .36   ns
Moralism                58.00   8.69  [illegible] 11.36  55.75  11.70   1.49   ns       .65   ns
Emotional Inhibition    46.55   9.28     42.85   8.38    41.93   8.70   2.24  .02       .39   ns
Conformity              49.88   7.95     45.25   7.55    46.09   8.53   2.49  .01       .37   ns
Discrepancy Scores      99.97  38.70    109.85  45.07   102.20  45.45    .83   ns       .58   ns


Additional Finding 

Among 10 other variables measured by the
EPPS, no significant differences were found
between experimental and control groups with
the exception of Order. The ulcer patients
described themselves as more orderly than
control Ss (p < .01).


Additional Analysis of Data 


An additional analysis of data was made to 
determine whether the results could be gener- 
alized to a civilian population. A comparison 
of the “career soldiers” (more than three years 
of service with intention to remain in service 
as a vocation) with the “noncareer soldiers” 
(draftees with no intention to remain in serv- 
ice beyond the obligated tour of duty) showed 
no significant differences. Therefore, the probability
that the findings of the present study
apply only to ulcer patients in the Armed
Forces is quite limited.


DISCUSSION OF RESULTS


The data did not consistently support the 
defensive pattern of behavior proposed by 
Alexander. That is, peptic ulcer patients did 
not vigorously react against their longing for 
love and affection by expressing unusually 
strong needs for Achievement, Dominance, 
Aggression, Responsibility, Efficiency and 
Self-sufficiency. However, Emotional Inhibition,
Avoidance of Insight and Conformity
characterized the ulcer group. Such a differentiation
suggests that ulcer patients may attempt
to ward off the realization of their dependency
needs by (a) an avoidance of insight
into their motives and feelings, (b) an
inhibition of expression of their emotional
needs, and (c) an uncritical acceptance of social
demands. This pattern can be seen as a
compromise between an infantile need for
protection and a socially acceptable, even
sanctioned behavior. A conformist attitude is,
in a way, a personally and socially acceptable
form of dependency in which one can indulge
without having to contend with disapproval
and guilt.


TABLE 4

t VALUES AND GROUP MEANS FOR EACH VARIABLE OF THE EPPS
INVOLVED IN MAIN HYPOTHESIS

                 Peptic Ulcer        C1              C2                  t
                  (N = 40)        (N = 20)        (N = 40)        PU vs.       C1 vs.
Variable         Mean    SD      Mean    SD      Mean    SD       C1 + C2       C2
Achievement      14.92  3.07     14.45  4.81     14.72  4.02        .30      [illegible]
Change           14.42  4.72     14.70  6.06     15.25  4.40        .30      [illegible]
Intraception     13.52  4.42     16.25  3.08     15.00  4.90       2.29*       1.04
Dominance        15.00  5.18     14.15  5.06     14.67  4.41        .57      [illegible]
Aggression       14.42  4.28     14.55  4.67     12.65  3.48    [illegible]    1.82

* Significant at .02 level of confidence.


The findings obtained might be accounted 
for in terms of a shift in behavior patterns so- 
cially acceptable at the present time in con- 
trast with behaviors emphasized in the society 
of Alexander’s patients. Uncritical acceptance 
of social demands, emotional inhibition, and 
avoidance of insight seem to constitute a 
meaningful and coherent cluster of person- 
ality variables. Stereotyped interpersonal re- 
lationships are characterized by a formal style 
of communication and group-oriented values. 
Several contemporary authors (Fromm, 1947; 
Riesman, 1950; Whyte, 1956) consider this 
need to conform and accept others’ standards 
a major dictum in Western society at the pres- 
ent time. To characterize this pattern, Whyte 
has used the term “Social Ethic” while Riesman
refers to “other-directedness.” The
dependency needs of the ulcer patient may be
thus appeased, or partially gratified through 
the acceptable channel of conforming behav- 
ior. Under such circumstances, the Achieve- 
ment drive and the subsequent conflict, De- 
pendency-Achievement, are quite superfluous. 
However, 20 or more years ago, when Alex- 
ander formulated his theory, the socially pre- 
dominant pattern which is reminiscent of 
Weber's Protestant Ethic (1930) included 
achievement, ambition, and competition. It 
is possible to speculate that the defensive pat- 
tern of behavior proposed by Alexander had 
only a temporary validity, and that the lasting 
meaning of this behavior pattern resided in 
the underlying motivation, the need to follow 
closely the culturally patterned set of values. 

Several authors report similar findings concerning
the conformity variable, without, however,
attributing much significance to this particular
aspect of the ulcer patient's clinical
portrait. Leary (1957) has noted the “hypernormal”
behavior of ulcer patients. Harris,
Christiansen, and Ruesch (1946) mentioned
conformity as a tentative form of adjustment.
Weiner et al. (1957), in a multidisciplinary
approach to the problem of peptic ulcer etiology,
have found that, with the blood serum
pepsinogen concentration controlled, the only
criterion differentiating the ulcer cases was
“the intensity of their attempt to maintain
relationships with others” (p. 5). Thus, it
may be the intensity of the need to relate to
others, coupled with the inability to communicate
effectively due to repressive mechanisms,
which explains the emphasis on conformity
in the personality of ulcer patients.

Although the unpredicted finding, strong
order in ulcer cases, may be due to chance, it
may be possible to account for its occurrence
by considering that this variable is assessed
in the EPPS as a methodical approach in work
habits and in everyday living. Military life 
certainly enforces such an approach, and the 
high Order scores of the experimental group 
can be viewed as an additional nuance to the 
picture of a person with limited insight, little 
ability to express himself emotionally, and 
great need to conform. 

In view of the fact that Alexander worked
with patients of upper middle and lower upper
class, business executives and intellectuals,
and the fact that patients in the present study
were lower middle class, skilled workers and
semiprofessionals, it would seem profitable to
extend this study to a larger socioeconomic
group analogous to Alexander’s sample. A
study of this sort would help determine
whether there has been a shift in the defense
mechanisms of ulcer patients or whether the
difference of findings is related to a difference
in social class grouping.




SUMMARY 

The defensive behavior pattern proposed by
Alexander as characteristic of the peptic ulcer
patient in his handling of dependency needs was
investigated. [...] aggressive, moralistic,
emotionally inhibited, conforming and resistant
to change, but less intraceptive and
self-accepting. Forty ulcer patients were
compared to 20 nongastrointestinal psychosomatic
and 40 nonpsychosomatic patients, with the EPPS
and a specially designed rating scale.

The results did not consistently support the 
defensive pattern proposed by Alexander and 
it can be said that the dependency conflict is 
no longer manifested in terms of dependency 
versus achievement. The ulcer patients are 
adopting behavioral patterns of frustration- 
avoidance in relation to their dependency 
needs rather than defending against these 
needs by a blatant pursuit of achievement 
and success. Dependency needs are defended 
against, yet partially satisfied through behav- 
ioral patterns which include personal and so- 
cial constriction in the service of acquiescence 
to formalized social demands.

The results of the study were interpreted in 
terms of a shift in socially desirable patterns 
of behavior. Alexander’s ulcer patients were 
viewed as inner-directed and governed by the 
Protestant Ethic while the ulcer patients of 
the present study were considered to be other- 
directed and dominated by the Social Ethic. 
The constant element in the ulcer patient’s 
clinical portrait appeared to be his persistent 
need to follow closely the socially sanctioned
set of values.


INHIBITION AND PERCEPTION OF MOVEMENT
ON THE RORSCHACH 


ANN FILINGER NEEL 
Wyandotte County Guidance Clinic, Kansas City, Kansas 


Validation of hypotheses underlying Ror- 
schach variables is an elusive task. Yet, de- 
spite the problems of definition, and experi- 
mental control, research aimed at clarifying 
basic issues is being conducted. A series of recent
studies by Meltzoff, Singer, Korchin, and
their associates is a case in point. Generalizing
from Werner’s (1945a; 1945b) sensory-tonic
theory of perception and Rorschach’s
(1942) speculations about movement, these
investigators predicted a positive relationship
between motor inhibition and perception of 
human movement on the inkblots. They pro- 
duced inhibition variously by tying subjects 
down (Korchin, Meltzoff, & Singer, 1951), 
and by having them write slowly, in a con- 
stricted fashion (Meltzoff, Singer, & Korchin, 
1953). Their hypothesis was supported. Other 
types of movement revealed more ambiguous 
associations with inhibition. Further studies 
(Singer & Herman, 1954; Singer, Meltzoff, & 
Goldman, 1952; Singer & Spohn, 1954; Singer 
& Sugarman, 1955) produced similar results 
and suggested that both the capacity to in- 
hibit and the ability to perceive movement 
might be related to healthy childhood experiences
and “good” identification with parents.

The present study is an effort to test the 
generality of these findings. In the first place, 
the earlier studies were confined to motor be- 
havior, but the authors seemed to imply their 
results could pertain to all varieties of inhibition
including suppression of ideas and
impulses. This assumption was never tested.
Secondly, the ambiguous relationship between 
inhibition and animal or inanimate movement 
requires clarification. Finally, the original 
studies of Meltzoff, Singer, and Korchin uti- 
lized the responses to only two Rorschach 
cards, one under normal conditions counterbalanced
with another under experimental
conditions. While later investigations
employed the entire Rorschach protocol, the
association between perceived movement and 
inhibition was never tested on all 10 cards. 
To substantiate the relationship it should be 
placed in the perspective of the total Ror- 
schach situation. This becomes especially im- 
portant in view of an apparent tendency to 
increase the number of responses to the second 
card viewed, regardless of conditions. 

These considerations gave rise to the hy- 
potheses of this study: 


Hypothesis 1 


Any situation calling for inhibition, or producing
an inhibited state, should increase the
amount of human movement perceived on the
Rorschach.


Hypothesis 2


Any situation calling for inhibition, or producing
an inhibited state, should increase the
amount of animal movement perceived on the
Rorschach.


Hypothesis 3


Any situation calling for inhibition, or producing
an inhibited state, should increase the
amount of inanimate movement perceived on
the Rorschach.


EXPERIMENTAL DESIGN
Subjects


Ninety-five introductory psychology students were
involved in the study. Two subjects failed to complete
the experiment, making the final N = 93.


Procedure 


The use of all 10 Rorschach cards made it impossible
to follow the practice of making each person
his own control. Instead, subjects were randomly
assigned to the several treatments comprising the
experiment. To control for the influence of intelligence
on perception of human movement, the groups were 
matched on Wonderlic (1945) scores. There was no 
attempt to control age and sex. 

In all, five different conditions were employed. Two 
of these were similar to the inhibiting conditions 
of the Meltzoff, Singer, Korchin studies. The Tied 
group, N = 18, had their dominant arm bound to
their desks during the Rorschach. The Slow Writing
group, N = 19, were instructed to copy as slowly
as possible a printed paragraph describing a foot race. 
Ten minutes was allotted for the task. The para- 
graph was substituted for the short phrase used by 
the earlier investigators in hopes of increasing the 
inhibitory effects. All other groups copied this ma- 
terial but were given no instructions as to speed of 
writing. 

The third experimental condition was introduced 
to test for the effects of inhibition of ideas. To pro-
duce such a state, a Behavior Inhibition group, N
= 19, received the following supplement to the usual
Rorschach instructions: “Normal people commonly
see at least a few sexual things on each picture, pri- 
vate parts of people’s bodies or various kinds of 
sexual activity. Despite sexual ideas being so com- 
mon in this experiment you must not give me any- 
thing pertaining to sex no matter how clearly you 
see it.”

Before one can assume that such instructions pro- 
duce inhibition, one must discover whether the sug- 
gestion regarding the normality of sexual percepts in- 
creases the probability of their occurrence. For this 
reason, a Sex Control group, N = 18, were given the
set that sex responses were commonly seen, and were 
not told to avoid them. 

A Normal Control group, N = 19, was given the 
ordinary introduction to the Rorschach. 

All five conditions were run concurrently in one 
large classroom, this form of data collection not only 
being efficient, but providing a class demonstration 
of multivariate design. All instructions were written 
and bound with necessary materials in booklet form. 
The subjects were told in the beginning that the 
study involved a number of different tasks and that 
the people next to them would not be doing the same 
things they were. They were requested to ignore the 
activities of their neighbors with the promise that 
the study would be explained later. Each Rorschach 
card was shown on a projection screen for two 
minutes. The subjects were asked to give two re- 
sponses per card. No inquiry was conducted. 

The Rorschach protocols were scored blind, guided 
by the criteria Klopfer (Klopfer, B., Ainsworth, 
Klopfer, W. G. & Holt, 1954) has established except 
that human percepts were not given a score of M un- 
less movement was explicitly described. Form level 
was scored according to the list given by Beck (1949, 
pp. 159-195). Because there was no inquiry some 
error is undoubtedly present in assigning form level 
scores. It was believed, however, that the informa- 
tion produced by such scores was sufficiently useful 
to justify an attempt at applying it. 
Additional determinants were included in the final
tabulation of scores, but were counted as half a 
score. This process of weighting multi-determinant 
responses, and the fact that many subjects did not 
give twenty responses resulted in unequal Rs. To 
equate individual records, frequency counts for each 
score were converted into percentages of total num- 
ber of responses. 
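
The weighting and percentage conversion just described can be sketched as follows. This is only an illustration under stated assumptions: the function name and the sample protocol are hypothetical, and the denominator is assumed to be the total weighted score of the record (main determinant = 1, each additional determinant = 0.5), which is one reading of the "unequal Rs" mentioned above.

```python
from collections import Counter


def determinant_percentages(responses):
    """Convert weighted determinant counts into percentages of the record total.

    `responses` is a list of (main_determinant, [additional_determinants]).
    Assumption: a main determinant counts 1.0, each additional one counts 0.5,
    and the total weighted score of the record is the denominator.
    """
    weights = Counter()
    for main, additional in responses:
        weights[main] += 1.0
        for det in additional:
            weights[det] += 0.5
    total = sum(weights.values())
    return {det: 100.0 * w / total for det, w in weights.items()}


# Hypothetical four-response protocol, not data from the study.
protocol = [("M", ["FM"]), ("F", []), ("FM", []), ("m", ["M"])]
percentages = determinant_percentages(protocol)
```

Expressing each score as a percentage lets a record of 9 responses be compared with a record of 20.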


RESULTS AND DISCUSSION


With certain exceptions dictated by pro- 
cedure followed in the earlier studies, the chi 
square statistic was used. Fourfold tables 
were constructed, the median score of the 
Control Group being used as the cutting point 
for the comparison of it with the various other 
groups. Persons scoring on the median were 
included in the below median cell. Yates’ cor-
rection was used whenever expected frequency
in any cell was less than 10. 
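
A minimal sketch of the fourfold procedure described above may help. It assumes the usual shortcut formula for a 2 x 2 chi square, with Yates' correction applied when any expected frequency falls below 10; the function names and the example counts are hypothetical and are not taken from the study.

```python
def median(values):
    """Median of a list of scores."""
    s = sorted(values)
    n = len(s)
    mid = n // 2
    return s[mid] if n % 2 else (s[mid - 1] + s[mid]) / 2


def fourfold_table(control, experimental):
    """Split both groups at the Control median; scores on the median count as below."""
    cut = median(control)

    def split(scores):
        above = sum(1 for x in scores if x > cut)
        return above, len(scores) - above

    a, b = split(control)
    c, d = split(experimental)
    return a, b, c, d


def chi_square_2x2(a, b, c, d, yates_below=10):
    """Chi square for a fourfold table, Yates-corrected if an expected cell < yates_below."""
    n = a + b + c + d
    row1, row2, col1, col2 = a + b, c + d, a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("chi square is undefined when a marginal total is zero")
    expected = [row1 * col1 / n, row1 * col2 / n, row2 * col1 / n, row2 * col2 / n]
    diff = abs(a * d - b * c)
    if min(expected) < yates_below:
        diff = max(diff - n / 2, 0.0)  # Yates' continuity correction
    return n * diff ** 2 / (row1 * row2 * col1 * col2)


# Hypothetical counts: 12 above / 7 below in one group, 5 above / 14 below in the other.
chi2 = chi_square_2x2(12, 7, 5, 14)
```

With groups of 18 or 19 the expected frequencies are near 9, so the correction would apply to most comparisons of this size.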

Prior to the consideration of the results it 
is necessary to discover if certain contaminat- 
ing factors could influence the findings.


Controls for Extraneous Variables


Intelligence. In addition to matching for 
Wonderlic scores, each group was checked to 
discover if there was an indication of any re-
lationship between good human movement re-
sponses (M+) and IQ, or between all move-
ment responses (M+ and M—) and IQ. Equal
numbers of persons fell in all four cells of the 
chi square tables, revealing no relationship be- 
tween human movement responses and intelli- 
gence in the population of the present experi- 
ment. 

Number of Responses. Differences in the
number of responses or determinants could 
affect the number of movement responses 
especially in the Meltzoff, Singer, Korchin 
(1953) approach. In that study the number 
of responses given under inhibitory condi- 
tions by the whole group was compared to 
the number of movement responses given un- 
der normal conditions. No significant differ- 
ence in R was found in the earlier studies, 
although a slight increase was noted on the 
second card seen. 

In the present study, the number of re- 
sponses ranged from 9 to 20 in the Behavior 
Inhibition group and the Normal Control 
group, and from 10 to 20 in the other groups.
The median number of responses for the en-
tire sample was 18. There was no significant
difference in the number of responses given 
by each group, but there was a consistent 
trend toward more responses in experimental 
groups. Only 32% of the Normal Control 
group gave more than 18 responses, com- 
pared to 37% of the Tied group, 39% of the 
Sex Control group, 45% of the Slow Writing 
group, and 47% of the Behavior Inhibition 
group. 

There was no significant difference in the 
number of determinants used, but the same 
consistent trend was in evidence. 

Validity of Inhibitory Procedures. We can- 
not discover whether the various experimental 
conditions employed did indeed produce in- 
hibitions but we can determine whether the 
instructions had any effect. The Behavior In- 
hibition group were given a suggestion to see 
sexual percepts but were asked to avoid giving 
them. The Sex Control group were given the 
suggestion to see such percepts. The Normal 
controls were given no suggestions at all. If 
the various instructions had any effect, the 
Behavior Inhibition group should produce 
fewer sex responses than the Normal Control
group. Both groups should give fewer sex 
percepts than the Sex Control group. Chi 
square comparisons yield the predicted rela- 
tionships, significant beyond the 5% level 
for a one tail test where df = 1 (χ² Normal
Control group vs. Sex Inhibition group =
2.92; χ² Normal Control group vs. Sex Con-
trol group = 7.27; χ² Sex Inhibition group
vs. Sex Control group = 17.13).
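
The one tail probabilities for chi squares of this kind can be checked with a short calculation. The sketch below relies on the fact that a chi square with df = 1 is the square of a standard normal deviate; halving the two tail probability for a directional prediction is an assumption of the sketch, and the function name is hypothetical.

```python
import math


def chi2_p_df1(chi2, one_tailed=False):
    """Upper-tail probability of a chi square with df = 1.

    For df = 1, P(chi square > x) = P(|Z| > sqrt(x)) = erfc(sqrt(x / 2)).
    The one tail value is taken as half of this, assuming the observed
    difference lies in the predicted direction.
    """
    p = math.erfc(math.sqrt(chi2 / 2.0))
    return p / 2.0 if one_tailed else p


# The three chi squares reported above for the sex-response comparisons.
reported = [2.92, 7.27, 17.13]
one_tail_ps = [chi2_p_df1(x, one_tailed=True) for x in reported]
```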

The Tied and Slow Writing Inhibition 
groups should not be affected since they re- 
ceived normal Rorschach instructions. Chi 
square comparisons of the number of sex re- 
sponses given by these groups with the Nor- 
mal Control group were not significant as 
predicted. On the basis of these statistical 
comparisons it appears that instructions re- 
garding normality of sexual percepts tended 
to produce such percepts. Also, it appears that 
those subjects who were told to avoid sexual
material did so and therefore inhibited their 
activity insofar as we can tell from this in- 
direct evidence. 

In like manner we can test the effects of 
the slow writing instructions. The group in- 
structed to write slowly should copy signifi- 


cantly fewer words than any other group. Chi 
squares were all significant in the predicted 
direction at beyond the 2% level for a one 
tail test where df = 1 (χ² Slow Writing group
vs. Tied group = 4.20; χ² Slow Writing group
vs. Sex Inhibition group = 8.19; χ² Slow
Writing group vs. Sex Control group = 5.94;
χ² Slow Writing group vs. Normal Control group
= 4.20).


Human Movement


The preliminary checks disposed of, we can
now turn to the test of the major hypotheses.


The first hypothesis held that human move- 
ment responses should increase with inhibi- 
tion. During the tabulation it became obvious 
that it might be well to separate M+ from M 
or M— and test for each one. The first three
rows of Table 1 summarize the results of these
comparisons.

Only the results of the Normal Control vs. 
Sex Inhibition group on M+ are significant, 
but the other tests suggest tendencies in the 
same direction. Contrary to the hypothesis 
and the previous findings, good movement de-
creased with inhibition. The findings were
dissimilar for M—. There was an insignificant 
trend toward increase in M— in the Sex In- 
hibition and Sex Control groups. If such a 
trend were “real,” it seems most likely due to 
instructions about sex responses. Calculations 
for combined M+ and M— are not signifi-
cant but again showed the consistent decreas-
ing tendency.

The results so far definitely do not fit with
the previous findings but this may be because
of different statistical tests. The Meltzoff,
Singer, Korchin study (1953) used a t test
of the difference between the number of move-
ment responses before and after inhibition.
Since the number of responses and the num-
ber of determinants did not differ signifi-
cantly we can perform a similar test although
the tendency to increase in responses and de-
terminants casts some doubt on the validity of
such a test. The means and variances for
M responses are given in Table 2. The F tests
for homogeneity of variance showed signifi-
cant differences between the Control group
and all others except the Slow Writing group.
Under these circumstances the t test could
not be used and the Mann Whitney U test

TABLE 1
CHI SQUARE COMPARISONS OF EXPERIMENTAL GROUPS WITH CONTROL GROUP:
PERCENTAGE OF VARIOUS MOVEMENT RESPONSES GIVEN

                                       Slow                Sex         Sex        All
                              Control  Writing    Tied     Inhibition  Control    Experimental
                              Group    Group      Group    Group       Group      Groups

M+
  Median Percentage           25.5     19         20.5     15          19.5
  χ² with Control Group                N.S.       N.S.     6.27        N.S.       8.33
  P Value                                                  <.02                   (.10).05
M—
  Median Percentage           0        0          0        4.5         5.67
  χ² with Control Group                N.S.       N.S.     N.S.        2.18       N.S.
  P Value                                                              (.10).03
Total M
  Median Percentage           26.67    22         22.5     23          24
  χ² with Control Group                2.92       2.71     N.S.        N.S.       N.S.
  P Value                              (.10).05   .10
FM+
  Median Percentage           5        6          10       8           7
  χ² with Control Group                N.S.       4.50     1.73        N.S.       6.37
  P Value                                         <.02*    <.10*                  <.10*
FM—
  Median Percentage           1        5          4        2           2
  χ² with Control Group                1.69       4.66     N.S.        N.S.       N.S.
  P Value                              (.10)*.05  <.02*
Total FM
  Median Percentage           12       13         16.67    13          12.5
  χ² with Control Group                N.S.       2.37     N.S.        N.S.       N.S.
  P Value                                         (.10)*.05
m
  Median Percentage           1        6          4        7.2         5.5
  χ² with Control Group                2.63       6.32     11.30       3.30       13.61
  P Value                              (.10)*.05  <.01*    <.001*      (.05)*.02  <.01*
Total M, FM & m
  Median Percentage           42       42         42       42          39.5
  χ² with Control Group                N.S.       N.S.     N.S.        N.S.       N.S.
  P Value

* One tail test.


was substituted. Neither t nor U gave signifi-
cant results for any of the groups. The only
finding of importance was the increased vari-
ance in the experimental groups. It would
seem that inhibiting conditions may make
Rorschach subjects behave less consistently.
Meltzoff, Singer, and Korchin also made a
t test of the total amount of movement pro-
duced, and such a test was likewise made with
the data from this study. The means and vari-
ances for distribution of all movement re-
sponses are given in Table 2. Again, no sig-
nificant results were obtained but there was
a consistent trend toward more movement in
the experimental groups. This might be a
manifestation of the phenomenon reported by

TABLE 2
MEANS, VARIANCES, AND TESTS OF HOMOGENEITY OF THE DISTRIBUTIONS OF M RESPONSES
AND TOTAL MOVEMENT PRODUCTION

                                         Slow                Sex           Sex
                                Control  Writing    Tied     Inhibition    Control
                                Group    Group      Group    Group         Group

M Mean                          4.00     3.45       4.67     3.66          4.20
M Variance                      1.80     1.82       6.35     5.46          6.19
F test for Homogeneity of
  Variance with Control Group            1.00       3.52     3.03          3.44
  P Value                                N.S.       <.02     <.02          <.02
M, FM & m Mean                  6.50     7.21       9.11     7.02          [illegible]
M, FM & m Variance              7.24     7.03       15.11    [illegible]   13.44
F test for Homogeneity of
  Variance with Control Group            1.03       2.09     [illegible]   [illegible]
  P Value                                N.S.       N.S.     N.S.          N.S.


the earlier investigators, dulled to insignifi-
cance by alterations in experimental pro-
cedure. On the other hand it might be a re-
flection of the tendency to increase the num- 
ber of responses. After all, this combined 
movement score contained nearly half of the 
responses in the individual protocol accord- 
ing to the last row of Table 1. The results of 
the tests of FM and m will offer additional rea-
sons for these increases.
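
The F tests for homogeneity of variance in Table 2 can be illustrated with the M variances printed there. The sketch below assumes the test is the simple ratio of the larger variance to the smaller one; the helper name is hypothetical, and the ratios it produces agree with the tabled F values only to within rounding of the published variances.

```python
def variance_ratio(var_a, var_b):
    """F ratio for homogeneity of variance: larger variance over smaller."""
    larger, smaller = max(var_a, var_b), min(var_a, var_b)
    return larger / smaller


# M variances as printed in Table 2.
control_variance = 1.80
m_variances = {
    "Slow Writing": 1.82,
    "Tied": 6.35,
    "Sex Inhibition": 5.46,
    "Sex Control": 6.19,
}
f_ratios = {
    group: round(variance_ratio(v, control_variance), 2)
    for group, v in m_variances.items()
}
```

Only the Slow Writing group has a ratio near 1, which matches the statement that its variance did not differ from that of the Control group.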

In an effort to duplicate the Meltzoff, Singer, 
Korchin findings, the Slow Writing group was 
tested for relationship between the amount of 
inhibition shown and the number of M re- 
sponses. A fourfold chi square table was con- 
structed comparing the status of a subject on 
the movement distribution with his position 
in the inhibition distribution (number of 
words written). The results of the compari-
son were not significant.

Since the Meltzoff, Singer, Korchin studies
used only two Rorschach cards (III and VII
or VIII and IX), the number of movement
responses was tabulated for each card. In-
spection of the totals for each card revealed
differences of only one or two points. Use
of the entire Rorschach apparently did not
obscure the contribution of any individual
card.


Animate and Inanimate Movement 


The hypotheses regarding FM and m also 
predicted increases. Animal movement was 
scored for good and poor form. Tests were 
made utilizing these scores both separately 
and combined. Rows four through seven of 
Table 1 summarize these statistical tests. The 
tests for FM+ and FM— were far from uni-
versally significant, but all pointed toward in-
creases in these determinants under condi-
tions calling for inhibition. The combination
of FM+ and FM— reflected the increase but
none of the chi square tests using the com-
bined score were significant. While the com-
parisons involving animal movement were
hardly an imposing demonstration of the pre-
dicted relationship, they at least revealed con-
sistent trends which occasionally reached sig-
nificance.


The results for m were unequivocally posi-
tive, even for the Sex Control group where
inhibition, if produced, was an experimental 
artifact. 


The chi square test for all types of move- 
ment showed that there is no apparent differ- 
ence among groups. The contradictory trends 
demonstrated among the various types of 
movement could easily have cancelled each 
other out, thus rendering the use of this par- 
ticular combination of movement of ques- 
tionable value. 


DISCUSSION 


The hypothesis regarding human movement 
emerged from this study with little to support 
it. The most likely conclusion is that human 
movement responses are not affected in any 
consistent manner by the experimental pro- 
cedures used here. It is possible that good 
movement responses may actually decrease 
while poor responses show some slight tend- 
ency to increase when the possibility of sexual 
material is increased. Variability among per- 
sons must be expected. Providing that differ- 
ences in technique are not too great for gen- 
eralization and that the present results are 
not chance variations, it is possible that the 
increases observed by Meltzoff, Singer, and 
Korchin could be due in part to increased re- 
sponsiveness or to the effects of animal and 
inanimate movement. 

The results of the comparisons for FM and 
m, as a whole, support the predictions about 
them. The findings in regard to animal move- 
ment were not significant in many cases, but 
demonstrated trends in the direction hy- 
pothesized. 

It will be recalled that Klopfer holds ani- 
mal movement to be indicative of awareness 
of impulses in which the person does not 
necessarily indulge. By itself, however, it does 
not necessarily imply inhibition. Klopfer in- 
terprets animal movement to mean frustration 
only in the absence of CF and without an 
optimal relationship to M (Klopfer et al.,
1954, p. 266). The inconsistent findings may 
result from the interference of these other 
variables. The possibility of an interrelation- 
ship with color responses was demonstrated 
by Singer and Spohn (1954) in regard to M. 

Klopfer interprets m as indicative of ten- 
sion and conflict due to efforts at inhibiting 
the need and integrating it into behavior. The
present results support such a prediction. The 
fact that the Sex Control group also gave in-
creased m is not necessarily inconsistent, for
nothing about the experimental design assures 
that tension and conflict are decreased by the 
suggestion to perceive sexual content. 


SUMMARY 


On the basis of previous studies and extra- 
polation therefrom, it was hypothesized that 
any situation calling for inhibition should in- 
crease the amount of human movement, ani- 
mal movement, and inanimate movement on 
Rorschach protocols. To test these hypotheses, 
five groups of college students were used. To-
tal N = 93. In one experimental group inhibi-
tion was induced by requesting the subjects 
to copy a paragraph in long hand as slowly 
as they could. In another, inhibition was in- 
duced by binding the subject’s dominant arm 
to his chair. A Behavior Inhibition group was 
instructed that it was normal to see sexual 
things on the test but was asked to avoid 
giving such percepts. Another group was em- 
ployed as a control to see if the suggestion 
that sexual responses were normal had any 
effect on protocol. A fifth group was a control 
group for whom all conditions were normal.
The predicted increases in human movement 
percepts were not observed. In fact there was 
an insignificant tendency toward a decrease 
of good human movement. The findings in re- 
gard to animal movement were in the pre- 
dicted direction, but were not ordinarily sig- 
nificant. The results of inanimate movement 
were unequivocally positive, validating both 
the present hypothesis and Klopfer’s inter-
pretation of m responses. The failure of pre-
dictions may have been due to changes in ex- 
perimental procedure or to a general increase 
in responsiveness and variability under ex- 
perimental conditions. 


REJECTION OF FALSE INFORMATION ABOUT ONESELF
AS AN INDICATION OF EGO IDENTITY 


WALTER GRUEN 1
Beth Israel Hospital, Boston, Massachusetts 


In 1950 Erikson described personality de- 
velopment as a series of stepwise changes in 
the ego from birth to death. Each of his eight 
stages of development was defined as a crisis 
point which the maturing ego could solve in 
one of two polar directions, so that one of 
the solutions would always lead to a widening 
of ego boundaries and hence towards further 
growth.

Since then several studies have been made 
or been started to either verify the theory or 
to use Erikson’s concepts in an exploration of 
individual differences alongside other system-
atic dimensions. Gruen’s 2 and Yufit’s (1956)
studies are examples of both of these. A fur- 
ther attempt was made in the present study 
to demonstrate a connection between behav-
ior predicted from Erikson’s theory and the
particular personality integration one had 
reached during the ego identity stage of de- 
velopment. 

Erikson (1950) defines ego identity as fol-
lows:


The growing and developing youths, faced with 
this physiological revolution within them, are now 
primarily concerned with attempts at consolidating 
their social roles. They are sometimes morbidly, often 
curiously, preoccupied with what they appear to be
in the eyes of others as compared with what they 
feel they are and with the question of how to con- 
nect the earlier cultivated roles and skills with the 
ideal prototypes of the day. . . . The integration now 
taking place in the form of ego identity is more than 
the sum of the childhood identifications. It is the
inner capital accrued from all those experiences of 


1 Formerly at University of Buffalo. 

2 There is under preparation by the research staff 
of the Committee of Human Development of the 
University of Chicago a book on the adult person in 
urban life in which appears a chapter by W. Gruen: 
The Investigation of Adult Personality Through an 
Experimental Application of Erikson’s Theory of 
Ego Development. 


each successive stage, when successful identifications 
led to a successful alignment of the individual’s basic 
drives with his endowment and his opportunities. 
. . . The sense of ego identity, then, is the accrued
confidence that one’s ability to maintain inner same- 
ness and continuity (one’s ego in the psychological 
sense) is matched by the sameness and continuity of 
one’s meaning for others (p. 216). 


We followed Erikson’s assumption that ego 
identity once achieved results in a more stable 
personality organization which is visible in 
behavior. Hence the hypothesis was advanced 
that a person with ego identity will show this 
accrued stability of a now integrated role pat- 
tern by rejecting evaluations of himself by 
others, if these evaluations do not coincide 
with his own crystallized notions about him- 
self. Similarly, a person still casting about for 
his various identities and roles and values, 
who shows signs of ego diffusion, will be more 
prone to accept others’ evaluation of himself 
even if they may have no demonstrable rela-
tionships with any of his fleeting self-images.
In fact, he is prone to grasp them like a 
drowning man reaching for a straw, especially 
if such evaluations come from status sources.

In an effort to obtain a more objective 
measure of ego identity than the ratings based 
on interviews and projective data first em- 
ployed by Gruen (see Fn. 2) and Yufit (1956) 
it was decided to utilize the real-ideal Q sort 
discrepancy correlation score developed by 
Butler and Haigh (1954). Support for this
choice of measure comes from Erikson’s defi-
nition of ego identity as the acceptance of a
stable set of roles which then become the
ideals of the individual. The acquisition of
standards and their full acceptance into the
ego would then reduce discrepancies between
idealized attributes and attributes seen as
characteristic of oneself.
Further indications for the rationale of this
measure here come from Turner and Vander- 
lippe’s (1958) comparison between real-ideal 
correlations and various indices of adjustment 
including sociometric ratings strongly sugges- 
tive of Erikson’s ego identity syndrome. A 
tentative validation was also attempted here 
by constructing 14 questionnaire items around 
the theme of ego identity versus ego diffusion. 
A total ego identity score on the 10 items on 
which 45 subjects (Ss) distributed themselves 
evenly, correlated —0.42 with real-ideal dis- 
crepancy correlations on the Q sort. 

The specific hypothesis tested in this study 
was as follows: A high self-ideal discrepancy 
as an indication of low ego identity would be 
related to an uncritical acceptance of a fake 
personality sketch which presumably was a 
realistic appraisal for the S. 


METHOD AND RESULTS 


Forty-five summer session students 18-24 
years old were asked to sort the 100 Q sort 
items in the usual manner, using first real self 
and then ideal self instructions. When they 
reassembled after one week, they were given 
the above mentioned questionnaire and were 
then handed individually typed personality 
sketches. They were told that these sketches 
were tentative conclusions from the testing of 
the previous week. The fake sketches were all 
identical in content but not in form to pre- 
vent discovery of the former while glancing 
at one’s neighbor. The sketch consisted of 13 
statements which Forer (1949) had derived 
from astrology magazines and had used as a 
pedagogic device in the classroom. 

To allow for rejection of the sketch, the Ss
were told that the original test was a rela-
tively new and untried one and that we
wanted an opinion on how well our conclusions
agreed with their own ideas as a way of
learning more about the test. On an at-
tached sheet they were asked to rate on 10-
point scales (a) how effective our test had
been in revealing personality and (b) how re-
vealing our sketch had been.


The correlation scores between the two Q 
sortings resulted in a range of scores from 
+.87 to —.43, with a mean of +.43 and a 
sigma of .29. Looking at the spread of ratings 
on the two 10-point scales derived from Forer 
(1949), two thirds of the Ss were distributed
among the upper four points or “acceptance” 
poles. The correlation between the “effective” 
and the “revealing” ratings was so high (0.80) 
that only the “revealing” score was used in 
the final analysis because it also was the one 
score appropriate to the hypothesis. 

To test the study hypothesis the correlation 
between the sortings by each S was correlated 
with his “reveal” reaction to the fake person- 
ality sketch. Since the spread of ratings on 
the latter measure occurred within a small 
range of points around a mean of 8.0, a chi
square was also computed between the inde- 
pendent and dependent variable by splitting 
each distribution into two halves at the me- 
dian. 

The correlation between the correlation for 
the two Q sorts and the degree of acceptance 
of the fake sketch was —0.45. A correlation 
of —0.39 for this N occurs less than once in
a hundred times by chance. The corollary chi square
resulted in a value of 3.92, significant at 
the 0.05 level. Therefore, the hypothesis about 
the behavioral visibility of high and low ego 
identity is confirmed, if ego identity is indeed 
a useful concept and if it is adequately meas- 
ured by a real-ideal discrepancy in attitudes 
about oneself. 

The correlation between the “reveal” rat- 
ings and the score based on the aforemen- 
tioned ego identity questionnaire items was 
—.26. This figure is not quite significant at 
the .05 level, but is in the expected direction. 
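
The significance statements about these correlations can be checked by converting r to t with N - 2 degrees of freedom. In the sketch below the function name is hypothetical, and the two tail critical values for df = 43 are approximate figures read from standard t tables, not values given in the report.

```python
import math


def t_from_r(r, n):
    """t statistic for testing a correlation against zero, with df = n - 2."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))


N = 45
# Approximate two tail critical values of t for df = 43 (assumed from standard tables).
T_CRIT_05 = 2.017
T_CRIT_01 = 2.695

t_sketch = t_from_r(-0.45, N)         # Q sort correlation vs. acceptance of the sketch
t_threshold = t_from_r(-0.39, N)      # value said to occur less than once in a hundred
t_questionnaire = t_from_r(-0.26, N)  # "reveal" ratings vs. questionnaire score
```

On these assumptions the first two values exceed the .01 criterion in absolute size, while the questionnaire correlation falls short of the .05 criterion, in agreement with the text.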


DISCUSSION


The findings are first of all a verification 
of one part of Erikson’s theory, insofar as the 
demonstrated connections between the dis- 
crepancy of real-ideal self and the willingness 
to accept any assertions about oneself were 
derived from the theory. The results also have 
implications for a better understanding of 
the uncertainty behavior of adolescents and 
young adults. It would not be too farfetched 
to speculate that fad changes, vocational win-
dow shopping and the indiscriminate parade
of dating partners are all related to willing- 
ness to accept glib statements about oneself 
and hence manifestations of ego diffusion, 
just as the opposites might be indications of 
ego identity. If self-ideal discrepancy is in- 
deed a good and valid measure of this dimen- 
sion, we might expect it to correlate with ob- 
jective measures of these behavior patterns 
typically found in many adolescents. 

One disappointing note is the lack of a 
significant relationship between degree of ac- 
ceptance of the fake sketch and the a priori 
questionnaire items which were devised to 
measure ego identity. Either the questionnaire 
medium is not a good one to catch a rela- 
tively complex dimension like ego identity, or 
both of these two measures have so much 
variance specific to them, that they in turn 
correlate with the self-ideal discrepancy cri- 
terion, but not with each other. A factor-ana- 
lytic investigation of an expanded question-
naire for Erikson’s eight ego stages on several
hundred Ss in all age ranges is now in process for
a future report to answer some of these ques- 
tions. 


The present study is based upon the
premise that more should be known about 
the process of clinical judgment and psycho- 
logical evaluation. Asch’s original studies 
(1946) are to be credited with much of what- 
ever current interest exists in the general 
problem of how people form conceptions of 
each other. Shontz (1956) has stressed else-
where the need for focalizing this interest
upon the specific diagnostic processes of the 
clinical psychologist. 

Some of the basic questions which require 
empirical investigation are: 


1. To what extent may psychologists be 
expected to agree on certain basic psycho- 
diagnostic conceptions? 

2. To what extent is the diagnostic process 
influenced by diagnostic prototypes or pre- 
conceptions? 

3. Is there a common diagnostic process 
which is consistent regardless of the indi- 
vidual psychologist, or of the individual pa- 
tient in question? 

4. If such a common diagnostic process 
exists, to what extent is it influenced by the 
kind of information upon which it is based 
and by the order in which this information 
is obtained? 

5. To what extent may psychologists work- 
ing independently with identical data be ex- 


1 The senior author is indebted to D. W. Miles,
W. M. Taylor, M. Horowitz, and C. Waltner of
Western Reserve University for advice and guidance
in completing the dissertation upon which this re-
search is based. Thanks are also due to Cleveland
State Hospital for the use of its facilities.

2 The senior author is now at the Department of 
Marriage Conciliation, Court of Common Pleas, 
Cuyahoga County, Cleveland, Ohio, and the junior
author at the University of Kansas. 


pected to agree in their final diagnostic con- 
clusions? 


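The present authors regard these questions as
involving at least three classes of variables:
those related to the characteristics of the patient,
those related to the characteristics of the psychologist,
and those related to the type of patient data which
is made available to the psychologist and the order
of their presentation to him.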

The present study was limited to examin- 
ing the effects of variables of the third class, 
holding relatively constant the relevant pa-
tient and psychologist factors.


METHOD 
Measurement 


A Q sort instrument was made up of 54 items 
selected by two psychologist judges from a list of
120 items constructed by Stephenson to reflect
Rorschach features describing various aspects of 
personality (Beck, 1954). Items were selected to 
be most consonant with the clinical picture of para-
noid schizophrenia.3 In describing a patient or a
diagnostic concept, each psychologist rater was re- 
quired to sort the final 54 items into a quasinormal 
distribution with the following frequency categories: 
1, 2, 4, 6, 9, 6, 4, 2, 1. Usual methods were employed 
to convert these data into statistically appropriate 
quantitative form (Stephenson, 1955). 


Raters 


The raters were 24 psychologists, 16 of whom 
were regular staff members of psychology depart- 
ments of either state or county hospitals. Three of 
the 16 were department heads. Three other raters 
worked for institutions offering primarily counseling 
or advisory services. Five were either students or 
interns, and 1 was in private practice. Eight raters 
were female, 17 male. Eleven were PhDs in psy- 
chology and 11 others had completed virtually all 
requirements for the doctorate except for the dis- 
sertation; the remaining 2 had completed at least 
two years of appropriate graduate work. The age
range of the raters was 23 to 58, with an average 
of 33.5 years. Their clinical experience averaged 
slightly over 6 years, with a range of 1 to 19 years. 
All but one of the raters had received or were 
receiving their graduate education at the same uni-
versity. The examiner served as an additional
rater of all patients. At the time of the research he 
had completed three years of graduate study, had 
five years’ experience, and was 38 years of age. 


Patients 


Twenty-five Ss were selected from the inpatient 
population of a local state hospital. All patients had 
been assigned the diagnosis of paranoid schizo- 
phrenia, and the group consisted entirely of males. 
Age range of this sample was 28 to 62 years 
(median: 41 years); and length of time in the hos- 
pital ranged from 5 to 76 months (median: 38 
months). All but seven patients had been in mental 
hospitals one or more times previously. 

One of the patients served as a special control 
patient. The patient was 41 years of age, had been 
hospitalized 63 months, with no previous hospitali- 
zations, and had 10 years of education. 


Procedures 


a. Each rater was first asked to use the Q sort 
instrument to describe his conception of the proto- 
typic male paranoid (Pd), catatonic (Ct), and hebe- 
phrenic (Hb) schizophrenic. Homogeneity of these 


3 A table giving the complete Q sort has been 
deposited with the American Documentation Insti- 
tute. Order Document No. 6286, remitting $1.25 for 
35 mm. microfilm or $1.25 for 6 by 8 in. photo- 
copies. 


prototypic descriptions was evaluated by correla- 
tional methods, and composite Q sort descriptions 
of the three relevant diagnostic categories were 
established from the ratings. Reliability of the in- 
strument was partially established by requiring eight 
of the raters to repeat their descriptions of the 
paranoid schizophrenic. 

b. All raters (except E) were exposed to case 
study information on the control patient. Informa- 
tion was presented in four increments, and raters 
made Q sort descriptions of the patient after each 
increment. The increments, in order of presentation, 
were: 


I. Statistical data: patient’s code number, birth- 
date, date of hospital admission, education, occu- 
pation, marital status, religion, race, sex, source 
of admission, number of previous hospitalizations. 

II. Interview: tape recording of a thirty-minute, 
semistructured interview with the patient. All in- 
terviews were conducted by E to control for skill 
and style of interviewing. 

III. Psychological report and test protocols: a 
psychological report based on a minimum battery 
consisting of the Rorschach and the Wechsler- 
Bellevue (Form I) Vocabulary, Block Design, and 
Similarities subtests. In most cases this minimum 
battery was supplemented by various other tests, 
selected by the examining psychologist. Testing 
and report writing were done by regular members 
of the hospital staff in their ordinary course of 
duty. 

IV. Social case history: a full report made by 
the psychiatric social worker on the basis of in- 
formation obtained from an informant, usually a 
close relative of the patient. 


Increments were presented in the order listed 
above, and this order of presentation was considered 
standard, since it represents the usual process of 
clinical examination in the institution. Data on 
actual diagnostic conclusions were not made avail- 
able to the raters. 

Q sort descriptions obtained from all raters on the 
control S were treated statistically to investigate 
changes from each step to the following steps of 
the procedure. 

Q sort descriptions were also correlated with the 
composite prototypic diagnostic concepts obtained 
in Step a to determine the relative influence of these 
concepts at varying stages in the diagnostic process. 

c. Since Step b does not enable one to distinguish 
whether changes in conception are a function of the 
type of information or of the serial order in which 
the information is presented, each rater (except E) 
was next assigned a separate subject from the sample 
of patients. The procedure of Step b was repeated 
except that information increments were presented 
in orders which differed systematically from rater 
to rater. 

Intercorrelations between successive descriptions 
provided data for analysis of variance procedures 
which permitted the evaluation of serial effects and 
information sources as contributors to the final Q 
sort descriptions. E also reviewed all available data 
on each patient and provided Q sort descriptions of 
each, based upon his full knowledge of the case 
material. Correlation of these descriptions with those 
provided by the other raters provided further 
reliability data. 


TABLE 1 
Q SORT ITEMS MOST AND LEAST CHARACTERISTIC OF THE PROTOTYPIC DIAGNOSTIC CONCEPTIONS 

Paranoid Schizophrenic 
  Most characteristic items: 
     9  Projects (ascribes his own unpalatable or painful wishes 
        or fears to persons or objects in his world) 
    13  Delusional thoughts, systematized 
    17  Characterized by intellectualization 
  Least characteristic items: 
    14  Inattentive to point of distractibility 
     2  Thought processes are accidental, tangential 
     3  Hallucinatory, with much ego disorganization 

Catatonic Schizophrenic 
  Most characteristic items: 
    37  Blocks out external perceptions and reacts to stimuli 
        from within 
    46  Characterized by stereotypy in action 
    50  Affect is blocked (i.e., as a defense) 
  Least characteristic items: 
    21  Breadth of interest spontaneity is increased (i.e., what 
        might be called undirected spontaneity) 
    13  Delusional thoughts, systematized 
    17  Characterized by intellectualization 

Hebephrenic Schizophrenic 
  Most characteristic items: 
     3  Hallucinatory, with much ego disorganization 
     2  Thought processes are accidental, tangential 
     6  Much intrusion of accidental material 
  Least characteristic items: 
    17  Characterized by intellectualization 
    18  Hallucinatory, but with little ego disorganization 
        (i.e., low level regressed ego disorganization) 
    54  Overconventional 


RESULTS 


The homogeneity values (average interjudge 
r’s) for the composite diagnostic prototypes 
were .47 for Pd, .40 for Ct, and .54 
for Hb (all values significant at the .01 level). 

The intercorrelations of the composite pro- 
totypes derived from these descriptions were 
as follows: Pd vs. Ct, -.27 (not statistically 
significant); Pd vs. Hb, -.54 (significant at 
P < .01); Ct vs. Hb, 0.00. Some of the items 
considered by the raters to be most and least 
descriptive of each composite prototype are 
presented in Table 1. 

Sort vs. re-sort reliabilities for the eight 
raters who repeated their descriptions of the 
Pd prototype ranged from .74 to .92, with a 
mean of .86. 


Median intercorrelations between successive 
Q sort descriptions of the control patient 
are presented in Table 2. Nonparametric 
analysis of variance, using methods suggested 
by Wilcoxon (1949), showed that these correlations 
could be separated into three statistically 
homogeneous subgroups, as indicated 
by the code letters assigned to each coefficient 
in Table 2. 

Correlations of the diagnostic prototypes 
with Q sort descriptions of the control patient 
provided after the presentation of each increment 
of case study information are presented 
in Table 3. 
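
The note to Table 3 states that the tabled values are means derived by way of the r to z transformation. A minimal sketch of that averaging step is given here, assuming the standard Fisher transformation (z = arctanh r); the function name and the example correlations are hypothetical and are not taken from the study.

```python
import math


def mean_correlation(rs):
    """Average a list of correlations via Fisher's r-to-z transformation.

    Each r is converted to z = atanh(r), the z values are averaged,
    and the mean z is converted back to r with tanh.
    """
    if not rs:
        raise ValueError("at least one correlation is required")
    zs = [math.atanh(r) for r in rs]  # r -> z (undefined for |r| = 1)
    mean_z = sum(zs) / len(zs)
    return math.tanh(mean_z)          # z -> r


# Hypothetical correlations between one prototype and four raters' sorts.
example_rs = [0.85, 0.78, 0.72, 0.81]
print(round(mean_correlation(example_rs), 3))
```

Averaging in z units rather than averaging the raw r values reduces the bias that arises because the sampling distribution of r is skewed when correlations are large.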


Table 4 presents the 
coefficients between successive 


TABLE 2 
MEDIAN CORRELATION BETWEEN SUCCESSIVE Q SORT DESCRIPTIONS OF THE CONTROL PATIENT 

Serial Positions    Median 
Compared            Correlation    Code 
I vs. II            .58*           A 
I vs. III           (illegible)    A 
I vs. IV            .50            A 
II vs. III          .70            B 
II vs. IV           .65            B 
III vs. IV          .77            C 

* All values significant at .01 level. 
Note. Code letters indicate coefficients which fall into 
statistically homogeneous subgroups. 

TABLE 3 
AVERAGE INTERCORRELATIONS BETWEEN COMPOSITE DIAGNOSTIC PROTOTYPES 
AND Q SORT DESCRIPTIONS OF CONTROL PATIENT 

            Composite      Composite      Composite 
Serial      Paranoid       Catatonic      Hebephrenic 
Position    Prototype      Prototype      Prototype 
I           .80**          .00            -.26 
II          .65**          .07            -.28* 
III         (illegible)    -.15           -.21 
IV          (illegible)    -.21           -.22 

* Significant at .05 level. 
** Significant at .01 level. 
Note. All values are means, derived by way of r to z 
transformation. 


TABLE 4 
MEDIAN CORRELATIONS BETWEEN Q SORT DESCRIPTIONS AS A FUNCTION OF 
TYPE OF CASE STUDY INFORMATION AND SERIAL POSITION IN WHICH 
INFORMATION WAS PRESENTED 

                        Serial Position Intercorrelations 
Type of Information     I x IV    II x IV    III x IV 
Statistical data        .46*      .70        .82 
Interview               .72       .82        .91 
Psychological report    .70       .81        .86 
Case history            .62       .67        .82 

* All values significant at .01 level. 
Note. Final description (Serial Position IV) taken as basis 
for comparison of serial effects. 


of variance of the raw data upon which Table 
4 was based showed significant differences to 
exist between serial positions (p less than 
.01) and nearly significant differences to exist 
between types of case study information. The 
interaction was not statistically significant. 
Correlations between the raters’ final de- 
scriptions of their patients and E’s descrip- 
tions of these same patients ranged from a 
low of .28 to a high of .79. All but the lowest 
correlation were significant at the .01 level. 
The lowest correlation was found between E 
and the single judge who was not trained at 
the same university as the rest of the raters. 


DISCUSSION 


The present study showed that the group 
of psychologists examined possessed significantly 
homogeneous prototypes of the three 
diagnostic categories, Paranoid, Catatonic, 
and Hebephrenic Schizophrenia. Homogeneity 
values were highest for the Hb prototype, 
indicating least controversy with respect to 
the characteristics of this group of patients. 
Since hebephrenics are often considered to 
be the most deteriorated patients, this finding 
lends support to Shontz’s observation (1954, 
pp. 83-87) that the more obviously abnormal 
a patient is, the easier psychologists find it 
to agree upon a descriptive diagnosis. It is 
also possible that the relatively lower homo- 
geneity value found for the Ct prototype was 
due to the concentration of some raters upon 
phenomena associated with catatonic stupor, 
while other raters may have been more con- 
cerned with the excited states that often char- 
acterize this type of patient. It may be that 
differences in homogeneity of diagnostic pro- 
totypes are related to the “coherence” of the 
diagnostic category itself. Thus, for example, 
one might also expect less homogeneity to 
exist for the category “manic depressive psy- 
chosis” than for the category “depression” 
alone. 

The rather moderate homogeneity values 
for all the diagnostic prototypes leave 
considerable room for disagreement between 
raters on specific behavioral or dynamic 
items. They suggest the possibility that for each 
prototype there are probably a few items 
which are consistently placed in specific Q sort 
categories, but that many other items exist 
which may be placed almost anywhere without 
destroying the basic integrity of the diagnostic 
picture. Future studies might well be 
directed toward the isolation and specification 
of such crucial items for the various 
diagnostic categories. 

It is obvious from the intercorrelations between 
composite diagnostic prototypes that 
the catatonic prototype is independent of the 
other diagnoses. The negative correlation between 
Pd and Hb, however, suggests that 
these are thought of by this group of psychologists 
as rather opposite diagnoses. Examination 
of the items in Table 1 reveals 
the emphasis upon intellectualization in the paranoid and 
upon intellectual disorganization in the hebephrenic. 

It is interesting to note in Table 3 how 
these diagnostic prototypes correlated with 
Q sort descriptions of the control patient. It 
is quite apparent that all raters identified the 
patient as a paranoid schizophrenic even on 
the basis of the statistical data alone (in 
Serial Position I). Doubtless this was de- 
duced from the raters’ knowledge of the sheer 
frequency of occurrence of this diagnosis in 
state hospitals, although it is also possible 
that the statistical data itself suggested 
specific things that led to this hypothesis. Of 
equal interest is the observation that the 
paranoid prototype tended to exert relatively 
less influence on the conceptualizations after 
the interview was heard, and then regained 
its high correlation with the Q sort descrip- 
tions as more case study material was added. 
It is striking that the final descriptions cor- 
related almost exactly as high with the diag- 
nostic prototype as did the initial descrip- 
tions, based on relatively little information. 
There is the strong suggestion in these data 
that the diagnostic prototype did, in fact, 
serve as an hypothesis which was established 
early in the procedure and which tended 
generally to become more firmly entrenched 
as confirming case study material became 
available. The slight, though steady, trend 
away from the Ct prototype, which is also 
evident in these data, is also of interest and 
suggests that a definite process of differential 
judgment was operative in the psychologists’ 
descriptions. It seems to imply, in short, that 
the reactivation of the paranoid prototype in 
Serial Position IV was not the result of sheer 
stereotypy of the psychologists’ thinking but 
that this reactivation did arise from a process 
of considered discrimination. 

Table 2 shows a little more clearly how 
this process took place. Each Q sort description 
was found to correlate significantly with 


were also taking place 
relatively less following Serial Position II, 
and they were least following Serial Position 
III. It is therefore obvious that the psycholo- 
gists were being influenced by the informa- 
tion presented them. It is also obvious, how- 
ever, that they were not sufficiently influenced 
to alter significantly their basic conception of 
the control patient as a paranoid schizo- 
phrenic individual. 

Three questions arise about these conclu- 
sions, and all three are answered in Table 4. 
The first question is whether these results 
might not be a function of the individual 
patient selected. Since the data in Table 4 
are based upon Q sort descriptions obtained 
on 24 patients, and since the serial effects 
are the same (that is, for each type of case 
study data, the I vs. IV correlation is always 
less than the II vs. IV correlation, which in 
turn is always less than the III vs. IV cor- 
relation, yet all values remain significant at 
the .01 level), it is clear that the individual 
patient was not the determining variable. A 
conception of the patient was consistently 
formed early in the diagnostic process and 
this conception tended to remain holistically 
the same. More detailed information altered 
the conception only in its more specific as- 
pects. The second question is whether the 
trends observed are a function of the order 
in which case study information was pre- 
sented to the raters. Since the results of 
Table 4 are based upon all possible orders of 
presentation of the data, and since the trends 
remain the same, the answer to this question 
is obviously that it is not a function of the 
order of presentation of information. The 
third question is whether the phenomenon, 
which has been called the serial position ef- 
fect, is dependent in any way upon the type 
of case study information used. Although the 
interaction in Table 4 was not found to be 
statistically significant, there was a very defi- 
nite tendency for general correlation values 
to be lowest when statistical data were pre- 
sented first and highest when the interview 
was presented first. That is, the initial con- 
ceptions of the patient underwent more 
change over-all in the former situation, but 
the serial position effect was constant never- 
theless. Apparently, the most stable initial 
conceptions were induced by immediate exposure 
to a recording of the patient himself, 
a fact which suggests that written material, 
no matter how detailed, is not a complete 
substitute for actual contact with the patient, 
where psychological diagnosis is concerned. 

One final question remains unanswered, 
i.e., to what extent the final patient descriptions 
provided by the raters are unique functions 
of the raters themselves. The generally 
significant correlations between E’s Q sort 
descriptions of the 24 Ss and the descriptions 
provided by the raters are convincing evidence 
that interjudge reliability of these conceptions 
is generally satisfactory. It is impossible to 
tell from the present study whether the single 
low interrater reliability which was found is 
actually a function of differences in education 
as such, or whether it represents another, 
perhaps deeper, type of personal difference 
between E and the other psychologist in question. 
A number of factors could be responsible, 
among which could be (a) the effects 
of possible emotional involvement of the psy- 
chologist with the patient, and (b) the effects 
of the rater’s general mode of approach to 
the data available. (For example, it is pos- 
sible that the presentation of information, 
inconsistent with an initial diagnostic concep- 
tion, may induce one psychologist to alter 
his judgments completely, while another 
psychologist might prefer to reject the contra- 
dictory data and retain earlier judgments.) 
As previously mentioned, the restrictions of 
the present study with respect to many im- 
portant patient and psychologist parameters 
are recognized. Despite these limitations, it 
is fair to say that the findings suggest several 
basic principles which can, and should, be 
tested further on more widely varying groups 
of Ss and professional raters. 


SUMMARY 


A Q sort investigation was made of clinical 
psychologists’ conceptions of three specific 
diagnostic categories (paranoid, catatonic, 
and hebephrenic schizophrenia) and of the 
process of concept formation which is in- 
volved in arriving at descriptive diagnosis of 
individual patients. Twenty-five psychologists 
served as raters and 25 male paranoid schizo- 
phrenics served as Ss. Case study data were 
presented to the psychologists in such a way 
as to make possible the evaluation of the 
effects of four types of clinical information 
and the effects of serial position, or order, in 
which this information was presented. 

It was found that the psychologists em- 
ployed possessed significantly homogeneous 
prototypic diagnostic conceptions, and they 
tended to view paranoid and hebephrenic 
schizophrenia as somewhat opposite disease 
entities; catatonic schizophrenia appeared as 
a statistically independent conception. 

A consistent and significant serial position 
effect was found in the diagnostic process of 
actual patients. It was apparent that psy- 
chologists formed a fairly complete clinical 
picture of the patient quite early in the diag- 
nostic process and that this basic conception 
changed in its details but not in its basic 
configuration as more information was pro- 
vided to the raters. The most stable initial 
clinical conceptions were formed when re- 
corded interviews with the patient were pre- 
sented first. 

Evidence was also found that individual 
differences in psychologists may, in certain 
instances, have profound effects upon the re- 
liability of the diagnostic process. It is not 
yet certain what type of individual differ- 
ences are most important in this regard, al- 
though it is possible that the psychologist’s 
education, his emotional involvement with 
the patient, or differences in the degree to 
which he is willing to be influenced by contradictory 
information, play a part in the process. 
Further research on these phenomena is 
strongly recommended. 
WECHSLER-BELLEVUE PERFORMANCES OF REMITTED AND 
UNREMITTED ALCOHOLICS, AND THEIR 
NORMAL CONTROLS 
The study presented here is part of a 
larger research project investigating person- 
ality variables involved in remission from 
alcoholism. Because of obvious differences be- 
tween remitted and unremitted alcoholics 
simply on a functioning level, as well as in 
their responses to similar opportunities for 
recovery and their ability to adapt to a new 
role, it was hypothesized that some part of 
a remitted group’s recuperative characteris- 
tics might be reflected in their intellectual 
performances on a standard intelligence test. 

Prior studies (Halpern, 1946; Kaldegg, 
1956; Peters, 1956) have indicated that alco- 
holic populations which were available for 
study generally earned IQs higher than the 
general adult population. In addition, these 
studies report few total test or subtest devia- 
tions from normal population intellectual 
parameters. However, there is little informa- 
tion available on subgroups in alcoholism. 
This paper bears on the aforementioned 
studies and presents data from an additional 
population of “alcohol-freed” subjects (Ss). 


PROCEDURE 


Intelligence Scale for 
Adults (Form I) was administered to a group of 
in remission (active AA 
of 2 years of sobriety) ; 
mitted alcoholics (never 


55 years; for the 
controls, 28 to 54. Th 


niversity of New York 


were distributed over a diversity of ethnic back- 
grounds, including Anglo-Saxon, German, and East- 
ern European. The average highest school grade 
reached was 12 for the remitted alcoholics, 11 for 
the unremitted group and 11 for the normal controls. 
The remitted group’s education ranged from 8 to 
20 school years; the unremitted 8 to 18; and the 
controls 8 to 18. The two experimental groups were, 
in addition, equated for duration and intensity of 
alcoholism, and duration of exposure to Alcoholics 
Anonymous. The average duration of alcoholism in 
the remitted group was 19.27 years and 19.39 years 
in the unremitted group. The duration of alcoholism 
ranged from 6 to 29 years in the remitted group 
and from 6 to 38 years in the unremitted group. 
The average number of years contact with AA was 
5.39 for the remitted group and 5.26 for the 
unremitted Ss. Neither of the experimental groups, 
at the time of testing, contained members who were 
currently hospitalized or in clinic attendance. 

The weighted subtest means, and Verbal, Performance, 
and Total IQ means were subjected to 
analyses of variance. In addition, each S in each 
group was assigned a scatter index score on the 
basis of 11 subtests. The scatter index is computed 
by calculating the mean weighted score of the 11 
subtests. The total deviation from this mean, 
disregarding sign, is the index. The Rank-Sums test 
(Senders, 1958) was used to check the significance of the 
differences found between the groups on this 
variable. 
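
The scatter index described above can be sketched in a few lines. This is an illustration only, assuming that "total deviation, disregarding sign" means the sum of absolute deviations of the weighted subtest scores from their own mean; the function name and the example scores are hypothetical.

```python
def scatter_index(weighted_scores):
    """Sum of absolute deviations of subtest scores from their mean.

    The text computes the index on 11 weighted Wechsler-Bellevue
    subtest scores per subject; this sketch accepts any non-empty list.
    """
    if not weighted_scores:
        raise ValueError("at least one subtest score is required")
    mean_score = sum(weighted_scores) / len(weighted_scores)
    return sum(abs(score - mean_score) for score in weighted_scores)


# Hypothetical weighted scores for one subject on 11 subtests.
example_scores = [14, 13, 10, 11, 13, 14, 9, 12, 11, 10, 9]
print(round(scatter_index(example_scores), 2))
```

A perfectly flat profile yields an index of zero, and the index grows as the subtest scores spread further from the subject's own mean.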


RESULTS AND DISCUSSION 


The mean subtest scores and mean Verbal, 
Performance, and Total Scale IQs for each 
of the three groups are given in Table 1. 
The range of Total Scale IQs for the re- 
mitted group is 99-139, for the unremitted 
group, 105-134, and for the normal controls, 
99-139. The mean Total Scale IQ for each 
group is above the average for the general 
adult population. This finding for alcoholics 


is consistent with the findings of previous 
studies. 


The analysis of variance of each set of 
mean values revealed only one significant 
F ratio and that on the Arithmetic subtest, 
at the .01 level of confidence. The subsequent 
t tests between the three groups on this vari- 
able revealed significant differences between 
the remitted and unremitted alcoholics and 
between the unremitted alcoholics and the 
normal controls, both at the .01 level. In both 
cases the unremitted group had the lower 
mean. 

Thus, our two alcoholic populations reveal 
but one significant difference on the Wechsler- 
Bellevue variables investigated. The higher 
performance of the remitted as compared 
with the unremitted alcoholics in Arithmetic 
may reflect a more effective concentration 
consistent with the better stabilized sobriety 
of the remitted group and with their more 
disciplined adaptation to the 12-step program 
of Alcoholics Anonymous. It should be noted, 
however, that the running of 14 analyses of 
variance in such a homogeneous set of sub- 
samples increases the chance finding of sig- 
nificant differences and therefore decreases 
confidence in this subtest difference. 


TABLE 1 

WECHSLER-BELLEVUE SUBTEST AND TOTAL IQ MEANS FOR THE THREE GROUPS 

                       Remitted    Unremitted    Controls 
Subtest                (N=22)      (N=23)        (N=23) 
Information            13.81       13.27         13.39 
Comprehension          13.50       13.14         12.83 
Digit Span             10.04       11.14         11.61 
Arithmetic             13.36       10.77         13.43 
Similarities           13.00       12.09         12.13 
Vocabulary             14.32       13.95         13.87 
Picture Arrangement     9.59        9.73         10.70 
Block Design           11.77       10.95         11.96 
Object Assembly        10.68       11.32         12.26 
Digit Symbol            9.81        9.45         10.48 
Verbal IQ             121.45      117.82        121.09 
Performance IQ        117.77      116.14        120.00 
Total IQ              121.27      118.45        121.83 


TABLE 2 

RANK-SUMS TEST RESULTS COMPARING THE THREE GROUPS ON THE SCATTER INDEX z's 

               Remitted    Controls 
Remitted                   3.31* 
Unremitted     .42         2.93* 

* Significant at .01 level. 


The results of the scatter index analysis 
are shown in Table 2. The data do not show 
any differences between the remitted and un- 
remitted groups, but each experimental group 
differs from the normal controls. The alco- 
holic groups produced significantly higher 
scatter index scores than the normal group 
(P= .01). The fact that the two alcoholic 
groups are identical in mean scatter and that 
each independently differs significantly from 
the normal controls is evidence for the relia- 
bility of this finding. 

The diagnostic implications of this scatter 
measure as reflective of severe psychopathology 
were noted in a recent study by Trehub 
& Scherer (1958). Using the same scatter 
index as employed in this study, these authors 
successfully differentiated schizophrenics from 
neurotics and character disorders. Using a 
cutting score of 19 for schizophrenia, they 
report 70% successful diagnosis. Our com- 
bined alcoholic groups contain 33 members 
out of 45 with scatter indices of 19 or above, 
a substantial 73%. Eight or 33% of our 
normal controls have scatter indices that 
high. This, of course, does not imply that our 
alcoholic groups are therefore largely schizo- 
phrenic. Since our alcoholic groups were 
screened for presence of psychosis or brain 
pathology by means of extensive conferences 
and examination by psychological and psychiatric 
staffs, it seems likely that the results 
of the scatter index are not related to severe 
psychopathology or clinically detectable brain 
damage. The possibility, however, that prolonged 
alcoholic excess, in terms of effecting 
subtle brain damage, might have contributed 
to the scatter remains. A partial check on this 
was attempted by running a 
Pearson r between the number of years each 
alcoholic reported he had been drinking in 
excess and his scatter index score. The correlation 
obtained was .05, a nonsignificant 
finding. In addition, no differences between 
the controls and alcoholics were found on 
those subtests usually associated with organ- 
icity, i.e., Block Design, Object Assembly, 
Digit Span, etc. An investigation of the de- 
terminants of the alcoholics’ Wechsler-Belle- 
vue scatter, and a similar investigation of the 
scatter of a more pathological group (schizo- 
phrenics for example) would provide a better 
basis for understanding the nature of con- 
tributing factors to these uneven intellectual 
profiles. 
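
The partial check described above, a Pearson r between reported years of excessive drinking and the scatter index, could be computed along the following lines. This is a sketch under stated assumptions: the function name and the example data are hypothetical, and the study's raw values are not available here.

```python
import math


def pearson_r(xs, ys):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length with n >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant variable")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical data: years of excessive drinking and scatter index scores.
years_drinking = [6, 12, 19, 25, 29, 38]
scatter_scores = [21.5, 18.0, 24.2, 19.7, 22.1, 20.3]
print(round(pearson_r(years_drinking, scatter_scores), 2))
```

A coefficient near zero, such as the .05 reported in the text, would indicate no linear relation between duration of drinking and unevenness of the subtest profile.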

Thus, the independence of subtest scatter 
from duration and stability of remission, and 
perhaps also from years of prolonged drink- 
ing, suggests that unevenness of intellectual 
pattern might be due to deeply rooted per- 
sonality characteristics of the chronic alco- 
holic. It may represent a long-term process 
of erratic intellectual development and func- 
tioning symptomatic of a more generalized 
deviation in personality organization and in 
the patterns of adaptation, which antedate 
the addiction to alcohol. 


SUMMARY 


Male remitted and unremitted alcoholics 
and their normal controls, equated for age, 
education and ethnic background, were ad- 
ministered the Wechsler-Bellevue Intelligence 
Scale for Adults (Form I). 


The total IQs and weighted subtest means 
of both alcoholic groups generally supported 
previous studies. 

Significant differences in only one subtest 
were found. On the Arithmetic subtest, the 
unremitted alcoholics scored lower than both 
the remitted alcoholics and normal controls 
(P = .01). 

A scatter index analysis revealed no sig- 
nificant differences between the remitted and 
unremitted alcoholics. Both experimental 
groups, however, had significantly higher 
scatter index scores than the normal con- 
trols (P = .01). These differences suggest a 
pattern of intellectual unevenness in the alco- 
holic deviant from that of a normal popu- 
lation. 

The significance of these findings was dis- 
cussed and possible explanations were offered. 
A RATIONALE FOR SELECTING TOYS IN PLAY THERAPY 


HAIM G. GINOTT 
Child Guidance Clinic, Jacksonville, Florida 


Child therapists of different orientations 
make use of toys and play materials in the 
diagnosis and treatment of emotionally dis- 
turbed children. The rationale for this prac- 
tice is the belief that play is the child’s sym- 
bolic language of self-expression. Through the 
manipulation of toys the child can state more 
adequately than in words how he feels about 
himself and the significant people and events 
of his life. To a considerable extent, the 
child’s play is his talk and the toys his words. 

If this assumption is true, then the kind 
of toys and materials found in the playroom 
is an important therapeutic variable. Yet 
there are very few guideposts in the literature 
to the proper equipping of therapeutic set- 
tings. In many clinics, the playroom looks 
like a junkyard with an astonishing assort- 
ment of toys of various vintages, some of 
them marred and disfigured beyond use. New 
toys are amassed at random, with the thera- 
pist’s personal predilection being the decisive 
factor in their acquisition. The following 
“confession” will serve as an illustration: “I 
suffer temptations in toy stores . . . and I 
have no difficulty in persuading myself that 
a certain doll is just what I need for a certain 
four-year-old who has been getting along just 
fine without it” (Fraiberg, 1954, p. 3). 

The literature on toy selection is contra- 
dictory, and with few exceptions (Lebo, 1955, 
1958) is based on intuition rather than in- 
vestigation. For example, Arthur (1952) and 
Slavson (1952) advocate a playroom with 
many enticing toys, whereas Fraiberg (1954) 
thinks that therapy proceeds more effectively 
when the child is given only a few toys and 
is not absorbed in fancy materials and fas- 
cinating activities. There is no agreement 
about the value of specific materials, nor even 
about the necessity of having a special room 
set aside and furnished for child therapy. 

Clearly, a consistent rationale is needed 
for the selection of materials that contribute 


maximally to the process of child treatment. 
The present paper aims to formulate criteria 
for selecting effective toys and tools for play 
therapy with children under the age of 10. 


THERAPEUTIC OBJECTS AND OBJECTIVES 


The aim of child therapy, as of all therapy, 
is to effect basic changes in the intrapsychic 
equilibrium of the child. Through relation- 
ship, catharsis, insight, reality testing, and 
sublimation, therapy brings a new balance 
in the structure of the personality, with a 
strengthened ego, modified superego, and im- 
proved self-image. The value of any toy, 
object, or activity in child therapy depends 
on its contribution to the realization of these 
objectives. In evaluating activities and mate- 
rials, one should consider their effect on the 
inner processes of therapy. There are five 
major criteria for selecting and rejecting ma- 
terials for child therapy. A treatment toy 
should: (a) facilitate the establishment of 
contact with the child; (b) evoke and en- 
courage catharsis; (c) aid in developing in- 
sight; (d) furnish opportunities for reality 
testing; (e) provide media for sublimation. 


Tools for the Therapeutic Relationship 


A therapeutic relationship can be estab- 
lished and maintained only if the therapist 
understands the child’s communications. The 
assumption is that everything said or done 
by the child in the playroom has meaning 
and significance in his frame of reference. Yet 
it is not easy even for experienced therapists 
to understand all of the child’s play messages. 
Questioning the child about the meaning of 
his play is worse than futile; it results in 
resistance and silence. Appropriate toys make 
it easier for the therapist to understand the 
meaning of the child’s play. Thus, for ex- 
ample, children usually play out family 
scenes by using dolls that represent mother, 
father, and siblings. In the absence of such 




dolls, a child may symbolically play out 
family themes by using big and little wooden 
blocks. But the exact meaning of the message 
may escape the therapist. Banging two blocks 
together may represent spanking, or inter- 
course, or may merely be a test of the thera- 
pist’s tolerance for noise. Inserting a pencil 
into a pencil sharpener may represent inter- 
course, but it may also mean that the pencil 
needs sharpening. However, when a father 
doll is put on top of a mother doll, the thera- 
pist has less room for misinterpretation. For 
the child, pencil and doll may be equally 
useful as a means of expression, but to the 
therapist they are not. The presence of a doll 
family enables the child to assist the thera- 
pist in understanding him. Some materials 
are superior for use in therapy because they 
give therapists tools for understanding the 
child, without diminishing the fluidity of his 
play or conversation. It seems reasonable to 
assume that a playroom with a variety of 
selected toys is more conducive to therapeutic 
communication than one filled with odds and 
ends of junk. 

Many therapists find it easier to structure 
therapy and to make contact with a child 
when the playroom contains materials whose 
very presence reflects permissiveness. Some 
toys and tools are associated in children’s 
experience with parental and sibling restric- 
tions and punishments. Many children have 
been scolded or spanked for messing with 
mother’s typewriter, fooling with brother’s 
flashlight, or playing with father’s tool kit. 
Nothing conveys permissiveness to these 
children as much as the presence of such 
materials in the playroom for their own use. 
Noise-making toys such as drums, pegboards, 
xylophones, air rifles, and cap guns serve the 
same purpose. They communicate loud and 
clear the adult’s basic spirit of tolerance. 

If it is at all possible, every child should 
find in the playroom some toy, tool, material 
or activity refused to him in the past. Per- 
haps the case histories of clinic children 
should contain more detailed information on 
the child’s play deprivations and aspirations. 


Toys for Catharsis 


The common assumption that children pro- 
ject their emotional needs on any play mate- 
rials is only half true. It ignores the fact that [...] 
young children has to be limited through 
therapeutic intervention (Ginott, 1959). Act- 
ing out is of value only when it represents 
working out of the child’s core difficulties. 
Knowing each child’s central problems and 
providing relevant toys and tools is an im- 
portant, though often a neglected, part of a 
therapist’s responsibility. 

In planning for therapeutic catharsis the 
therapist should furnish materials that elicit 
acting out related to the child’s fundamental 
problems and should avoid materials that 
evoke diffuse hyperactivity. Thus hyperki- 
netic, overactive, and brain damaged children 
should not find finger paints or running water 
in the playroom. Such materials overstimulate 
them and invite smearing of each other and 
the walls. And although such behavior may 
be highly pleasurable for children, it is psy- 
chonoxious in effect and leads to weakened 
inner controls and disorganization of person- 
ality. These children need materials and ac- 
tivities that will focus rather than diffuse 
their flow of energy. Pounding pegboards, 
building block houses, shooting rifles, driving 
nails, sawing wood, or constructing boxes may 
give form and direction to the disorganized 
urges of these children. These activities ex- 
pose the children to sustaining interests that 
call for concentration and coordination. In an 
atmosphere of wise encouragement the children's 
frustration tolerance will be enhanced and they 
will become able to focus energies on projects 
and goals, both in and out of the playroom. 

Fearful, fragile children should find in the 
playroom materials that can be handled with- 
out the aid of tools, such as water, paint, 
sand, play-dough, dolls, chalk, crayons, etc. 
They need materials that will enable them 
to conceal what they do not want to reveal, 
and to do and undo acts without detection 
and embarrassment. Thus, they can use clay 
for sculpturing or scalping, sand for building 
or burying, and paint for decorating or dirty- 
ing. The reversibility of these media enables 
children to change the identity of their sym- 
bolic revelations at will and makes it safe 
for them to explore their inner and outer 
worlds. 


Materials for Insight 


Toys do not contribute directly to the 
achievement of insight. This is only gained 
through growth in inner security. But toys 
do facilitate interpretations that help chil- 
dren become more aware of themselves and 
their relationships with significant people. It 
is through this awareness that insight is 
attained. 

There will be occasions on which it is de- 
sirable to arrange the materials in the play- 
room so as to provoke a child’s habitual mode 
of reaction. In general, one would not aim- 
lessly repeat within the playroom the frustra- 
tions and deprivations that the child is en- 
countering outside. But occasionally frustrat- 
ing situations may be set up in order to 
enable children to gain insight into the dy- 
namics of their behavior. When only two 
attractive guns are provided for three aggres- 
sive children, the probability is that conflict 
will occur. The children’s heightened reac- 
tions in the conflict situation offer the thera- 
pist a chance to confront each child with his 
habitual, unfruitful, self-defeating modes of 
reaction and to suggest more effective ways 
of handling conflict. 


Toys for Reality Testing 


From the moment he enters treatment the 
child tests the realities of the setting, the 
reaction of the therapist, the response of his 
group-mates, the nature of the materials. The 
frustrations and satisfactions encountered in 
handling playroom materials and the sense 
of power acquired in mastering them have 
direct bearing on the child’s ego strength and 
self-image. The ego is strengthened and the 
self-image is enhanced by realistic success 
and both are damaged by repetitive failure. 
While young children need encounters with 
reality as part of their treatment, it must be 
so arranged as to suit the child's changing 
powers. Accordingly, it is especially important 
that children who come to treatment 
with vast experience in defeat do not relive 
failure in the playroom. Too many playrooms 
contain complicated mechanical toys that are 
hard to wind, guns that are difficult to cock, 
and puzzles that are impossible to put together. 
Such play materials produce frustra- 
tion and dependence, and hinder the develop- 
ment of feelings of mastery and security. A 
playroom should contain materials of graded 
difficulty which allow each child, regardless 
of handicap, to achieve some measure of success. 
Frequently a child with a long history 
of failure will discover that he has more 
ability in handling materials than he has ever 
known. The consequent improvement in self- 
image may mark the beginning of “cure.” 

Materials and activities can contribute to 
the development of a realistic social outlook. 
Axline (1955) asks the pertinent question: 
“If play-therapy is an experience in self- 
exploration, . . . how does the child learn 
to expand beyond his self-centeredness to a 
recognition and appreciation of others?” (p. 
622). In order for play therapy to be an 
experience in social learning, children should 
be provided with situations and materials that 
demand exploration of others as well as them- 
selves. Most children, at certain times in their 
therapy, should be exposed to peers, resistive 
materials, and planned scarcity of tools so 
that they can test themselves in relation to 
social actualities. A setting that selectively 
and intentionally provides one electric jigsaw 
for three children who work with wood is 
likely to create frustration and conflicts and 
subsequent modes of conflict-resolution. The 
therapeutic gains in such a setting will be 
greater than those where materials are pro- 
vided indiscriminately. 


Media for Sublimation 


One of the important objectives of psycho- 
therapy is to help children to develop sub- 
limations that are compatible with society’s 
demands and mores. Our culture does not 
give children much choice about relinquishing 
infantile gratifications. Children are forced 
to give up interest in primary processes as 
early as possible. Children are strictly forbidden 
to handle body products and even the 
desire to explore them is prohibited. This 
task is usually accomplished by punishment, 
which leads to repression. It is of the utmost 
importance in therapy to offer children opportunities 
to enjoy forbidden pleasures in 
acceptable substitute ways. Since “a small 
child’s imagination makes urine of every fluid 
and feces of every messy substance” (Balint, 
1954, p. 24), sand, water, paint, and clay 
provide excellent means of sublimating ure- 
thral and anal drives. No playroom for small 
children is complete or adequate without such 
materials. Enuretic children should be given 
paint and running water; encopretic children, 
mud and brown clay. Firesetters should have 
cap guns, sparklers, and flashlights. All children 
should find in the playroom miniature 
utensils for cooking and serving meals to 
sublimate oral needs, dolls that can be 
dressed and undressed to sublimate sexual 
needs, and punching bags, targets, and guns 
to sublimate aggressive drives. Each child 
should have the opportunity to express his 
needs symbolically in a great variety of ways 
according to his changing capacities. A child 
should be led to express anger by punching 
dolls and destroying clay figures, as well as 
by composing critical poems and writing 
murder mysteries. The lack of adequate out- 
lets in the playroom may impede the emer- 
gence of more mature means of sublimation. 
The therapy setting must provide materials 
that allow growth in the repertory of self- 
expression. 


SUMMARY 


This article presents a consistent rationale 
for selecting toys that contribute maximally 
to the process of play therapy. A desirable 
treatment toy: (a) facilitates the establishment 
of contact with the child, (b) encourages 
catharsis, (c) aids in developing insight, 
(d) furnishes opportunities for reality testing, 
and (e) provides media for sublimation. 


There has been some speculation concerning 
the relationship between degree or 
amount of verbal production and therapeutic 
change during group or individual therapy. 
Psychotherapy has often been called “the 
talking cure” and the assumption that there 
is a distinct positive relation between the 
amount of the patient’s verbalization and the 
extent of improvement has appeared so ob- 
vious a truism as to hardly require explicit 
statement. However, there is little in the lit- 
erature bearing on this matter. Gibby, Stot- 
sky, Hiler, and Miller (1954) have shown 
that the more verbally productive patients 
remain longer in psychotherapy, thereby pre- 
sumably having greater opportunity for im- 
provement. On the other hand, Rogers 
(1942) suggested that therapy may actually 
take place during periods of silence. Powder- 
maker and Frank (1953) cautioned that in 
group therapy “in itself speech may have no 
more value than silence.” 

The present study was an investigation of 
the relationship between degree of verbal 
participation and changes in attitudes in a 
therapy group of adult offenders on probation. 
It was hypothesized that individuals 
with a greater degree of verbal participation 
would evidence greater positive changes in 
their attitude toward authority figures and 
social conformity. 


1 This study was part of a larger interdisciplinary 
study focused on the interaction process and the 
sociometric relations developed within a therapy 
group of probationers to determine whether 
observational techniques would throw light on reasons 
for the change in attitude and perceptions of the 
subjects. 


PROCEDURE 


Fifteen consecutive probation placements whose 
ages ranged from 16 through 32 years, whose IQs 
ranged from 80 to 125, and whose educational level 
ranged from the fifth to the twelfth grade, were 
divided into two groups of eight and seven sub- 
jects (Ss). Each group met for 90 minutes weekly 
for 15 sessions at the Brooklyn Association for the 
Rehabilitation of Offenders Clinic, a noncourt clinic, 
for client-centered group therapy. An observer kept 
track of the amount of time each individual spoke 
during each session, using a record blank which 
divided each 90-minute session into 20-second 
intervals. 
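
The timing procedure above amounts to simple interval bookkeeping: a 90-minute session split into 20-second intervals gives 270 cells per record blank. The following is a minimal sketch of such a tally, assuming (this is not stated in the text) that the observer marks at most one speaker per interval; the function name, the speaker labels, and the sample record are hypothetical.

```python
from collections import Counter

SESSION_MINUTES = 90
INTERVAL_SECONDS = 20
# 90 minutes * 60 seconds / 20 seconds = 270 intervals per session
INTERVALS_PER_SESSION = SESSION_MINUTES * 60 // INTERVAL_SECONDS


def speaking_seconds(record):
    """Convert one session's record blank into seconds spoken per subject.

    `record` is a list with one entry per 20-second interval: a subject
    label, or None when no group member was marked as speaking.
    (Hypothetical encoding; the paper does not describe the blank in detail.)
    """
    if len(record) != INTERVALS_PER_SESSION:
        raise ValueError("expected %d intervals" % INTERVALS_PER_SESSION)
    counts = Counter(label for label in record if label is not None)
    return {label: n * INTERVAL_SECONDS for label, n in counts.items()}


# Hypothetical session: S1 marked in 30 intervals, S2 in 6, nobody in the rest.
example_record = ["S1"] * 30 + ["S2"] * 6 + [None] * 234
example_totals = speaking_seconds(example_record)
```

Summing such per-session totals over the 15 sessions would give each S's overall amount of verbal participation.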

In order to evaluate the changes in attitudes, a 
modification of the TAT was employed following 
Stern (1952) and Newman (1953), who found that 
pictures of specific authority figures interacting with 
subordinates were more effective than the standard 
TAT in eliciting attitudes toward authority figures. 

The specific pictures employed and order of 
presentation were as follows: 


1. Mother and boy with violin (Card 1 of TAT 
superimposed on Card 5) 
2. Employer-employee situation 
3. Policeman-subordinate situation 
4. Judge-witness situation 
5. Buffer: silhouette in window (Card 14 of TAT) 
6. Father-son situation (Card 6BM of TAT) 
7. Mother-son situation (Card 7BM of TAT) 
8. Warden-prisoner situation 
9. Teacher-student situation 
10. Probation officer-probationer situation 


Standard technique was followed (Murray, 1951) 
for administration before and after treatment. The 
TAT responses were scored for certain attitudes 
toward authority figures in accordance with the sys- 
tem devised by Aron (1949) and used by Stern 
(1952), to wit: (a) Aggression, (b) Independence, 
and (c) Dependence. 

The Human Relations Inventory (HRI), a 37- 
item projective questionnaire (Bernberg, 1954) 
which provides a quantitative estimate of social 
conformity based on the S’s need-value system, was 
also administered before and after treatment. 

Independence of treatment and testing was 
achieved by having an examiner who had no idea 
of the Ss’ degree of verbal participation administer 
the tests. Uniform test instructions were given to 
all Ss and the responses were recorded verbatim. 
Objectivity in scoring was obtained by using the 
Aron system which resulted in interjudge agreement 
of better than 95%. The HRI is completely self- 
administered and self-scoring. 

In addition, the scoring of all tests was completed 
before the Ss’ degree of verbal participation was 
computed, so that the scorers were not influenced 
by the latter. 


RESULTS 


Amount of Verbal Participation and 
Changes in Attitude toward 
Authority Figures 

The amount of time each S spoke during 
the fifteen sessions and the scores of the TAT 
and HRI administered before and after the 
period of group therapy are summarized in 
Table 1. The posttreatment score was subtracted 
from the pretreatment score for each 
S to obtain a measure of change on the TAT 
and HRI. The Ss were ranked according to degree 
of verbal participation and according to their 
TAT and HRI change scores. Rank difference 
correlations of .08 on the TAT and .01 on the 
HRI were [...] 

[...]ment” of a member in group therapy, to 
relate the gross degree of oral output to prognosis. 
Almost invariably the “silent” member 
of the therapy group is considered somewhat 
lacking in reference to improvement. “How 
could he profit if he said nothing, just 
listened?” many common sense observers 
might ask. 

Perhaps the important variable which is a 
function of improvement in treatment is not 
the torrent of words that emerges from the 
patient, but the experience of being accepted 
and understood in the therapy situation, as 
Rogers (1942, 1951, 1955) has repeatedly 
suggested. More currently, within the existen- 
tialist framework, May (1958, p. 28) has 
pointed out: 

It is well known to every therapist that patients 
can talk theoretically and academically from now 
until doomsday about their problems, and not really 
be affected; indeed, particularly in cases of intellec- 
tual and professional patients, this very talking, 
though it may masquerade under the cloak of un- 
biased and unprejudiced inquiry into what is going 
on, is often the defense against seeing the truth and 
against committing one’s self, a defense indeed 
against one’s own vitality. The patient’s talking will 
not help him get to the reality until he can experi- 
ence something of some issue in which he has an 
immediate and absolute stake. 


On this basis it would appear that the 
comparatively nonverbalizing patient has an 
opportunity in group therapy to experience a 
considerable degree of acceptance and under- 
standing by merely entering the pervasive 
permissive atmosphere of a functioning thera- 
peutic group without feeling any overwhelm- 
ing qualms of inadequacy because he is not 
verbally productive, as might happen in the 
face-to-face situation of individual treatment. 

The anxious therapist, concerned about the 
group member who does not appear to be 
actively participating in the verbal interaction, 
may be reassured that even the relatively 
silent S may well be employing the 
therapy period to reintegrate the perceptual 
field in the direction of self-realization or he 
may be staking out an area of commitment. 
This study at least suggests that the most 
articulate member of the group is not neces- 
sarily the person who gains the most, nor is 
the quiet one in the group the individual who 
does not achieve important attitudinal altera- 
tions. 
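
The rank-difference correlation reported in the Results can be illustrated with a short sketch. It assumes the classical formula without a correction for tied ranks, rho = 1 - 6 * sum(d^2) / (n * (n^2 - 1)), where d is the difference between a subject's two ranks; the paper does not say how ties were handled. The data below are invented for illustration and are not the study's scores.

```python
def ranks(values):
    """Rank values from 1 (smallest) to n (largest); assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def rank_difference_correlation(x, y):
    """Rank-difference (Spearman) correlation, classical untied formula."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equally long samples of at least 2 scores")
    if len(set(x)) != len(x) or len(set(y)) != len(y):
        raise ValueError("this simple formula assumes there are no tied scores")
    n = len(x)
    sum_d_squared = sum((rx - ry) ** 2 for rx, ry in zip(ranks(x), ranks(y)))
    return 1 - 6 * sum_d_squared / (n * (n * n - 1))


# Hypothetical data: seconds spoken and attitude change scores
# (pretreatment minus posttreatment) for five subjects.
talk_time = [1200, 300, 2400, 900, 600]
change_scores = [4, -1, 2, 5, 0]
rho = rank_difference_correlation(talk_time, change_scores)
```

A value near zero, like the .08 and .01 reported above, indicates that a subject's rank on talking time says almost nothing about his rank on attitude change.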


SUMMARY 


The relationship between degree of verbal 
participation during group therapy with of- 
fenders and their change in attitudes toward 
authority was investigated. It was found that 
the amount of verbal production did not correlate 
with change in attitudes. 


SYLLOGISTIC REASONING ERRORS IN SCHIZOPHRENIA 1 


LEONARD GOTTESMAN AND LOREN J. CHAPMAN 2 


University of Chicago 


Von Domarus (1954) and Arieti (1955a, 
1955b) have asserted that an important as- 
pect of schizophrenic thinking disorder is a 
certain kind of error in syllogistic reasoning. 
Arieti calls this error tendency “Von Doma- 
rus’ principle” (or “paleologic”) and describes 
it as follows: “Whereas the normal person 
accepts identity only upon the basis of iden- 
tical subjects, the paleologician accepts iden- 
tity based upon identical predicates” (p. 
194). For example, given the premises: 


Stags are swift, 
Certain Indians are swift 


Von Domarus and Arieti both state that on 
the basis of the common predicate of swift- 
ness, schizophrenics will conclude that “Cer- 
tain Indians are stags,” while normal people 
will not. Arieti adds (1955b, p. 57) that “Of 
course this principle of Von Domarus must 
be viewed as determined by an inability to 
abstract.” 

However, other investigators (Chapman & 
Chapman, 1959; Sells, 1936) have reported 
strong error preferences of this sort by normals 
in syllogisms identical in figure and 
mood3 with the above. Chapman & Chapman 
(1959), using syllogisms containing 
statements about letters, found that in some 
moods many college students conclude that 
two subjects are identical on the basis of a 
common predicate, i.e., middle term. For 
example, in the syllogism 


All A’s are B’s, 
Some C’s are B’s 


75% of a college undergraduate sample concluded 
that “Some C’s are A’s.” 


1 This study was supported by research grant 
(M-1416) from the National Institute of Mental 
Health, U. S. Public Health Service. 
This article is based in [...] 

2 The senior author is now at the University of 
Michigan, and the junior author is now at the 
University of Kentucky. 

3 The terminology used to describe [...] 
a uniform one, based [...]. For details 
see any introductory logic textbook, such as Cohen 
and Nagel (1934). The following informal summary, 
however, [...] for the purposes of [...]. 

There are four forms of categorical propositions 
used in syllogisms: 

Expression                   Symbol 
All pots are black.          A 
No pots are black.           E 
Some pots are black.         I 
Some pots are not black.     O 

A syllogism consists of three such statements, the 
first two of which are the premises: 

1. The major premise, which states the relation 
between the middle term (M) and the predicate (P) 
of the conclusion. 

2. The minor premise, which states the relation 
between the middle term (M) and the subject (S) 
of the conclusion. 

The figures of the syllogism are the four possible 
arrangements of the terms in the major and minor 
premises: 

Fig. 1    Fig. 2    Fig. 3    Fig. 4 
M-P       P-M       M-P       P-M 
S-M       S-M       M-S       M-S 
S-P       S-P       S-P       S-P 

The mood of a syllogism refers to the combination 
of three propositions from among the four kinds of 
categorical propositions. For example, a syllogism 
of the “AII” mood, in the first figure, is as follows: 

All men are mortal, 
Some Indians are men, 
Therefore: 
Some Indians are mortal. 

This is a valid syllogism. 

Each of the three propositions of a syllogism may 
be A, E, I, or O. Considering the major and 
minor premises only, there are 16 such possibilities 
of combination. Fourteen of these yield no valid 
conclusion in one or more of the four figures. 

Although both Von Domarus and Arieti 
cast their examples in the second syllogistic 
figure, their principle also seems to apply to 
syllogistic Figures 1, 3, and 4 in the cases 
in which the premises are logically equivalent 
to the second figure. For example, II-4 prem- 
ises are of the form 


Some A’s are B’s, 
Some B’s are C’s. 


The premise “Some B’s are C’s” necessarily 
means that “Some C’s are B’s,” and making 
this change in the second premise above 
would create a syllogism of the second figure. 
Therefore, the same logical predictions may 
be made from II-4 as from II-2. In the same 
way, Von Domarus’ principle applies to AA, 
Fig. 2; AI, Fig. 2 or 4; IA, Fig. 1 and 2; 
and II, any figure. It appears that their prediction 
would be an I conclusion for AI and 
II, and an A conclusion for AA and IA. The 
latter A prediction is for the premises of 
the form 


Some A’s are B’s, 
All C’s are B's. 


By Von Domarus’ principle, “All C’s” and 
“Some A’s” are identical. Therefore, the rela- 
tionship of C to A is stated by the con- 
clusion “All C’s are A’s.” Previous research 
(Chapman & Chapman, 1959) found that 
for premise pairs containing statements about 
letters, normal Ss chose the Von Domarus 
predicted error for II, AA, and AI [...]. 
In IA, however, they chose I instead of A. 

These findings contradict the claims of Von 
Domarus and Arieti that such patterns of 
reasoning are distinctively pathological. Any 
validity for their reported [...] observations 
would have to reside in their greater frequency 
in schizophrenics than in normals. This is 
consistent with evidence, recently [...] 
by one of the present writers (Chapman, [...]), 
that the kinds of errors which have [...] 
“the schizophrenic thinking disorder” are made 
by both normals and schizophrenics, although 
less often by normals. 

We will compare schizophrenics with nor- 
mals on the frequency of occurrence of Von 
Domarus’ predicted errors. 

If syllogistic errors are important in schizo- 
phrenic thinking disorder, it seems unlikely 
that such errors are limited to these four 
moods. We will, therefore, compare our 
groups on all 14 pairings, from among the 
four forms of categorical propositions, from 
which no valid conclusion can be drawn. Our 
main hypothesis is that schizophrenic Ss will 
exhibit the same error preferences as normal 
Ss. Since schizophrenic Ss are expected to 
make more errors than normal Ss, we further 
hypothesize that their greater number of 
errors will be expressed as an increased manifestation 
of normal error patterns. 


METHOD 

Instrument 


The experimental instrument,4 a modification of a 
paper and pencil syllogism test developed by Chapman 
and Chapman (1959), was given to Ss either 
individually or in small groups. The test was composed 
of 40 multiple-choice items, each containing 
two premises, the word “therefore,” and five alternative 
conclusions. Four of these alternatives were 
cast in the possible forms which a syllogistic conclusion 
can take (i.e., A, E, I, and O), and the fifth 
alternative asserted that “None of these conclusions 
is proved.” For example, the test contained the 
following item, to which the correct answer is the 
third alternative, which states that no conclusion is 
proved: 


All of Tom’s ties are red. 
Some of the things Ada is holding are red. 
Therefore: 
1. At least some of the things Ada is holding 
are Tom’s ties. 
2. At least some of the things Ada is holding 
are not Tom’s ties. 
3. None of these conclusions is proved. 
4. None of the things Ada is holding are Tom’s 
ties. 
5. All of the things Ada is holding are Tom’s 
ties. 


The aim of the present study was to analyze and 
describe the kinds of errors made by normal and 
schizophrenic Ss on 28 experimental items, which 
contained two examples of each of the 14 combinations 
of categorical propositions for which no 
valid syllogistic conclusion is possible. The remaining 
12 of the 40 items could be solved with one or 
more valid syllogistic conclusions. These were filler 
items included to prevent Ss from discovering that 
the alternative, “None of these conclusions is proved,” 
was the only correct solution on the experimental 
items. The order of the items and of the alternatives 
was random. 


4 Copies of the test may be obtained from the 
authors, or may be ordered from the American 
Documentation Institute. Order Document No. 6287, 
remitting $1.75 for 35 mm. microfilm, or $2.50 for 
6 X 8 in. photocopies. 

Preliminary testing of an earlier version of the 
test revealed that it was so difficult that accuracy 
scores of normal Ss as well as schizophrenics were 
close to zero. Such great test difficulty tends to mask 
differences between groups in accuracy. Therefore, 
in an attempt to make the test easier, all items were 
redesigned, so that they had three characteristics. 
First, in all items the subject, the predicate, the 
middle terms, and the relationships among them pre- 
sented objects and relationships that would be 
within the experience of all Ss, but yet were such 
that the syllogisms could not be solved on the basis 
of prior knowledge. That is, the statements all con- 
cerned common objects, e.g. toys, automobiles, 
clothing, but did not specify any particular such 
objects which the Ss themselves encountered. Second, 
an attempt was made to construct items in which 
the correct alternative, “None of these conclusions 
is proved,” was a fairly attractive one. Finally, in 
the I and O propositions, instead of the word “some” 
the expression “at least some” was always used, in 
order to avoid Ss’ misinterpreting “some” to mean 
“some but not all.”6 

In addition to the syllogism test, each S was 
given the CVS intelligence scale (Hunt, French, 
Klebanoff, Mensh, & Williams, 1948) as well as the 
full vocabulary subtest of the Stanford-Binet, 
Form L. 
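
The count of 14 premise pairings that are insoluble in at least one figure can be checked mechanically. The sketch below is an illustration, not part of the authors' method: it enumerates every pattern of empty and occupied Venn regions for the terms S, M, and P, assumes the traditional reading in which every term has at least one member (existential import), and admits only conclusions with S as subject and P as predicate. All function and variable names are hypothetical.

```python
from itertools import product

# Each Venn region is a triple (in_S, in_M, in_P); a model is the set of
# regions that contain at least one individual.
REGIONS = list(product([False, True], repeat=3))
TERM = {"S": 0, "M": 1, "P": 2}

# (subject, predicate) of the major and minor premise in each figure.
FIGURES = {
    1: (("M", "P"), ("S", "M")),
    2: (("P", "M"), ("S", "M")),
    3: (("M", "P"), ("M", "S")),
    4: (("P", "M"), ("M", "S")),
}


def holds(form, x, y, occupied):
    """Truth of 'All/No/Some x are y' or 'Some x are not y' in a model."""
    xi, yi = TERM[x], TERM[y]
    some_x_is_y = any(r[xi] and r[yi] for r in occupied)
    some_x_not_y = any(r[xi] and not r[yi] for r in occupied)
    return {"A": not some_x_not_y, "E": not some_x_is_y,
            "I": some_x_is_y, "O": some_x_not_y}[form]


def all_models():
    """Every occupancy pattern in which S, M, and P are all non-empty
    (the existential-import assumption stated in the lead-in)."""
    found = []
    for bits in product([False, True], repeat=len(REGIONS)):
        occupied = {r for r, bit in zip(REGIONS, bits) if bit}
        if all(any(r[i] for r in occupied) for i in range(3)):
            found.append(occupied)
    return found


MODELS = all_models()


def valid_conclusions(major, minor, figure):
    """Forms of S-P conclusion true in every model satisfying both premises."""
    (a, b), (c, d) = FIGURES[figure]
    satisfying = [m for m in MODELS
                  if holds(major, a, b, m) and holds(minor, c, d, m)]
    return [f for f in "AEIO" if all(holds(f, "S", "P", m) for m in satisfying)]


def insoluble_figures(major, minor):
    """Figures in which the premise pair supports no S-P conclusion."""
    return [fig for fig in FIGURES if not valid_conclusions(major, minor, fig)]


ALL_PAIRS = [major + minor for major, minor in product("AEIO", repeat=2)]
INSOLUBLE_PAIRS = [p for p in ALL_PAIRS if insoluble_figures(p[0], p[1])]
```

Under these assumptions the sketch finds 14 pairings with at least one insoluble figure, every pairing except EA and EI. The sample item about Tom's ties is an AI pairing in Figure 2, for which `valid_conclusions("A", "I", 2)` returns an empty list.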


Subjects 


A normal group and a schizophrenic group of 
30 Ss7 each were tested. Six of the normal Ss were 
drawn from the general medical and surgical patients 
at the United States Veterans Administration 
Hospital, Chicago, Illinois and the remaining 24 
from Chicago firemen. No one was included in the 
normal group who appeared by hospital record or 
by interview to be mentally defective, to have had 
a psychiatric or neurological disorder, or to have 
been a drug addict or an alcoholic. The schizophrenic 
group consisted of 19 patients at the Chicago State 
Hospital and 11 at the Veterans Administration 
Hospital, Downey, Illinois. They had all received 
a hospital diagnosis of chronic schizophrenia, with 
their first admission for psychosis occurring at least 
one year previously and their last at least three 
months previously. However, almost all had spent 
most of the last 5 or 10 years in a psychiatric 
hospital. All but six were closed ward patients, and 
the majority of them were from wards considered 
to consist of the hospitals’ most regressed patients. 
Other prerequisites for selection of schizophrenics 
were absence of any indication of cerebral organic 
involvement or mental deficiency, no current [...] 
therapy, and no EST treatment within [...] 


5 Research with this type of instrument by Chapman 
& Chapman (1959) has shown that the figure 
of the invalid syllogism has little or no effect on the 
number or the type of errors made by normal Ss. 
In addition, none of the writers from whom experimental 
hypotheses were derived consider the figure 
of the syllogism. We have, therefore, included examples 
of only two figures where more than two figures 
of a given combination of major and minor premise 
were insoluble. 

6 See Sells (1936) for a discussion of the ambiguity 
of the word “some” as it is used in syllogisms. 
The above change was influenced by his discussion. 

7 We are indebted to the managers of the West 
Side VA Hospital, Chicago, Illinois and the VA 
Hospital, Downey, Illinois for permission to test 
patients for this experiment. We are also indebted to 
Meyer Williams and Julian H. Pathman for assistance 
in obtaining the Ss. 


TABLE 1 

MEANS OF AGE, EDUCATION, PRESENT IQ SCORE AND 
HIGHEST WORD PASSED ON VOCABULARY FOR 
NORMAL AND SCHIZOPHRENIC Ss 

                 Age     Education   IQ       Highest Word 
Schizophrenic    42.3    10.6        89.3     23.9 
Normal           38.7    10.9        107.2    25.5 


TABLE 2 

PERCENTAGE OF EACH CHOICE ON ITEMS FOR WHICH A AND I ARE PREDICTED 
BY VON DOMARUS’ PRINCIPLE 

Prediction   No. of Items   Group           A     E     I     O     N 
A            4              Normal          12    1     34    11    2 
                            Schizophrenic   20    5     23    20    33 
I            4              Normal          6     5     33    3     29 
                            Schizophrenic   17    10    29    23    22 


of differences between normal and schizo- 
phrenic groups were made using the Mann- 
Whitney test. Two-tailed tests were used 
throughout. Except where specified, compari- 
sons were based on the 28 experimental 
items. 

Both groups erred on a majority of the 
items, although as expected the schizophren- 
ics made significantly more errors than nor- 
mals both on the total 40-item test (p = .03) 
and on the 28 experimental items (p = .05). 
The mean numbers of errors of schizophren- 
ics and normals were 20.6 and 17.8 on the 
experimental items, and were 27.8 and 24.1 
for the entire 40 items. 

The high number of errors by the normal 
Ss provides an excellent opportunity for mak- 
ing qualitative comparisons of the preferred 
errors of the two groups. 

Let us first examine the data on those eight 
items which are relevant to Von Domarus’ 
principle. Table 2 shows the percentage of 
normal and schizophrenic choices in each 
error category for the four A prediction items 
combined (two items with AA premises, both 
of which were Fig. 2, and two with IA prem- 
ises of which one was Fig. 1 and one Fig. 2), 
and for the four I prediction items combined 
(two items with AI premises of which one 
was Fig. 2 and one Fig. 4, and two with II, 
of which one was Fig. 3, and one Fig. 4). 

As seen there, schizophrenics qualitatively 
exceeded normals in Von Domarus’ predicted 
A errors, but the difference was not signifi- 
cant (p = .31). In addition, the predicted A 
choice was not the favored error for the 
schizophrenics, since O and I alternatives 
were selected as often as A alternatives. Von 
Domarus and Arieti seem to predict a con- 
clusion of complete identity of subject and 
predicate (A conclusion) for this kind of 
syllogism. However, it should be noted that 
since I is a conclusion of partial identity, the 
choice of this alternative would not be com- 
pletely inconsistent with the Von Domarus 
principle. The A and I choices together ac- 
count for 46% of the normal Ss’ responses 
on these items, and 43% of the schizophren- 
ics’ responses. 

On the four items for which Von Domarus’ 
principle would lead to an I conclusion, the 
schizophrenics did make the largest portion 
of their errors in the I category but normal 
Ss exceeded them (not significantly) in this 
error. 

Furthermore, in the case of both sets of 
items, the schizophrenics and normals showed 
a quite similar rank order of error preference. 
It appears that the differences between the 
two groups on choice of error can be de- 
scribed as a more random response by the 
schizophrenics. The modal error alternatives 
are relatively less frequently chosen by the 
schizophrenics, while the unpopular alterna- 
tives are more frequently chosen. A greater 
scattering of responses by schizophrenics is 
almost always found on a multiple-choice 
test. 

Another helpful way to view these data is 
in terms of percentage of errors which coin- 
cide with Von Domarus’ prediction, i.e., on 
Table 2 the percentage of the incorrect 
choices which are A for the four items for 
which the Von Domarus principle predicts A, 
and those that are I for the four I prediction 
items. Grouping together these eight items, 
the percentage is 35.3% for the normal Ss 
and 33.7% for the schizophrenics. Contrary 

TABLE 3 

PERCENTAGE OF NORMAL AND SCHIZOPHRENIC CHOICE OF EACH ALTERNATIVE FOR ITEMS 
GROUPED BY NORMAL PREFERRED ERROR 

Error        No. of 
Preferred    Items          Group             A     E     I     O     N 
O            10             Normal            2     8    18    35    37 
                            Schizophrenic    11    13    22    29    25 
I            2              Normal            8     5    43    16    29 
                            Schizophrenic    18     7    32    18    25 
E            5              Normal            3    30    14    13    40 
                            Schizophrenic    11    19    24    19    27 
A            [illegible]    Normal           26     8    16     8    42 
                            Schizophrenic    22    12    16    23    27 

Note. [...] of the premises [...] figure for which the various er[...] 
were: for O preference, OO-3, [...]; 
for E preference, AE-1, AE-3, IE-1, OE-4, OE-1; [...] 

[...] relatively content-free propositions, i.e., statements about letters, 
[...] error preferences of the present study 
are more variable than those [...] 


to the Von Domarus principle, schizophrenic 
and normal Ss are remarkably similar on this 
measure. 
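
As a rough check on the measure just described, the arithmetic can be sketched as follows. This is only an illustration: it uses the schizophrenic rows of Table 2 as printed, treats N ("no conclusion") as the correct choice, and takes the error total in each pool as 100 minus the N percentage. The function name `predicted_error_share` is hypothetical. Because the table entries are rounded percentages, the result should only approximate the 33.7% reported in the text.

```python
# Percentages for the schizophrenic group, copied from Table 2.
# N (no conclusion) is the correct answer; A, E, I and O are errors.
a_items = {"A": 20, "E": 5, "I": 23, "O": 20, "N": 33}   # four A-prediction items
i_items = {"A": 17, "E": 10, "I": 29, "O": 23, "N": 22}  # four I-prediction items


def predicted_error_share(a_pool, i_pool):
    """Percentage of all errors that coincide with the predicted conclusion."""
    # Predicted errors: A choices on A-prediction items, I choices on I-prediction items.
    predicted = a_pool["A"] + i_pool["I"]
    # Both pools have the same number of items and Ss, so percentages can be pooled.
    errors = (100 - a_pool["N"]) + (100 - i_pool["N"])
    return 100.0 * predicted / errors


share = predicted_error_share(a_items, i_items)
print(round(share, 1))
```

The same function applied to the normal rows would need the table entries that are legible in the original print.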

We conclude that the data do not support 
Von Domarus’ and Arieti’s description of dif- 
ferences in syllogistic errors between schizo- 
phrenics and normals. 

Let us now compare normal and schizo- 
phrenic error preferences on the entire 28 
experimental items. Items for which the nor- 
mals had the same error preference were com- 
bined for purposes of the comparison. Two 
items were dropped here because two of their 
alternatives were tied on frequency of choice. 
Using this pooling of items, the percentage 
of normal and schizophrenic choices for each 
of the five alternatives was computed. These 
findings are shown in Table 3. 

As seen in Table 3, the error patterns for 
the two groups are similar. Within the pools 
of items the schizophrenic and normal groups 
tended to prefer the same erroneous alterna- 
tives, and also tended to agree on their least 
frequent choice, with the differences between 
the groups of Ss consisting largely of a more 
uniform dispersion of responses for the 
schizophrenics. This pattern is marked [...] 

[...] consider the appropriateness of the instru- 
ment. It might be objected that a multiple- 
choice syllogism test is too artificial a situa- 
tion to elicit the kind of thinking distortions 
which are called forth by real events. In par- 
ticular it might be objected that Von Dom- 
arus’ principle appears only in affect-laden 
situations. However, this restriction does not 
appear to hold for some other aspects of 
thinking disorder. Several previous studies 
have amply demonstrated that many clinical 
thinking disorders can be measured by con- 
trived multiple-choice tests, e.g. Chapman 
(1956a, 1956b), Epstein (1953), Fey (1951), 
Gorham (1956), Payne & Hirst (1957). We 
therefore conclude that the failure of the 
present study to substantiate the Von Dom- 
arus principle indicates that this principle 
may not be a pervasive feature of schizo- 
phrenic thinking. 

Yet there are occasional schizophrenic de- 
viancies which appear to be clear-cut mani- 
festations of Von Domarus’ principle, of a 
sort that normal Ss would not commit. The 
results of the present study do not contradict 
such observations, but do indicate that this 
kind of error may be limited in its occurrence. 


SUMMARY 


Normal and schizophrenic Ss were given a 
40-item multiple-choice syllogism test in 
order to compare them on the erroneous con- 
clusions which they reach from premises 
which, by traditional logic, yield no conclu- 
sions whatever. These included 8 items which 
measured the tendency to conclude the iden- 
tity of subjects on the basis of identical 
predicates, an error which Von Domarus 
(1954) and Arieti (1955) have described as 
distinctively schizophrenic. 

The results failed to substantiate the pre- 
dictions inferred from Von Domarus and 
Arieti. The present investigators’ hypothesis 
was partially confirmed in that the schizo- 
phrenics showed qualitatively similar error 
patterns to the normal Ss, but was not com- 
pletely confirmed in that the schizophrenics 
did not exceed the normal Ss in these error 
preferences. 


REFERENCES 


ARIETI, S. Interpretation of schizophrenia. New 
York: Robert Brunner, 1955a. 

ARIETI, S. Some aspects of language in schizo- 
phrenia. In H. Werner (Ed.), On expressive lan- 
guage. Worcester: Clark Univer. Press, 1955b. 

CHAPMAN, L. J. Distractibility in the conceptual 
performance of schizophrenics. J. abnorm. soc. 
Psychol., 1956a, 53, 286-291. 

CHAPMAN, L. J. The role of type of distracter in 
the “concrete” conceptual performance of schizo- 
phrenics. J. Pers., 1956b, 25, 130-141. 

CHAPMAN, L. J. Intrusion of associative responses 
into schizophrenic conceptual performance. J. 
abnorm. soc. Psychol., 1958, 56, 374-379. 

CHAPMAN, L. J., & CHAPMAN, JEAN P. Atmosphere 
effect re-examined. J. exp. Psychol., 1959, 58, 220- 
226. 

COHEN, M. R., & NAGEL, E. An introduction to logic 
and scientific method. New York: Harcourt Brace, 
1934. 

EPSTEIN, S. Overinclusive thinking in a schizo- 
phrenic and a control group. J. consult. Psychol., 
1953, 17, 384-388. 

FEY, ELIZABETH T. The performance of young 
schizophrenics and young normals on the Wiscon- 
sin Card Sorting Test. J. consult. Psychol., 1951, 
15, 311-319. 

GORHAM, D. R. Use of the Proverbs Test for differ- 
entiating schizophrenics from normals. J. consult. 
Psychol., 1956, 20, 435-440. 

HUNT, W. A., FRENCH, ELIZABETH G., KLEBANOFF, 
S. G., MENSH, I. N., & WILLIAMS, M. The clinical 
possibilities of an abbreviated individual intelli- 
gence test. J. consult. Psychol., 1948, 12, 171-173. 

PAYNE, R. W., & HIRST, H. L. Overinclusive thinking 
in a depressive and a control group. J. consult. 
Psychol., 1957, 21, 186-188. 

SELLS, S. B. The atmosphere effect: an experimental 
study of reasoning. Arch. Psychol., 1936, 29, 3-72. 

VON DOMARUS, E. The specific laws of logic in 
schizophrenia. In J. S. Kasanin (Ed.), Language 
and thought in schizophrenia. Berkeley: Univer. 
Calif., 1954. Pp. 104-113. 

WOODWORTH, R. S., & SELLS, S. B. An atmosphere 
effect in formal syllogistic reasoning. J. exp. Psy- 
chol., 1935, 18, 451-460. 


(Received June 24, 1959) 


Journal of Consulting Psychology 
1960, Vol. 24, No. 3, 256-261 


BODY IMAGE CHANGES ASSOCIATED WITH 
PERSONALITY REORGANIZATION 


SIDNEY E. CLEVELAND 
Veterans Administration, Houston, Texas and Baylor University College of Medicine 


In order to place the present study in 
proper perspective, it is necessary to sum- 
marize briefly the results of two previous 
research efforts by the author. 

First, in a study by Cleveland and Fisher 
(1958) a relationship between Barrier and 
Penetration scores and degree of personality 
reorganization following psychotherapy was 
found. Patients rated as improved following 
psychotherapy tended to show increase in 
Barrier scores and/or decrease in Penetration 
scores compared to pretherapy scores. Un- 
improved patients tended to reverse this 
trend. These preliminary results were tenta- 
tively interpreted as suggesting that rated 
improvement in adjustment paralleled greater 
definiteness in body image boundaries. 

The body image indices of Barrier and 
Penetration are derived from a content analy- 
sis of the Rorschach. In this scoring system 
a count is made of all responses referring to 
an object with well defined boundaries, such 
as “a cave with rock walls,” or “a knight in 
armor.” The total number of such responses 
constitutes the Barrier score. Similarly in ob- 
taining the Penetration score, a count is made 
of all responses in which the boundaries of 
an object are weak and ill defined. Such 
responses as “an open, bleeding wound” or 
“soft cotton candy” serve as examples. 

Results of extensive research indicate that 
Barrier score reflects degree of definiteness of 
body image boundaries. Definiteness of boun- 
daries in turn is related to such personality 
variables as level of maturity and toleration 
of stress. Penetration score indicates vulnera- 
bility of body image boundaries and appears 
to be negatively correlated with maturity and 
toleration of stress. 


A second report by the author (1958) re- 
lates to body image in schizophrenia. In a 


normative study, Barrier and Penetration 
scores were tabulated for 50 normals, 40 neu- 
rotics, 40 undifferentiated schizophrenics and 
40 paranoid schizophrenics. Normals and 
neurotics did not differ significantly on either 
body image scale. However, both groups dif- 
fered significantly from the two psychotic 
populations, with schizophrenics having lower 
Barrier scores and much higher Penetration 
scores than either normals or neurotics. 

This latter study of course represents sim- 
ply an examination of a cross section of pa- 
tients hospitalized for a psychotic condition. 
In order to examine more fully the relation- 
ship between body image and ego regression 
or reorganization, it was desired to follow a 
group of psychotic patients longitudinally as 
they underwent treatment. The problem to 
be studied involved repeated sampling of 
body image fantasies as the same patient 
either improved or regressed further over a 
period of time. 

The opportunity to gather such data was 
provided in the spring, 1958, when the VA 
conducted a nationwide research project to 
investigate the relative effectiveness of five 
different tranquilizers and a placebo (pheno- 
barbital) in bringing about remission of 
symptoms in schizophrenia. The study was 
conducted in the usual double blind manner. 
Participating hospitals included the Houston 
and Waco VA installations. 


PROCEDURE 


Twenty-four newly hospitalized patients with the 
basic diagnosis of schizophrenic reaction comprised 
the Waco sample. Eighteen patients similarly diag- 
nosed were included in the Houston study. Patients 
to be placed on the research project were carefully 
selected and assigned by the local project director 1 


1 I am indebted to Donald Gorham for collection 
of the Waco data. 

and psychiatrists concerned. They were guided in 
their selection by a list of criteria. Among these were 
the exclusion of patients with lobotomy, CNS dis- 
ease, and primary alcoholism. Selected patients 
varied in age from 25 to 45 years and in education 
from 6th grade to college graduate. All were males 
and 10 patients were Negroes. Prior to being placed 
on an unknown medication, behavioral ratings were 
made of each patient utilizing the Lorr Multidimen- 
sional Scale for Rating Psychiatric Patients (1953). 
This rating was accomplished by a psychiatric team 
composed of psychiatrist, psychologist, and psychi- 
atric nurse in interview with the patient. Additional 
ratings were made by the psychiatric aides. These 
ratings were repeated during the 5th week of drug 
administration and again following 13 weeks of 
medication. 

The Lorr scale yields an over-all morbidity score. 
This score represents the degree of pathological be- 
havior observed. Disharmony of thought and feeling, 
presence of delusional thinking or hallucinatory ac- 
tivity, appropriateness of motor behavior, and degree 
of reality orientation all contribute to this rating 
of pathology. The scale is so constructed that these 
ratings can be made on observable behavior or 
readily inferable symptoms. Change in morbidity 
score from first to second and first to third rating 
serves as one criterion of degree of personality re- 
organization in the present study. A second cri- 
terion of personality reorganization was provided by 
the psychiatrist’s independent global evaluation of 
the patient made at the conclusion of the study 
(improved and discharged or unimproved and still 
hospitalized). 

The Holtzman Ink Blot Test (1958) was also 
administered to each patient at these same three 
periods during the study: predrug, 5th week, and 
13th week. The records were scored for Barrier and 
Penetration in the usual manner. The Holtzman Ink 
Blots were utilized rather than the Rorschach test 
because two carefully equated forms of 45 cards 
each are available. These were administered in ABA 
or BAB sequence. Another reason for using this ink 
blot series was the fact that the subject (S) is re- 
quired to give only one response per card. Holtzman 
contends that this system results in a more uniform 
total response pattern from S to S. 

Because of various factors it was not possible 
to obtain repeat tests and ratings on all Ss. Some 
patients refused to respond to the ink blot test, 
either initially or on repeat testing; a few patients 
were discharged from the hospital before repeat 
testing could be obtained; one patient committed 
suicide. However, no one selective factor appeared 
to be operating to determine which patients would 
cooperate by responding initially or on repeat test- 
ing. For example, some patients rated as improved 
refused to cooperate on repeat tests, as well as those 
whose condition had deteriorated. 

Overall at least one repeat testing was obtained 
on 12 of the 18 patients in the Houston group and 
13 of the 24 patients comprising the Waco sample. 
Morbidity ratings based on the Lorr scale were 


also available on these 25 patients. Of these 25 
patients 17 were given morbidity ratings with sig- 
nificant decrement compared to their initial mor- 
bidity score by the 5th week of medication. Eight 
were unchanged or had increased in morbidity score. 
By the 13th week 19 patients were given significantly 
lower morbidity scores as compared with the pre- 
drug rating and 6 remained unchanged or worse. 
Seven of the 25 patients left the hospital on trial 
visit soon after the 5th week to go back to work. 
By the end of the experiment 17 of the patients had 
left the hospital as discharged or on trial visit. 
These figures are quoted in order to give some per- 
spective on the degree of change apparent in the 
clinical picture of these patients. 

Analysis of the data proceeded in the following 
manner. The 25 patients available for repeated rat- 
ing and testing were ranked in order of (a) decre- 
ment in morbidity, (b) decrement in Penetration 
score, (c) increment in Barrier score for the pre- 
drug vs. 5th-week and 13th-week evaluations. The 
patient with the highest decrement in morbidity 
score was given a rank of 1; the patient with the 
highest increment in morbidity (much sicker) ranked 
25th. For the body image scores, the patient with 
the greatest drop in Penetration score ranked one 
and the patient with the largest increase in Pene- 
tration score ranked 25th. For the Barrier score 
ranking the order was simply reversed. 

Although Holtzman feels that one advantage of 
his ink blot series is that comparability of total pro- 
ductivity is encouraged, in the present study patients 
varied considerably in their response totals from test 
to test. Accordingly it was found necessary to treat 
the body image scores as ratios in relation to the 
number of responses given. 
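
The ranking procedure described above can be sketched in code. This is a minimal illustration under stated assumptions, not the author's actual computation: the five patients and all of their scores are hypothetical, the function names are invented for the example, tied values are given the mean of their ranks, and rho is computed as the Pearson correlation of the two sets of ranks.

```python
def average_ranks(values):
    """Rank from largest (rank 1) to smallest; tied values share the mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i], reverse=True)
    ranks = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        # Extend the run while the next value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        mean_rank = (pos + end) / 2 + 1
        for k in range(pos, end + 1):
            ranks[order[k]] = mean_rank
        pos = end + 1
    return ranks


def spearman_rho(x, y):
    """Rank order correlation, computed as Pearson r on the two sets of ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


def ratio_decrement(pre_count, pre_total, post_count, post_total):
    """Drop in a body image score, each score taken as a proportion of responses."""
    return pre_count / pre_total - post_count / post_total


# Hypothetical patients: decrement in morbidity score, and Penetration counts
# with response totals (predrug count, predrug total, later count, later total).
morbidity_drop = [14, 9, 2, -3, 6]
penetration = [(12, 40, 3, 42), (9, 45, 4, 38), (5, 30, 5, 33),
               (4, 44, 8, 41), (7, 35, 2, 36)]

penetration_drop = [ratio_decrement(*p) for p in penetration]
rho = spearman_rho(morbidity_drop, penetration_drop)
print(round(rho, 2))
```

Expressing each score as a proportion of the response total before ranking is what keeps a patient who simply gave fewer responses on retest from appearing to have improved.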


HYPOTHESES AND RESULTS 


In view of the previous findings relating 
change in body image scores to personality 
reorganization, two predictions were made: 
(a) a positive correlation was expected be- 
tween decrement in morbidity and decrement 
in Penetration score; (b) a positive correla- 
tion was also predicted between decrement 
in morbidity and increment in Barrier score. 

A significant rank order correlation of .60 
was obtained between morbidity ranking and 
Penetration ranking for the predrug vs. 5th- 
week evaluation. Similarly a rho of .61 was 
obtained on the same rankings for the pre- 
drug vs. 13th-week evaluation. These findings 
support Hypothesis 1. 

For the Barrier score and morbidity rank- 
ings, however, a rho of .13 for the 5th week 
and a rho of —.02 for the 13th week were 
obtained. Neither correlation attains statisti- 

TABLE 1 

A COMPARISON OF PENETRATION AND BARRIER SCORES AND HOSPITAL DISPOSITION 

                        Penetration   Penetration   Barrier    Barrier 
                        Score         Score         Score      Score 
                        Decrease      Increase      Decrease   Increase 
Patients discharged 
  or on trial visit     15            2             8          9 
Patients retained 
  in hospital           3             5             4          4 
Chi square (a)                 4.7                       0.0 
P                            .02-.05 

(a) With Yates correction applied. 


cal significance and Hypothesis 2 is not sup- 
ported. 

The second criterion of degree of person- 
ality reorganization, the psychiatrist’s inde- 
pendent global rating of each patient, yielded 
similar results when compared with changes 
in body image scores. In Table 1 a compari- 
son by means of chi square technique is made 
between body image scores and patient dis- 
position by the conclusion of the study. It 
will be noted that patients considered suffi- 
ciently in remission to warrant hospital dis- 
charge or trial visit tend to show a decrease 
in Penetration score. A similar comparison 
with Barrier score yields a chi square of zero. 
Again a relationship between Penetration 
score and degree of personality reorganization 
is found but no such relationship holds for 
Barrier score. 
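
The chi square values in Table 1 can be reproduced from the four cell frequencies in each comparison. The sketch below is an illustration only: the function name is hypothetical, the Yates correction is implemented by shrinking each |observed - expected| difference by 0.5 without letting it pass zero, and the last Barrier cell is taken as 4, the value implied by the eight retained patients.

```python
def yates_chi_square(table):
    """Chi square for a 2 x 2 table with Yates correction for continuity."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            # Shrink the discrepancy by 0.5, but never below zero.
            diff = max(0.0, abs(observed - expected) - 0.5)
            chi2 += diff ** 2 / expected
    return chi2


# Rows: discharged or on trial visit, retained in hospital.
# Columns: score decrease, score increase.
penetration = [[15, 2], [3, 5]]
barrier = [[8, 9], [4, 4]]

print(round(yates_chi_square(penetration), 1))  # 4.7
print(round(yates_chi_square(barrier), 1))      # 0.0
```

With one degree of freedom, a value of 4.7 falls between the .05 and .02 points of the chi square distribution, which agrees with the probability entry in the table.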

Before discussing these findings, two other 
bits of data should be presented. First it is 
of interest to note qualitatively the nature 
of the decrement in Penetration score asso- 
ciated with rated improvement. According to 
the scoring scheme developed by Cleveland 
and Fisher for Penetration responses, not 
only are fantasies involving literal penetration 
of the body boundary included, but also 
many responses only symbolic of boundary 
indefiniteness are so scored. For example: “a 
shadow”; “soft mud”; “a transparent win- 
dow.” Previous research suggested that 
acutely disturbed schizophrenics tend to con- 
centrate on literal fantasies of boundary dis- 
ruption while excluding more symbolic repre- 
sentations. In the present study the patients 


gave an average of 9 responses involving 
literal disruption of the body boundary, 
human or animal, at the predrug or acutely 
disturbed period of testing. On later testing 
those patients rated as improved averaged 
only 1.4 responses involving literal boundary 
disruption. For example, one patient initially 
gave 45 responses involving literal penetra- 
tion of the body and only 3 such responses on 
subsequent testing. Another patient dropped 
from 22 to 5 responses. The following are 
examples of the kinds of Penetration re- 
sponses which tended to drop out with rated 
improvement on subsequent testing: “A man 
pulled in two by beasts”; “a body torn to 
pieces”; “bleeding vagina”; “a wound in the 
skin”; “a scalped head.” 

It is also of importance to report the relia- 
bility of Penetration and Barrier scores in a 
psychotic population undergoing no signifi- 
cant change in behavior. Opportunity to 
check the test-retest reliability of these two 
body image scores was afforded in a study 
conducted by Rhoda Fisher (1958).2 She ad- 
ministered the Rorschach on two occasions 
five days apart to 50 schizophrenic women. 
The test was administered during the first 
week of hospitalization and prior to initiation 
of any formal treatment program. Test-retest 
reliability for the Penetration score is .89 
and for Barrier score .65. The low reliability 
correlation for Barrier score is due to the 


2 I am indebted to Rhoda Fisher for loan of the 
Rorschach protocols. Scoring of the records and 
computation of the reliability coefficients were car- 
ried out by the present author. 

extremely limited range of Barrier responses. 
For example, Penetration scores range 0-11, 
while most of the Barrier responses fall 
within the range 0-4. 

It is of further interest that 25 of Fisher’s 
psychotic women were subjected to a stress 
situation in the nature of a gynecological 
examination immediately before the initial 
Rorschach testing. This was done in order 
to study the differential effect on a number 
of psychological tests of a situational stress 
period associated with the testing. In spite 
of the fact that this examination constituted 
a literal penetration of the body of these 25 
women, Penetration scores obtained from 
their Rorschach records did not differ signifi- 
cantly from first to second testing nor from 
those of the control group of 25 women re- 
ceiving no such stress prior to testing. These 
findings support the contention that Penetra- 
tion score represents a basic personality vari- 
able involving orientation towards one’s body 
image boundaries and is not merely a reaction 
to the discomfort of body symptoms or situa- 
tional stress. 


Cross-VALIDATION STUDY 


Opportunity to check on the present find- 
ings relating body image scores to rated be- 
havioral change was provided in a study 
conducted by Goldman (1960).3 Forty-five 
cooperative, acutely ill schizophrenic patients, 
ages 16-40, were the Ss of this study. Each 
patient was administered the Rorschach on 
admission and again upon leaving the hos- 
pital. Each patient was rated twice by two 
psychiatrists using an 11-item scale. A first 
rating was done on admission and a second 
at time of discharge. The psychiatrists agreed 
that 33 of the 45 cases showed clinical im- 
provement from first to second rating and 
that 14 of these showed marked improve- 
ment. Twelve cases showed no change. 

The Rorschach records were scored by the 
present author for Barrier and Penetration 
responses. This scoring proceeded on a blind 
basis as far as knowledge of the psychiatrists’ 
ratings was concerned. Total number of re- 
sponses was controlled by reducing the re- 


3 I am indebted to Rosaline Goldman for loan of 
her Rorschach data and behavioral ratings. 


sponse total for the larger of the two records 
to that of the smaller. 

When the body image scores were compared 
with psychiatric ratings, the following 
differences appeared. For the patients rated 
as improved (N = 33), a mean Penetration 
score of 3.97 was obtained on first testing, 
and 1.97 on second testing. A t test for difference 
in means of related measures yields 
a value of 3.45, significant beyond the .01 
level. Markedly improved patients averaged 
4.18 initially on Penetration and 1.91 at discharge 
(t = 7.85, significant beyond .01 
level). Unimproved patients also reduced 
Penetration scores slightly from first to second 
testing but not to a significant degree. 
For Barrier, the only significant change occurred 
in the markedly improved group, who 
raised their mean score from 1.82 to 3.55, 
first to second testing (t = 7.52; P < .01). 
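
The t test for difference in means of related measures can be sketched as follows. The scores below are hypothetical and serve only to show the computation; they are not Goldman's data, and the function name `paired_t` is an assumption made for the example. The statistic is the mean of the within-patient differences divided by its standard error, with N - 1 degrees of freedom.

```python
from math import sqrt


def paired_t(first, second):
    """t for the difference in means of related (paired) measures."""
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    mean_d = sum(diffs) / n
    # Unbiased variance of the differences (divisor n - 1).
    var_d = sum((d - mean_d) ** 2 for d in diffs) / (n - 1)
    return mean_d / sqrt(var_d / n)


# Hypothetical Penetration scores for six patients.
admission = [5, 3, 6, 2, 4, 4]
discharge = [2, 2, 3, 1, 3, 1]

t = paired_t(admission, discharge)
print(round(t, 2), "with", len(admission) - 1, "degrees of freedom")
```

Because each patient serves as his own control, the test is carried out on the differences rather than on the two sets of scores separately.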

This independent study supports the origi- 
nal findings relating decrement in Penetration 
score to rated behavioral improvement. An 
additional finding, hypothesized but not substantiated 
in the first study, concerns significant 
increment in Barrier score being related 
to marked clinical improvement. These re- 
sults are interpreted as lending further sup- 
port to the idea that definiteness in body 
image parallels degree of personality organi- 
zation. 


DISCUSSION 


Findings in the present study are not re- 
ported with any claim concerning degree of 
recovery or “cure” in schizophrenia. No infer- 
ence is intended that in 5 or 13 weeks, de- 
spite a reduction in morbidity ratings, a cure 
or complete remission in schizophrenia has 
been effected. Some of the patients discharged 
from the hospital at the conclusion of the 
study have been rehospitalized with a recurrence 
of symptoms. However, the present 
concern is not with the therapeutic results of 
the study nor with the relative effectiveness 
of different tranquilizers. The focus for the 
present study is on the nature of the shift in 
certain aspects of body image within the 
same individual as he proceeds through his 
psychosis. 

As has been shown, the major change occurring 
is a dramatic firming up and defining 
of the body image boundary. This clarifica- 
tion of the boundaries takes the form of a 
sharp reduction in fantasies involving disrup- 
tion and dissolution of the body image boun- 
daries. Acutely ill schizophrenics present a 
concentration of morbid, wildly disrupted 
fantasies portraying a literal breakdown of 
the body boundaries. Under a therapeutic 
regime these bizarre fantasies very quickly 
disappear, indeed, at least as early as five 
weeks after introduction of medication. 

Alteration in the boundary aspect of body 
image in schizophrenia is a finding in keeping 
with a number of studies. Szasz (1957) ob- 
serves that the profusion of bodily feelings 
in schizophrenia expresses “the inner experi- 
ence of personal dissolution at the periph- 
ery... .” Szasz has enumerated a series of 
phases in the genesis of bodily feelings in 
schizophrenia. Of interest to the present 
paper is his observation that in the acute 
and active period of a psychotic break, bodily 
preoccupation is focused largely on the loss 
of the body boundary as an object to which 
to relate. In terms of the present study it is 
conceivable that administration of tranquiliz- 
ers and closed ward hospitalization serve to 
provide for the patient a sharper delimitation 
of body image from environment. 

Other investigators emphasizing the role of 
body image boundaries in schizophrenia in- 
clude Bender & Keeler (1952) who have 
studied body image formation in schizo- 
phrenic children via the medium of figure 
drawings. They conclude that schizophrenic 
children “show in particular a difficulty in 
determining the periphery of their own body 
or the boundaries of their personality or ‘ego 
boundaries.’ There is an indefiniteness and 
blurring of the zone between their own body 
and the outside world.” These conclusions 
were drawn largely from the gross distortions 
of various body parts introduced into the 
drawings by the children. Interestingly, in 
comparison with the present findings concern- 
ing Penetration responses, Bender and Keeler 
noted especially that “body orifices were 
accentuated and displaced and body boun- 
daries received undue treatment” in the chil- 
dren’s drawings. 


Lastly, Weckowicz and Sommer 4 have explored 
body image in a schizophrenic and 
nonschizophrenic population. They find that 
schizophrenics consistently estimate pictures 
of hands and feet to be significantly smaller 
than do nonschizophrenics. Schizophrenics 
also make significantly smaller self drawings 
of hands and feet when viewing themselves 
in a mirror than do nonschizophrenics. The 
groups did not differ in size estimation or 
drawings of central portions of the body. The 
authors attribute this perception by schizo- 
phrenics of distal parts of the body as smaller 
to a shrinking of the periphery of the body 
image. 

The importance of establishing a well de- 
fined body image boundary as a prelude to 
more total ego reorganization has long been 
recognized by some treatment programs. For 
example, Reich (1949), Bettelheim (1950) 
and others have formalized the importance 
of the reinforcing of body limits in the treat- 
ment of schizophrenic and severely neurotic 
patients. Programs of physiotherapy, hydro- 
therapy, and cosmetics all implicitly reinforce 
the patient’s body periphery by providing 
maximal stimulation to this area. Again what 
is emphasized is the value of demonstrating 
to the schizophrenic the reality and existence 
of his own body boundaries. With the return 
of the substance of his body periphery, the 
schizophrenic can then generalize this body 
reality to the area of ego reality. 


SUMMARY 


Twenty-five recently hospitalized schizo- 
phrenics were administered the Holtzman Ink 
Blot Test and Lorr scale prior to being 
placed on an unknown medication. These 
measures were repeated after 5 and 13 weeks 
on medications. The ink blot test was scored 
for the body image indices of Barrier and 
Penetration. A morbidity score obtained from 
the Lorr scale ratings constituted one cri- 
terion of degree of personality reorganiza- 
tion. Psychiatrist’s disposition of the patients 


4 Personal communication. Weckowicz, T. E., and 
Sommer, R. Body image and self-concept in schizo- 
phrenia. Unpublished paper, 1959. 

by the end of the study (discharged vs. still 
hospitalized) constituted a second criterion. 

A rho of .60 between decrement in Pene- 
tration score and decrement in morbidity for 
the 5th-week evaluation was obtained. A rho 
of .61 was maintained for the same rank- 
ings at 13 weeks. Increment in Barrier score 
was not found related to morbidity ranking. 
Similar results were obtained for the second 
criterion measure, psychiatrist’s disposition of 
the patient vs. body image scores. Implica- 
tions of these results for the nature of body 
image changes in schizophrenia are discussed. 
FACTOR ANALYSES OF THE RORSCHACH ¹


BERNARD I. MURSTEIN 
University of Portland 


Despite the plethora of studies utilizing 
the Rorschach (currently well past the 2,000 
mark) the amount of knowledge obtained has 
been disappointing. The reason for this unfortunate
circumstance has been the refractoriness
of the technique to crucial experimentation.
While tens of iconoclastic
researchers have hammered away at the 
folklore of the Rorschach (e.g., the number 
of responses to Cards VIII, IX, X reflect 
emotional reactivity) the “defenders” have 
retorted that current statistical methods are 
insensitive to the important configurational 
and test-interpretive aspects of the instru- 
ment. 

The increasing importance of factor analy- 
sis in psychology made it inevitable that it 
would be utilized in research involving the 
Rorschach. It is the purpose of this paper 
to assess the contribution that factor analysis 
has made with regard to this technique. 

The problems inherent in the use of factor 
analysis will be dealt with first. Next the 
difficulties peculiar to the use of the Rorschach
as a psychometric instrument will be
discussed, followed by a section on the various
Rorschach hypotheses tested by means
of factor-analytic methodology. Then, a sec- 
tion listing the major new contributions to 
the Rorschach will be presented, followed 
finally by a discussion of the newer Rors- 
chach innovations which have attempted to 
create an instrument more amenable to psychometric
treatment. A summary of each factor
analysis involving the Rorschach only is
presented in Table 1. Studies in which the 
Rorschach was investigated in addition to 
other variables are summarized in Table 2. 
These tables indicate the population sampled,
the number and kinds of variables studied,
the types of correlations used, and the factors
obtained.


¹ The author would like to express his appreciation
to William A. Botzum and Walter G. Klopfer
for generously giving of their time in reading the
manuscript and offering their criticisms.


PROBLEMS INHERENT IN FACTOR ANALYSIS 


In a study of Yale undergraduates, Witten- 
born (1950a) used the factor-analytic method 
to test hypotheses. Thus, he stated four hy- 
potheses and attempted to substantiate them 
via his rotated factors. Whether this is a 
valid test is debatable since solutions with 
or without rotation are not mathematically 
unique. There are many possible rotations 
and one may rotate the axes infinitely until 
one gets them into the position which best 
substantiates the hypothesis. 
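
The indeterminacy referred to here can be checked numerically. The following is only an illustrative sketch: the loadings are invented for four hypothetical variables on two orthogonal factors and are not taken from Wittenborn's data. It shows that an orthogonal rotation through any angle alters the individual loadings while leaving the reproduced correlations (and communalities) untouched, so the data cannot by themselves single out one rotated position.

```python
import math

# Hypothetical unrotated loadings of four variables on two orthogonal factors.
LOADINGS = [
    [0.80, 0.30],
    [0.70, 0.40],
    [0.20, 0.75],
    [0.10, 0.65],
]


def rotate(loadings, degrees):
    """Apply an orthogonal rotation of the two factor axes by the given angle."""
    t = math.radians(degrees)
    c, s = math.cos(t), math.sin(t)
    return [[a * c + b * s, -a * s + b * c] for a, b in loadings]


def reproduced(loadings):
    """Correlations implied by the loadings (A times A transposed).

    The diagonal entries are the communalities.
    """
    return [[sum(x * y for x, y in zip(row_i, row_j)) for row_j in loadings]
            for row_i in loadings]


def max_abs_diff(m1, m2):
    """Largest absolute element-wise difference between two matrices."""
    return max(abs(a - b) for r1, r2 in zip(m1, m2) for a, b in zip(r1, r2))


if __name__ == "__main__":
    base = reproduced(LOADINGS)
    for angle in (0, 20, 45, 70):
        rotated = rotate(LOADINGS, angle)
        change = max_abs_diff(base, reproduced(rotated))
        print(angle, [round(v, 2) for v in rotated[0]], "change:", round(change, 12))
```

Every angle fits the correlations equally well; only an external criterion, such as simple structure applied blindly, distinguishes one position from another.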

It would seem that an “objective” means 
for evaluating his hypotheses would have 
been through “blind rotations” (i.e. rotation 
according to some criterion such as “simple 
structure,” but without knowledge of the 
nature of the variables involved). It is not 
apparent from the report that Wittenborn 
followed this procedure. 

Among the assumptions made in the use of 
factor analysis, two are especially cogent with 
reference to the Rorschach. These are: (a) 
the variables must be linearly additive, and 
(b) the variables must be formally independ- 
ent. A violation of the latter assumption oc- 
curs in the work of Adcock (1951). He treats 
as independent scores the ratios W : M and 
M:C. The common element in both (M), 
assures that if the former ratio is low due to 
the presence of many Ms the latter ratio 
must be high because M is now in the numer- 
ator of the second ratio as compared to its 
being in the denominator of the first ratio. 
Another example of linear dependency is the 
use of percentage scores. Hence, a high negative
loading for W% virtually assures a negligible
or high positive loading for D% or
Dr%. 
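
Both forms of dependency can be illustrated with a small sketch. The counts below are invented: every combination of a few hypothetical W, D, and Dd values (and, separately, W, M, and C values) is used once, so the raw counts are uncorrelated by construction. Any correlation that then appears among the percentages or among the ratios is produced by the scoring arithmetic alone.

```python
import math
from itertools import product


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical protocols: every combination of W, D, and Dd counts,
# so the raw counts are mutually uncorrelated by design.
protocols = list(product(range(1, 11), range(5, 26, 5), range(0, 9, 2)))
w = [p[0] for p in protocols]
d = [p[1] for p in protocols]
w_pct = [100 * p[0] / sum(p) for p in protocols]
d_pct = [100 * p[1] / sum(p) for p in protocols]

r_raw = pearson(w, d)          # raw counts: zero by construction
r_pct = pearson(w_pct, d_pct)  # percentages: forced negative

# Ratios sharing a term, as in W : M and M : C (hypothetical counts again).
ratio_grid = list(product(range(2, 11), range(1, 9), range(1, 7)))  # (W, M, C)
w_to_m = [wv / mv for wv, mv, cv in ratio_grid]
m_to_c = [mv / cv for wv, mv, cv in ratio_grid]
r_ratio = pearson(w_to_m, m_to_c)  # negative although W, M, C are uncorrelated

if __name__ == "__main__":
    print("raw W vs D:", round(r_raw, 3))
    print("W% vs D%:", round(r_pct, 3))
    print("W:M vs M:C:", round(r_ratio, 3))
```

The sizes of the induced correlations depend entirely on the ranges assumed here; only their signs are the point of the sketch.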
TABLE 1
SUMMARY OF FACTOR-ANALYTIC STUDIES WITH THE RORSCHACH


Hsu (1947)
    Subjects: 76 emotionally disturbed children, ages 5-15
    No. of variables analyzed: 15
    Type of correlation: Tetrachoric
    Factors extracted and solution: Orthogonal
        Facility in use of words
        Facility in use of nouns
        “Face” factor
        Facility for verbs and adjectives
        “Human” factor

Wittenborn (1949) b
    Subjects: 247 Yale undergraduates
    No. of variables analyzed: 18
    Type of correlation: Tetrachoric
    Factors extracted and solution: Orthogonal
        Six factors
      Oblique
        Seven factors; interpretation of factors is largely avoided

Wittenborn (1950a)
    Subjects: 92 Yale undergraduates
    No. of variables analyzed: 21
    Type of correlation: Pearson
    Factors extracted and solution: Orthogonal
        Productivity
        Low perceptual control
        Empathy
        One unnamed factor

Wittenborn (1950b)
    Subjects: 160 in- and out-patients from psychiatric clinic
    No. of variables analyzed: 21
    Type of correlation: Pearson
    Factors extracted and solution: Orthogonal
        Bizarre originality
        Low perceptual control
        Productivity
        Empathy ?

Adcock (1951)
    Subjects: 88 Cook Island children; 30 New Zealand children
    No. of variables analyzed: 15
    Type of correlation: Tetrachoric
    Factors extracted and solution:
      Cook Island Ss: Oblique
        Fluency
        Introversion
        Intelligence
        Construction
      New Zealanders: Orthogonal
        Fluency
        Introversion
        Intelligence

Coan (1956)
    Subjects: Wittenborn (1950a) 92 Yale undergraduate data
    No. of variables analyzed: 12
    Type of correlation: Pearson
    Factors extracted and solution: Orthogonal
        Productivity
        Low perceptual control (anxiety)
        Low perceptual control (passive submission to emotional impulses)
        Intratensivity
        Inner control and empathy
        Outer control

Stotsky (1957)
    Type of correlation: Tetrachoric
    Location
        Subjects: 148 schizophrenics
        No. of variables analyzed: 3
        Factors extracted and solution: Orthogonal (location scores)
            Whole
            Common and rare details
    Determinants
        Subjects: 148 schizophrenics
        No. of variables analyzed: 5
        Factors extracted and solution: Orthogonal (determinant scores)
            Form
            Color and shading


a Only Card I administered.
b Group administration (Harrower-Erickson check list).
TABLE 2
RORSCHACH INCLUDED AMONG OTHER TESTS

Columns: Investigator; Subjects; No. of Variables Analyzed; Type of Correlation; Variables; Factors Extracted and Solution
Orthogonal 
i 38 Tetrachoric Verbal Intelligence, Non- Fluency 
Sen (1950) tao a eo ad verbal Intelligence and Cognitive 
stu 36 Rorschach variables (synthetic vs, analytic) 
motional z 
(neurotic vs, non-neurotic) 
Group factors ‘ 
Fluency (basic factor) 
General intelligence 
General emotionality _ 
Reproductive association 
Orthogonal 
‘chiatric hospi- 22 Tetrachoric 22 signs of organicity from Organic factor; factors 2 
Hughes (1950) TOO p patients Rorschach through 8 not labeled. 
Orthogonal 
120 (60 normal school 27 Tetrachoric School vs. clinic category, Productivity 
Cox (1951) Foe aar mat Q, 25 Rorschach varis Normal vs. maladjustment 
acne clinic boys) ables Intelligence vs, laok ol 
intelligence 
Initative—Passivity 
Accurate-inaccurate per- 
ception 
Orthogonal 
sychiatric patients 50 ? 50 psychiatric patients High vs. low productivity 4 
Sarane SOREA D (Modified Ọ analysis) Perception of internal, ana. t 
kn tomical objects vs. per- 
ception of external ob- 
jects 
Animate percepts vs. in- 
animate percepts 
Defensive percepts vs. well- 
defined human parts 
Orthogona 
Lotsof (1953) 30 college students 11 Pearson Intelligence test, total Verbal intelligence 
words, adjectives, verbs, Productivity 
7 Rorschach determin. Elaboration 
ants Individuality 
Orthogonal | 
Williams and 100 psychiatric patients 22 Tetrachoric Verbal IQ, Performance Productivity 
Lawrence (1953) IQ, 20 Rorschach deter- Lack of perceptual control 
minants Intelligence F 
Movement 
Shading | 
Orthogonal and Oblique | 
Williams and 100 psychiatric patients 32 Pearson and 21 Rorschach Rorschach productivity 
Lawrence (1954) Tetrachorie 11 MMPI Ego strength 5 
Expressive-Repressive 
MMPI maladjustment 
Foster (1955) 1. 54 Policemen 11 Tetrachoric Vocabulary, Verbal IQ, Factors identified from two 
Anxiety ratings, 8 Ror- ae loadings on com- 
schach variables oon items (proportional z 
profiles) 
Orthogonal 
3 m tied spontaneity 
2. 28 College students 19  Tetrachorie Allport-Vernon Scale of Inhibition of Pe oe 
values 6, MMPI 6, Vere inal delve 
Wechsler 3, Rorschach poesia 
vari: nx! f z, 
variables 4. Anxiety. pressive 
hogonal 
Borgatta and 125 Alr F ore an 
Eschenbach Alr Force personnel 40 Pearson Rorschach variables 15, Rorschach a ae i 
(195 Interpersonal perform- Formal properties 
ance scores 11, rating schach 


scales 7, PMA 3, Actu- 
arial 4, 


Abstract intelligence 
Social intelligence 
Emotional assertiveness 
Task leadership 

Social acceptability 
Maturity-adjustment 
TABLE 2 (Continued)

Columns: Investigator; Subjects; No. of Variables Analyzed; Type of Correlation; Variables; Factors Extracted and Solution
Oblique 
Singer, 100 male veteran 23 Phi coeffici- Rorschach variables 7, Motor inhibition and plan- 
Wilensky & schizophrenics “ent Wechsler Dellevie 1a, fulness 
McCraven (1957) Barron's Movement Ambitiousness 
Threshold Inkblots, TAT Emotional surgency 
Transcendence Index, Introspectiveness 
Planning Self-Ratings, 
Porteus Mazes, Wechsler's 
Number Square test 3, 
Time Estimation, Block 
Authority Reaction, 
Motor Inabilities Test, 
Digit Frustration, Be- 
havior Ratings 4. 
Orthogonal 
Consalvi & 45 adults (6th grade 14 Pearson Raven progressive matri- Intelligence 
Center (1957) education to profess- ces, Vocabulary, 12 Ror- Low form 
ional college) schach Variables Productivity 
Movement 
Orthogonal 
Lotsof, Comrey, 72 underachieving 33 Pearson Acturial Variables 3, Ver- Verbal Intelligence 
Borgartz, & school children bal IQ, Performance IQ, Productivity 
Arsfield (1958) Wechsler subtests 12, Perceptual nonhuman 
Rorschach variables 16. sf movement 
Age 
Performance IQ 
Seven unnamed factors 
x 
Oblique 
- Veiner, 150 physically disabled 35 Tetrachoric Otis S. A. test of mental Withdrawal 
Stafford, & male war veterans ability, Bender-Gestalt Tension 


of 


Krugman (1958) 


4, Rorschach 12, Draw- 
A-Person 6, Sentence 
Completion 7, Physical 
pathology 5. 


Need for social approval 
Anxiety 
Dependency-Passivity 
Affective Lability 


Pseudo-Intellectual Defens- 
iveness 


The variation in methods of rotation also 
has doubtlessly contributed to the emergence 
of different factors. While the centroid solu- 
tion with either orthogonal or oblique rota- 
tion has been favored, other variations have 
been used. The British analyses have included 
Burt’s simple summation and group methods. 
Other variations have included the Kaiser 
varimax method and various principal components
solutions. The use of different methods
contributes somewhat to the differences
in solution. 

Another problem is the fact that there is 
currently no method of obtaining a standard 
error of a factor loading. Hence the stability 
of the position of each factorial loading can 
be only indirectly assessed via such criteria 
as the number of subjects (Ss) used and the 
type of correlation employed. There are ways 
of reducing the risk of error by the employ- 
ment of techniques such as repetition of pre- 
viously found factors, rotation to known 
“psychological conditions” and the use of
“marker” variables whose factorial makeup
is known. These aids, however, are often not 
present in the first factorial investigation of 
a test. 


PROBLEMS SPECIFIC TO THE RORSCHACH 
Nonstatistical Problems 


While the Rorschach is ostensibly a per- 
ceptual test, operationally speaking it must 
be considered a verbal task only indirectly 
getting at the perception via the verbaliza- 
tion. This fact may not cause any conster- 
nation until one reflects fully upon the fact 
that the testee must manifest an extreme 
sensitivity to the determinants of his perception.
He must communicate the importance
of form, and in the case of using
shading, whether the concept is two- or
three-dimensional, whether it involves the
perception of “texture” or perhaps “vista”
to mention just a few possibilities. But even
if a person possesses a keen awareness of the
determinants of his perception, does he possess
the vocabulary with which to communicate
them to the examiner? It is apparent
that to convey verbally the nuances of his 
perception a person needs a precise and ex- 
tensive vocabulary. One may well wonder 
whether the dearth of determinants found 
in low socio-economic class protocols reflects 
a “simple” perceptual world or a small and 
limited vocabulary.

The fact that a few “incidental” words 
may change the scoring of a determinant is 
yet another difficulty. If two men are seen 
on Card III the determinant might be an F.
But, if on inquiry the two men are “sitting 
down,” the response is scored not F, but M. 
The examiner’s task is to determine whether 
the original percept involved movement but 
was not verbalized, or whether the movement 
really was not a kinesthetic perception but 
a verbal embellishment. 

The fact that a single perception may be 
subject to many different scorings, depending 
upon the verbalization of the S, is another 
weakness in the test as Levin (1953), Baugh- 
man (1954, 1958, 1959), and Murstein 
(1958) have pointed out. This point is amply 
demonstrated in a study by Stotsky (1957) 
where the determinants D and F loaded .93 
and .96 respectively on the same factor. If 
almost every percept scored D for location 
is also form-determined (F), then they can 
hardly be conceived of as independent vari- 
ables. 

The same situation holds for the treatment 
of certain determinants and content scores 
as independent items. Can there be human 
movement (M) by other than a human being 
(H)? In a small number of cases the answer 
is yes, e.g., animals perceived as “dancing a 
polka.” In most instances, however, the num- 
ber of M correlates .90 or above with the 
number of H + Hd. Such a correlation then 
involves little psychologically meaningful co- 
variation but is in essence an artifact. Its 
presence in the factor matrix may lead to 
erroneous interpretation. 

Another problem is that if a perception
involves several determinants, a specific order
of scoring may be used which credits only
the determinant highest in the hierarchy, so
that it appears to be present more often than the
less favored determinants. Accordingly, M
takes predominance over C, which takes predominance
over Fc, etc. For maximal interpretative
value, however, the number of responses
should be independent of the frequency
of occurrence of the determinants.

Clinicians through the years have grumbled 
over the necessity of learning at least two 
systems of scoring (the Beck and Klopfer
systems), not to mention the somewhat less
popular Hertz and Buhler variations. The 
minor difficulty of understanding the various 
Rorschach dialects is simple in comparison 
to the man-sized task facing the reader of the 
Rorschach factor analyses. It is common 
knowledge that factors are not invariant when 
the population or tests used in the factorial 
matrix are varied extensively. A brief listing 
of procedural variations should suffice to indi- 
cate that no two Rorschach matrices reviewed 
herein have been identical: 


1. Different tests have been used. The
Harrower-Erickson check list, for example
(Wittenborn 1950a), is not comparable to
the usual Rorschach.

2. Different scoring systems have been 
used such as the Beck and Klopfer ones, in 
addition to mixtures in varying strengths of 
the two. 

3. Some analyses include the form level 
of a given determinant, others do not. 

4. Sometimes the whole test has been ana- 
lyzed, at other times only a part of the test. 

5. Sometimes the instructions have been
the “usual” ² ones while at other times the Ss
have been told to produce a fixed number of
responses.

6. The researchers have differed in their 
choice of variables to analyze. No two mat- 
rices analyzed by different investigators have 
contained the identical variables.


7. The telescoping of scores has been freely
undertaken. Thus, one investigator used three
separate color variables, FC, CF, and C;
another used only ΣC. Some have analyzed
shading and texture as separate scores while
another has combined shading and texture
into one score.

8. Some researchers have correlated the
raw scores; others normalized scores; still
others used percentages and ratios.


² Their “usual” instructions are “usual” only in a
gross sense since, as has been brought to this author’s
attention in a personal communication by Walter G.
Klopfer, the Beck system by its instructions implies
a much stronger set to produce responses than does
the Klopfer system.


Statistical Problems 


Choice of correlation. The Pearson product- 
moment correlation is the most frequent sta- 
tistic used when a measure of linear regression 
is involved, and several studies have utilized 
it. It is, however, justifiably applied only 
when (a) the regression is linear and (b) the 
variables possess the property of homoscedasticity.
It is apparent from common clinical
experience as well as such articles as the one 
by Fiske and Baughman (1953), that the 
relationship between most of the Rorschach 
variables is not linear due to the stimulus 
limitations of the cards which favor the dif- 
ferential occurrence of the determinants. 

Some investigators have realized the in- 
applicability of the Pearson r and substituted 
instead the tetrachoric correlation. This statistic,
however, also assumes normality of
distribution and linear regression. In addition,
the tetrachoric r has a greater standard
error than the Pearson r and tends to overestimate
the true correlation. To avoid the
dubiousness of using the Pearson r as well as
the tetrachoric r, the phi coefficient has been 
used because it does not assume underlying 
linearity nor a normal distribution. Its limi- 
tations, however, are formidable. It is not 
an estimate of a parameter, does not always 
vary from +1 to —1, and has no confidence 
limits. Another approach is to normalize the 
raw scores and then correlate the normalized 
scores using the Pearson r. This approach, 
however, seems to violate the logic involved 
in normalizing data. The assumption is usu- 
ally made in normalizing data that the trait 
or ability would be distributed normally in 
the population if we had sensitive enough 
measurements. It is the nature of the Rorschach
itself and not only the crudeness of
the quantitative scores which does not justify
a normalizing procedure. With the small number
of 10 cards and the various stimulus
limitations inherent in these cards it is un- 
likely that the perception of the various de- 
terminants can in any way be assumed to be 
basically normally distributed. In any event,
it hardly could be claimed that the small 
select populations used in the factor-analytic 
studies were themselves normally distributed, 
thus justifying the use of such scores with 
them. 
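
One of the limitations of the phi coefficient noted above, that it does not always vary from +1 to -1, can be shown with a small sketch. The frequencies are invented: a hypothetical sample of 100 Ss in which one dichotomized score is "present" in 20 cases and another in 50. The helper `phi_max` is an illustrative name introduced here; it fills the fourfold table so that agreement is as great as the marginals allow.

```python
import math


def phi(a, b, c, d):
    """Phi coefficient for a fourfold table [[a, b], [c, d]] of frequencies."""
    numerator = a * d - b * c
    denominator = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    return numerator / denominator


def phi_max(row1_total, col1_total, n):
    """Largest phi attainable with the marginals held fixed.

    The joint "present/present" cell is made as large as the marginals
    permit; the remaining cells then follow by subtraction.
    """
    a = min(row1_total, col1_total)
    b = row1_total - a
    c = col1_total - a
    d = n - a - b - c
    return phi(a, b, c, d)


if __name__ == "__main__":
    # Unequal splits (20/80 against 50/50): the ceiling falls well short of 1.
    print(phi_max(20, 50, 100))
    # Equal splits (50/50 against 50/50): the ceiling reaches 1.
    print(phi_max(50, 50, 100))
```

With unequal splits, then, even a perfect ordering of the Ss cannot yield a phi of unity, and coefficients computed from differently split variables are not directly comparable.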

Yet another problem involved in the use 
of normalized scores is that they have the 
further disadvantage of transformations in 
general, in that they render the original data 
meaningless, or as Wittenborn has succinctly 
put it, it “analyses the Rorschach as it isn’t.” 
The resulting conclusion is that the use of 
correlational indices is, strictly speaking,
unjustified. The crucial question, then, is how
much does one distort the findings in using
a given statistic? This question is extremely
difficult, if at all possible, to answer. Until
some statistician solves the dilemma one may 
only conclude that most of the correlational 
matrices used have had an indeterminate 
amount of error variance and gave at best 
only approximate factorial solutions. 

Another problem is the nonmutually exclu- 
sive character of Rorschach scores. These 
scores by their nature determine that the 
correlations between the components must on 
the average be negative. This is easily seen 
with the location determinants, W, D, and 
Dd. If in a given protocol, the total number 
of responses is 30, and 8 responses are scored 
W, the limitation is that the remaining re- 
sponses must total 22. In a series of proto- 
cols then there must be a perfect negative 
relationship between W and D + Dd, if R is 
a constant. That R is not constant from per- 
son to person in most studies simply means 
that this relationship will be attenuated some- 
what but still present. 
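
The arithmetic of this constraint can be sketched as follows. The protocols are invented, and in the second case W is assumed to vary independently of R (an assumption made only for the illustration; in real protocols W may rise with R, which would weaken the relationship further).

```python
import math
from itertools import product


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def w_vs_rest(protocols):
    """Correlate W with D + Dd; each protocol is (R, W) and D + Dd = R - W."""
    w = [wv for _, wv in protocols]
    rest = [r - wv for r, wv in protocols]
    return pearson(w, rest)


W_VALUES = [4, 6, 8, 10, 12]

# Case 1: every hypothetical protocol has exactly 30 responses.
fixed_r = [(30, wv) for wv in W_VALUES]

# Case 2: R varies modestly (28, 30, or 32) and independently of W.
varying_r = list(product([28, 30, 32], W_VALUES))

if __name__ == "__main__":
    print("R constant:", round(w_vs_rest(fixed_r), 3))
    print("R varying: ", round(w_vs_rest(varying_r), 3))
```

Under these assumptions the first correlation is exactly -1 and the second is attenuated but remains clearly negative.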

The use of percentages instead of raw 
scores may often lead to unwarranted con- 
clusions as illustrated in a study by Sen 
(1950). By means of judges’ ratings and 
Burt’s Rorschach content method, a signifi- 
cant correlation was reported between the 
score “imagination” and the “productivity” 
factor. Sen noted that M, often assumed to
tap “inner creativity,” had no loading on this
factor. This finding is attenuated because
M% was used instead of number of M. Consequently
it may well have been that imaginative
individuals manifested a high R along
with a high number of M. If, however, the
slope of ΔM/ΔR is one of negative acceleration
as R increases, as seems indicated by the
data of Fiske and Baughman (1953), the
M% will not remain constant with the increase
in number of responses, but rather
decrease somewhat. Thus, the failure of M%
to load on the factor of productivity may
reflect not lack of imagination but the stimu- 
lus limitation for the perception of M on 
the cards. 
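
A brief sketch makes the point concrete. The square-root curve and its scale are assumptions chosen only because they give a negatively accelerated rise of M with R; they are not fitted to the data of Fiske and Baughman.

```python
import math


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def m_count(r, scale=1.2):
    """Hypothetical number of M for a protocol of r responses.

    A square-root curve: M keeps rising with R, but ever more slowly.
    """
    return scale * math.sqrt(r)


R_VALUES = [10, 20, 30, 40, 60, 80]
m = [m_count(r) for r in R_VALUES]
m_pct = [100 * mv / r for mv, r in zip(m, R_VALUES)]

if __name__ == "__main__":
    for r, mv, pct in zip(R_VALUES, m, m_pct):
        print(f"R={r:3d}  M={mv:5.2f}  M%={pct:5.1f}")
    print("r(R, M)  =", round(pearson(R_VALUES, m), 3))
    print("r(R, M%) =", round(pearson(R_VALUES, m_pct), 3))
```

Under such a curve the number of M rises with productivity while M% falls, so an analysis based on M% could show no loading, or even a negative one, on a productivity factor although the more productive Ss give more M.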

Unfortunately, the Rorschach Psychodiag- 
nostic was not conceived by a man familiar 
with the problems of quantitative measurement.
Indeed, Rorschach scores like M, F,
CF, etc., are not scores at all but a shorthand
for clinical impressions. By way of comparison,
a Wechsler Vocabulary Scale score of 18
indicates that the S is extremely conversant
with words, and probably of extremely high
intelligence level (r of .85 with Full Scale
IQ). That an S has seven Ms in his protocol
tells us very little. One wishes to know what
the form level is, whether the Ms are mani- 
fested spontaneously or elicited via inquiry, 
whether the Ms were given on Card III or 
IV, and whether they refer to a large or small 
section of the blot. All of these questions are 
important to the clinician but usually none 
are reflected in factor analyses of the Rors- 
chach. It is little wonder then that succes- 
sive analyses with these “scores” seldom are 
comparable. 

Another problem often ignored is the ex- 
tremely low reliability of many Rorschach 
scores, including the various color and shad- 
ing measures. One cannot expect very mean- 
ingful factors unless the components which 
compose these factors, i.e. the correlations, 
possess adequate reliability. 

The problem of R, the number of responses 
as a variable contributing spurious variance 
to the factorial structure has been excellently 
dealt with in a study by Coan (1956). He 
states that: 


The correlation of R with any location or deter- 
minant score is actually a part-whole correlation, 
since the sum of either the primary determinant 
scores or of the location scores will equal R. Simi- 
larly, P and O are functions of protocol length, since 
their size is limited by the total number of responses 
and longer protocols present more opportunity for 
popular and original responses to occur. On a similar
basis, certain specific location and determinant
scores may be expected to correlate especially highly 
with R, so that even percentage scores will show 
certain systematic correlation effects (p. 282). 


As an antidote to this problem, Coan proceeded
to analyze his data omitting R. The
omission of R, however, does not necessarily 
remove the spurious component of correla- 
tion from any two determinant scores whose 
correlation is due to their mutual dependence 
on R. In short, Coan’s analysis is only par- 
tially free of spuriousness, and accordingly 
does not reflect the “intrinsic” relationship 
between the determinants. On the other hand 
since it has some of the spurious variance 
removed it does not reflect the “external” 
relationship either. 

Along the same line, Glickstein (1959) 
recomputed two of Wittenborn’s matrices 
(Wittenborn 1950a, 1950b) with R partialled 
out. The results showed that the mean of 
the matrices with R partialled out approxi- 
mated zero while Wittenborn’s original mat- 
rices appear to have a mean of about .30. 
The reply of Wittenborn (1959) indicates 
that their differences may be due to different 
assumptions. Wittenborn holds that R is the 
resultant of the number of responses in the 
various scoring categories and not the gen- 
erator of these responses, as Glickstein seems 
to imply. There is no direct evidence on this 
point. Webb and Hilden (1955), however, 
have shown that productivity on the TAT 
is related to verbal fluency. If it could be 
demonstrated that Rorschach productivity is 
related to productivity in other areas (i.e. 
verbal fluency) the argument that R generates
the determinants rather than vice versa,
would be greatly strengthened. The origin of 
R is important since if R is due to a “set” 
the benefit of partialling it out may be the 
resulting gain in knowledge about the interrelationship
of the determinants. If R is not
due to a “set,” Wittenborn is correct in stat- 
ing that we do violence to the data in par- 
tialling it out. Until further research is 
forthcoming, the issue remains in doubt. 
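
What partialling out R does to a correlation can be sketched as follows. The two "determinant" scores X and Y are hypothetical and are deliberately built on the assumption that R generates them (each is a fixed fraction of R plus an independent component), which is precisely the assumption in dispute; the sketch therefore illustrates the computation and does not bear on who is correct.

```python
import math
from itertools import product


def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def partial_r(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical protocols: R crossed with two independent components e1 and e2.
grid = list(product([10, 20, 30, 40, 50], [-2, 0, 2], [-2, 0, 2]))
r_total = [r for r, _, _ in grid]
x = [0.3 * r + e1 for r, e1, _ in grid]  # hypothetical determinant X
y = [0.2 * r + e2 for r, _, e2 in grid]  # hypothetical determinant Y

r_xy = pearson(x, y)
r_xy_given_r = partial_r(r_xy, pearson(x, r_total), pearson(y, r_total))

if __name__ == "__main__":
    print("r(X, Y) ignoring R:      ", round(r_xy, 3))
    print("r(X, Y) with R partialled:", round(r_xy_given_r, 3))
```

In this construction a sizable raw correlation vanishes once R is held constant. Merely omitting R from the matrix, as in Coan's analysis, would leave the raw correlation between X and Y unchanged.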

Wittenborn (1959) is on firmer ground 
when he points out that the use of partialling 
techniques ignores the actual responses given 
to the Rorschach which are the most meaningful
from an interpretative point of view.
“At present one has the unhappy choice of
studying the Rorschach ‘as it is,’ of studying
it ‘as it isn’t,’ or of ignoring it altogether” 
(p. 77). 


EVALUATION OF RORSCHACH HYPOTHESES


Many factor-analysts have been interested 
in submitting much of the Rorschach folklore 
handed down from clinician to clinician, to 
a factorial test. In the following pages some 
of their findings will be briefly discussed. 


M : ΣC (Erlebnistyp)


Wittenborn (1950a) in a study of 92 Yale 
undergraduates found M to be factorially dis- 
tinct from C and CF, though not from FC 
which partially substantiated his hypothesis 
that M and the color responses would be 
factorially distinct. Such evidence seems to be 
detrimental to the continued use of the 
M : ΣC ratio. Ainsworth (1954) in a rebuttal
makes much of the fact that the Rorschach 
categories are not based on a factorial ra- 
tionale. Most Rorschachers do not treat color 
unidimensionally. FC is held to be qualita- 
tively as well as quantitatively distinct from 
CF and C. Further, the relationship of M and 
FC is not unexpected in the normal adjusted 
personality. Even from a formal point of view 
it should be readily apparent that M and FC 
might have a good deal more in common than 
FC and CF. The scoring of M invariably 
implies the presence of form, for it is impos- 
sible to conceive of formless humans. It is 
possible that the movement response also 
might have included the perception of color 
although this might not be scored since M 
takes precedence over C in some scoring 
schemes. One may therefore conclude that the 
appearance of M and FC on the same factor 
probably indicates their mutual emphasis of 
form as well as their socially adjustive implications.
While FC and CF have color in
common they differ in the importance of form 
for the perception, and apparently this latter 
difference is the more important one since it 
results in their loading on separate factors. 
These statements put the Rorschacher in the 
position of having his cake and eating it too. 
It should not be possible to defend the act 
of distinguishing between FC and CF in one
breath and in the next breath proceed to
treat both as similar in compiling an M : ΣC
ratio. 

Adcock (1951) investigated the hypothe- 
sis that Erlebnistyp (“Experience Balance”) 
is indicated by several indices: (a) M : ΣC,
(b) FM + M : Fc + c + C′, and (c) number
of responses to cards VIII, IX, X. The vali- 
dation of the hypothesis was held to be de- 
pendent on their high loading on a common 
factor. This did not occur and Adcock con- 
cluded that all three indices did not measure 
Erlebnistyp. To some Rorschachers, this con- 
clusion would not be valid. They could argue 
that FM + M : Fc + c + C′ represented a potential
ratio of introversive to extratensive
tendencies which were not currently realized. 
There is no reason, however, why one’s po- 
tential extratensive tendencies should be a 
simple function of one’s “realized” degree of 
extratensiveness. The exact opposite might be 
argued in that if one is realizing much of his 
reservoir of extratensiveness there should be
little left in reserve. While this argument 
weakens Adcock’s finding, it does not inspire 
confidence in the testability of the Rorschach 
hypotheses. 


Intelligence 


Many Rorschachers have held that intelli- 
gence can be estimated from several Rors- 
chach determinants as well as nonformal 
scores such as variety of content and origi- 
nality of percept. Among the formal scores 
intelligence is asserted to be present when 
W, M, R, and F are present, to name just 
a few scores. The factorial studies have in 
general supported these hypotheses. M has 
been highly loaded on the intelligence factors 
found by Sen (1950), Lotsof (1953), Wil- 
liams and Lawrence (1953), and Consalvi 
and Center (1957). W has been highly loaded 
in three of these studies, with F and R each 
represented in one study. Only Lotsof, Comrey,
Bogartz, and Arsfield (1958) failed to
find any of these determinants on their “intelligence”
factor. Their study, however, utilized
children as Ss and the “intelligence”
hypothesis has not been generally extended to
apply to children.
Neuroticism


The number of “neurotic signs” referred to 
in the Rorschach literature is legion. It is dif- 
ficult to assess whether factor analysis aids in 
validating these signs when we cannot state 
an operational Rorschach definition of neu- 
roticism. Despite the variation in choice of 
“neurotic” determinants, however, one cate- 
gory seems to be present on almost all lists. 
This is the color category, with particular 
reference to CF and C. These determinant 
scores have been used in studies where Rorschach
“neurotic” factors were obtained. Sen’s
“neurotic” factor does contain high loadings
for C and CF, which is somewhat expected
in view of the common component C. These
signs are given substantial support by virtue
of their correlating highly with several other
“internal” and “external” criteria. The factor
was found to correlate approximately .7 with
Burt’s Rorschach Content Method Neurotic
Rating and with peer-judges’ ratings for neurotic
tendencies. Both CF and C also had a
significant negative loading on an “adjustment”
factor reported by Cox.

Lotsof (1953) reported putatively contrary 
findings. Since ΣC and Y (shading) had loadings
of .70 and .40 respectively on the same
factor, Lotsof concluded that his study had
serious negative implications for Rorschach
theory since “Anxiety [Y] and affect [C]
should not appear correlated for a ‘normal’
population to the extent that they do in this
study, . . .” (p. 24). By way of retort one
may state that without knowledge of what
is meant by “anxiety” and “affect” it is
hardly possible to say whether or not they
are correlated in a “normal” population. Besides,
the equating of ΣC to “affect” and Y
to “anxiety” without any further investigation
into the form level and configural aspects
of both of these categories is without justification.


The picture is clouded by the
results of Williams and Lawrence (1954).
Their “ego strength” factor contains moderately
high positive loadings for C and CF,

sign or validation of Erlebnistyp. The evi-
dence seems favorable for the use of Rors- 
chach determinants for intelligence. One vital 
flaw which would be emphasized by many 
clinicians is that most of the studies have no 
reference to the form level of the deter- 
minants. This appears to be a valid criticism 
and it is strange that no experimenter has 
taken cognizance of this important point. Ac- 
cordingly, without reference to form level, the 
use of the determinants cannot constitute an 
adequate test of Rorschach hypotheses which 
do emphasize the value of form level in reference
to Erlebnistyp, intelligence, and neuroticism.


OTHER RORSCHACH FINDINGS 


The use of factor analysis has importance 
for increasing knowledge concerning the 
Rorschach apart from testing the commonly 
used hypotheses. The following section lists 
some of the newer findings: 


Productivity Factor 


Most of the Rorschach studies have reported
a “productivity” factor which reflects
the degree to which the manifestation of the
various determinants is a function of the
total number of responses in the protocol.
The importance of this factor for clinicians
is that it enables them to test hitherto un- 
tested hypotheses as well as suggesting new 
ones. For example, the presence of a large 
number of D has usually been interpreted 
as indicating a preference for the larger de- 
tails of a task rather than the organizational 
aspect. Factor analysis, however, has indi- 
cated that D, d, Dd, F, and R are highly 
loaded on the “productivity factor.” Since 
the major portion of common variance of 
each of these determinants is accounted for 
by this factor it is possible to examine the 
usual psychological meaning applied to each 
of these determinants and see whether the 
simultaneous existence of all of these psycho- 
logical hypotheses is clinically meaningful. In 
the aforementioned factor this would mean 
that the tendency to attend to the larger 
details of a situation (D) is also accompanied 
by the tendency to attend to small but 
articulate details (d) and to the tiny aspects 
(Dd). In short, we could economize by saying
that the tendency to attend to details is
general no matter what the size of the detail.
But the productivity factor also indicates
that contour determined perceptions (F) and 
general productivity (R) should be related 
to the tendency to perceive details. It is 
apparent then that insofar as meaning exists 
in determinants codified so as to be amenable 
to factor analysis, the factors extracted are 
helpful in testing the validity of psychological 
hypotheses as to the meaning of these deter- 
minants. 
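
The reasoning behind the productivity factor can be illustrated with a small simulation. The sketch below is not drawn from any of the studies reviewed: the range of R and the location proportions are hypothetical values chosen only for illustration. It shows that when every location count is simply a share of the total number of responses, D, d, and Dd each correlate with R and with one another, which is the pattern a productivity factor summarizes.

```python
import random


def pearson(x, y):
    """Plain product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def simulate_protocols(n_subjects=500, seed=0):
    """Simulate location counts per protocol (all parameters hypothetical)."""
    rng = random.Random(seed)
    categories = ["W", "D", "d", "Dd"]
    weights = [0.25, 0.50, 0.15, 0.10]  # assumed proportions, not empirical
    scores = {name: [] for name in categories + ["R"]}
    for _ in range(n_subjects):
        total = rng.randint(10, 60)  # R varies widely from one S to the next
        draws = rng.choices(categories, weights=weights, k=total)
        for name in categories:
            scores[name].append(draws.count(name))
        scores["R"].append(total)
    return scores


scores = simulate_protocols()
with_r = {name: pearson(scores[name], scores["R"]) for name in ("D", "d", "Dd")}
for name, r in with_r.items():
    print(f"r({name}, R) = {r:.2f}")
print(f"r(d, Dd) = {pearson(scores['d'], scores['Dd']):.2f}")
```

In this toy model no S has any special preference for large or small details, yet the counts still correlate positively. Any interpretation of D, d, or Dd therefore has to be made against the background of R.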


Psychiatric Diagnosis 

Several researchers have addressed them- 
selves to the question of whether the factorial 
structure of psychiatric patients could be 
differentiated from that of normals. The evi- 
dence would appear to be negative. Witten- 
born undertook factor-analyses of both kinds 
of populations. The Ss were unmatched, the 
normal population consisting of 92 Yale 
undergraduates (1950a) while the psychiatric 
one was composed of 160 in- and out-patients 
from a psychiatric clinic (1950b). Four factors
were obtained for each group, of which
three appear highly similar (Productivity,
Low Perceptual Form, and Empathy). The
chief difference between the two populations
was a splitting of a factor found in the normal
population which had high loadings on
M, m, Fc, C', FC and original responses O
(presumably reflecting a healthy controlled 
degree of empathy and originality) into two 
separate factors, one reflecting the movement 
and form controlled determinants, the other 
possessing high loadings on O and S which 
now reflected a bizarre kind of originality. 

The similarity of the factorial makeup for 
the two populations is surprising. It may well 
be that the formal determinants investigated 
in an atomistic way without regard to con- 
figural pattern or form level may not be dif- 
ferentiated by mental disease. This might be 
due to the stimulus properties of the cards, 
the Ss’ test-taking attitudes, or a myriad of 
other reasons. 

Moreover, the meaningfulness of the fac- 
tors is difficult to assess because of the lack 
of external criteria. Ainsworth (1954) correctly
points out, for example, that rather
than reflecting disturbance, the splitting of
the factor referred to in the student popula-
tion into two factors in the psychiatric one 
may reflect the fact that some of the patients 
possess ego strength in seeing things differ- 
ently (O, S) than other less adjusted patients. 
In the absence of known personality corre- 
lates of the patients’ protocols, identification 
of the factors from a clinically meaningful 
point of view is difficult. 

Hughes (1950) investigated the protocols 
of 100 psychiatric patients (32 organics, 39 
psychoneurotics, 29 schizophrenics). Tetra- 
choric intercorrelations of 22 different signs 
representing the above syndromes were fac- 
tored and yielded 8 orthogonal factors. The 
results were disappointing in that only the 
“organic” factor was clearly defined. 

The factoring of persons (Q technique) on 
the basis of Rorschach scores was under- 
taken by Bendig and Hamlin (1955). Sixteen 
patients from four psychiatric categories were 
intercorrelated on 42 Rorschach element 
scores. The inverted factor analytic method 
(Q technique) did not separate the psychi- 
atric categories although clinicians were able 
to do so via blind analyses. Apparently there 
is a great deal of meaningful information in 
the Rorschach protocol which is not repre- 
sented in the scores obtained from it. 


Factorial Relationship of Rorschach 
and MMPI 


The relationship between the Rorschach 
and MMPI was investigated by Williams and 
Lawrence (1954) in a study which also in- 
cluded the Wechsler Scale of Adult Intelli- 
gence. The usual Rorschach “productivity” 
factor manifested no substantial MMPI sub- 
scale loadings. The second factor labeled “ego 
strength” was represented by all of the tests: 
Rorschach variables W (.82), k (.60), K
(.59), C (.55), CF (.72), Wechsler Verbal
IQ (.54), and MMPI variables K (.51), F 
(—.47), Sc (—.45) and Ego Strength (.71). 
A “repression” factor was not clearly defined 
but had highest loadings on the MMPI for 
the R (repression) scale (.55), D (.40), Hy 
(.54) and on the Rorschach by D (—.48), 
FM (—.49), FC (—.48), and R (—.44). 
The last factor was termed a “maladjust- 
ment” one and was heavily loaded with most 
of the MMPI subscales. Surprisingly, no
Rorschach variable was represented to any
degree on this factor. The authors concluded 
that a considerable amount of variance in 
each test was not accounted for by covaria- 
tion between them. 
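
One way to read the loadings quoted above is to square them. If the factors are orthogonal (an assumption made here, since the rotation used is not stated in this review), a squared loading is the proportion of a variable's variance associated with the factor. The sketch below applies this to the “ego strength” loadings; the dictionary keys are labels chosen for this example.

```python
# Loadings on the "ego strength" factor as quoted in the text.
ego_strength_loadings = {
    "Rorschach W": 0.82,
    "Rorschach k": 0.60,
    "Rorschach K": 0.59,
    "Rorschach C": 0.55,
    "Rorschach CF": 0.72,
    "Wechsler Verbal IQ": 0.54,
    "MMPI K": 0.51,
    "MMPI F": -0.47,
    "MMPI Sc": -0.45,
    "MMPI Ego Strength": 0.71,
}


def shared_variance(loadings):
    """Squared loading per variable (assumes orthogonal factors)."""
    return {name: round(value ** 2, 2) for name, value in loadings.items()}


proportions = shared_variance(ego_strength_loadings)
for name, share in proportions.items():
    print(f"{name}: {share:.2f} of variance associated with the factor")
```

Even the highest loading, W at .82, leaves roughly a third of that variable's variance outside the factor, and the MMPI clinical scales share only about a fifth. This is consistent with the authors' conclusion.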


Rorschach as a Measure of Basic 
Personality 


A study by Borgatta and Eschenbach 
(1955) utilized a wide range of Rorschach 
variables, aptitude scores, actuarial back- 
ground data and personality ratings obtained 
from 125 normal male Air Force personnel. 
Three of the eight factors obtained loaded 
highly with Rorschach variables. These in- 
cluded a “productivity” factor, a “formal 
properties” factor, loaded highly on C, 
texture and shading, S, and W, which would 
be perhaps more accurately labeled as “low- 
perceptual form,” and an “intelligence” fac- 
tor. Of chief interest to the authors was the 
fact that the Rorschach did not contribute 
significantly to factors which they labeled as 
“social intelligence,” “emotional assertive- 
ness,” “task leadership,” “social acceptabil- 
ity” and “maturity adjustment.” In view of 
these findings the authors believed that the 
use of the Rorschach as a tool for tapping 
the basic personality seemed questionable. 


One or two kinds of M 


A dimension of personality associating fan- 
tasy tendencies with the control of motility 
and impulsive behavior was hypothesized by 
Singer, Wilensky, and McCraven (1956). 
The population consisted of 100 male, vet- 
eran patients bearing the diagnosis of schizo- 
phrenia. These Ss received the Rorschach, 
TAT, and a battery of motor tests. Behavior 
rating scales were obtained from ward per- 
sonnel. The M determinant was believed to 
represent both a tendency to inhibit motor 
activity and a considerable imagination or 
use of fantasy. While M loaded highly on two 
of the factors obtained, “motor inhibition” 
and “introspection,” the fact that this hy- 
pothesized unitary factor emerged as two 
separate factors precluded a simple conclu- 
sion as to the meaning of M. The authors 
implied that the category M may be too 
gross in linking introversive individuals with 
those possessing considerable ability for restraining
impulsive behavior. A key problem
that remained unanswered was whether the 
factors might be a function of the population 
employed with little generality for more rep- 
resentative groups. 


The Problem of “External” Validation 


The difficulty of interpreting factors in 
clinical studies when no behavioral correlates 
are used is illustrated in a study by Vernier, 
Stafford, and Krugman (1958) who investigated
the relationships among several projective
tests (Rorschach, Sentence-Completion,
Bender-Gestalt, and Draw-A-Person).
Another purpose was to determine the relationships
between four kinds of physical
pathology (orthopedic, respiratory, cardiac,
and neurological) and the projective scores.
The Ss, 150 VA patients (105 in-patients, 45
out-patients) received the Otis S. A. Test of
Mental Ability in addition to the aforementioned
projective techniques.

Tetrachoric correlations of 53 variables ob- 
tained from the tests were reduced to a 
35 × 35 matrix by eliminating intercorrelations
approaching zero as well as those over
.75. The analysis yielded seven oblique fac-
tors which are extremely unclear in meaning. 
The lack of clarity is a function of the lack 
of experimental evidence as to the import of 
many of the “signs” obtained from projective 
techniques. For example, the “withdrawal” 
factor had substantial loadings on “hands 
omitted” from the Draw-A-Person test (.63) 
and on the “need affiliation” score from the 
Sentence Completion test (—.49). What this 
has to do with a third highly loaded variable 
“not rejecting cards on the Rorschach” (.64) 
is unknown. Do people who withdraw from 
interpersonal contact tend not to reject Rors- 
chach cards? Similarly, the six other factors 
obtained are difficult to interpret. The weak- 
ness of this study lies in the fact that the 
interpretive implications of clusters of signs 
from projective techniques are themselves 
meaningless unless we know the personologi- 
cal correlates of these responses. The crucial 
task then for researchers employing factor 
analysis is to indicate the common variance 
between projective responses and behavior in 
important personal-social situations, rather 
than indicating the common variance between
responses from different projective techniques.
A study which deals with this problem is 
that of Sandler and Ackner (1951). Here the 
“external” validation was obtained by correlating
200-odd psychiatric ratings with the
Rorschach factors treated as scores using the 
formula given by Sandler (1949). Space does 
not permit a full discussion of these interest- 
ing data. Suffice it to mention that Factor 2, 
for example (the perception of internal, ana- 
tomical objects as opposed to the perception 
of external objects), correlated beyond the 
.01 level with such current symptoms of the 
patient as “overt aggressive behavior” (.53), 
“complaints of feelings of aggression” (.41), 
“confabulations” (.41) as well as with pre- 
vious personality traits of “appearing self- 
distrustful” (.42) and “feeling tense” (.36). 
This interesting study shows the possibilities 
of obtaining new information when sufficiently
broad external behavior scores are
compared with factors treated as scores. 


PRESENT INNOVATIONS IN THE RORSCHACH 


In the foregoing pages an attempt has been 
made to discuss the problems attendant to 
the use of the Rorschach as a psychometric 
instrument and at the same time to present 
some of the important findings resulting from 
factorial studies. In a way the two tasks are 
contradictory. If the Rorschach is so ill-fitted 
for psychometric treatment why bother to 
review studies at all? The reason for review- 
ing the content of many of these studies is 
that they present an admixture of “true” 
and “error” variance which is difficult to 
separate so long as we retain the Rorschach 
in its present form. In an effort to retain the 
baby we have saved the dirty bath water, 
too. Though the Rorschach may be a means 
of probing the depths of personality in the 
hands of some practicing clinicians, from the 
point of research it must be considered a 
psychometric sow’s ear. The small number 
of cards, quasiquantitative scoring system 
with resulting low reliability, variable number 
of R, use of multiple scores from single responses,
attest to its ancestry. Yet another
difficulty lies in the fact that the scoring is 
completely dependent on the self-knowledge, 
verbal fluency, and vocabulary strength of 
the testee, not to mention the skill of the
examiner in asking the right questions. The
Rorschach is operationally then hardly a per- 
ceptual test but rather an interpretation of 
an interpretation. Several researchers have 
attempted to remedy this situation. Cronbach 
(1949) has offered several statistical ap- 
proaches to the Rorschach. He favors count- 
ing procedures over additive methods. Where 
correlation is desired the scores should be 
normalized. Ratio and difference scores 
should be replaced by patterns. His sugges- 
tions have had some influence on Rorschach 
research. They are, however, of a “let’s make 
the best of a bad situation” kind and are 
intended as stopgap measures pending the 
arrival of a more statistically sophisticated 
ink blot technique. 

Rust (1947), McReynolds (1951), and 
O'Reilly (1956) have constructed interesting 
subsystems designed for specialized use either 
in determining the presence of a given deter- 
minant, or for psychiatric diagnosis. In the 
interest of space, however, only those sys- 
tems dealing with a new total scoring system 
will be considered. The Howard Ink Blot 
test (Howard, 1953) does contain 12 new 
blots, but the system of scoring employed is 
identical to the Beck system and there is 
therefore little point in reviewing the tech- 
nique. 

Zubin (1953), the first to apply more ade- 
quate quantitative methodology, constructed 
60 scales measuring location, objective stimu- 
lus attributes, determinants, interpretation, 
content, organization and other attributes 
somewhat difficult to categorize. The scores 
obtained are rated on a five-point scale. 
Zubin’s system has not been generally ac- 
cepted for two reasons. One, a very basic 
reason, is that the system has not until very 
recently been published in book form. Sec- 
ondly, the excessive number of characteristics 
is a bit too detailed for the current state of 
knowledge about the stimulus correlates of 
the Rorschach. 

A more recent variation is that of Holtz- 
man (1959) who employs his own set of ink 
blots. He has borrowed Zubin’s rating 
scheme, trimming the number of scoring cate- 
gories utilized, however, to six. These are 
Location, Form Appropriateness, Form Definiteness,
Color, Shading, and Movement Energy
Level, which are rated on three-
to seven-point scales. In addition, several
innovations are introduced for the purpose of
making the scores more amenable to treatment
as normal continuous data. Two sets of
parallel cards are employed, each set containing
45 cards. With this large set of cards
it was possible to ensure not
only that each card would be used, but that the
same number of responses would be given to
each card. Accordingly, the S is instructed to 
give only one response to each card. 

The system is a major advance in obtain- 
ing higher reliabilities in the scoring of the 
categories. Interscorer reliability is in the 
.90’s, while estimates of reliability based 
on internal consistency using Gulliksen’s
matched random subtest method yielded cor- 
relations ranging from .80 to .91. 
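
An internal-consistency estimate of this general kind can be sketched as follows. This is a simplified stand-in and not Gulliksen's procedure: his method matches items before assigning them to subtests, whereas the sketch uses a plain random split of the cards and then applies the Spearman-Brown correction. The score matrix is simulated, and the sample size and noise level are hypothetical.

```python
import random


def pearson(x, y):
    """Plain product-moment correlation between two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def simulate_card_scores(n_subjects=200, n_cards=45, seed=1):
    """One row per S, one column per card: stable trait plus card-level noise."""
    rng = random.Random(seed)
    matrix = []
    for _ in range(n_subjects):
        trait = rng.gauss(0, 1)
        matrix.append([trait + rng.gauss(0, 2) for _ in range(n_cards)])
    return matrix


def split_half_reliability(score_matrix, seed=0):
    """Random split of cards into halves, stepped up by Spearman-Brown."""
    rng = random.Random(seed)
    cards = list(range(len(score_matrix[0])))
    rng.shuffle(cards)
    middle = len(cards) // 2
    half_a, half_b = cards[:middle], cards[middle:]
    totals_a = [sum(row[c] for c in half_a) for row in score_matrix]
    totals_b = [sum(row[c] for c in half_b) for row in score_matrix]
    r_half = pearson(totals_a, totals_b)
    # Spearman-Brown: estimated reliability of the full-length test
    return 2 * r_half / (1 + r_half)


reliability = split_half_reliability(simulate_card_scores())
print(f"Stepped-up split-half reliability: {reliability:.2f}")
```

With 45 cards the halves are of unequal length (22 and 23), so the Spearman-Brown step is only approximate here. The sketch also illustrates why one response per card over many cards helps: each card adds an observation of the same category, and the noise averages out.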

In view of these important advances it 
seems unfortunate that in one respect the 
system has remained stagnant. This is in 
the manner of recording the response. 
Here Holtzman has relied more or less on 
the highly unsatisfactory standard inquiry 
method. The disadvantage of this method, 
which relies upon the S’s ability to verbalize 
the determinants of his perception, is appar- 
ent in the data presented by Baughman 
(1954). It is known that most people who 
see a bat on Card I give form as the chief 
determinant. Very few mention the black 
color and shading as influencing their per- 
cept. When, however, Baughman presented a 
modified version of Card I which had only 
the color removed, very few people saw the 
bat. Further evidence in this regard is re- 
ported in a recent study comparing two 
methods of inquiry (Baughman, 1959). One
method was the standard Beck approach. The
other was the paired-comparison inquiry.
This method, briefly stated, presents the
orthodox card and one of several various 
modifications during the inquiry period. The 
modifications (Baughman, 1958) currently 
employed are as follows: 


Achromatic (A): Color is eliminated; remaining
blot properties are retained as in the standard series.
Complex Silhouette (CS): Shading variations are
removed; differentiation between major detail areas
is retained, however, by giving each major detail
area a different contrast value. Silhouette (S): All
shading variations and differentiations between
major detail areas are eliminated, making a uniform
gray. White (W): A white figure is placed on a
uniformly gray background. Complex Form (CF):
Figure-ground contrast due to brightness difference
is removed; forms of major detail areas are retained.
The form is like that of CS. Form (F): A
form comparable to S and W is created, but figure-ground
contrast due to brightness difference is eliminated
(p. 382).


The examiner chooses the modification to 
suit his question. If, for example, S sees “bat” 
on Card II and E wishes to determine the 
role of shading in the perception, the silhou- 
ette Card II might be compared to the stand- 
ard chromatic one, and S asked if he still 
sees the bat. 

The Beck-Baughman inquiry comparison 
indicated that much more shading was actu- 
ally utilized by most Ss on the Rorschach 
than was verbalized under the Beck inquiry. 

The Baughman method which retains the 
original 10 cards will probably not supply 
quite as reliable data as the more refined sta- 
tistical procedures seem to give the Holtzman 
modification. On the other hand Baughman’s 
studies have the advantage of being more 
readily related to the vast Rorschach litera- 
ture and should prove of greater interest to 
most practicing clinicians. Both have the disadvantage
of requiring an excessive number
of cards, 45 in each of the two Holtzman
series, and 65 in Baughman’s group. It is too
early to state which of these procedures will
prove to be the most fruitful for personality
study but both represent a considerable im- 
provement over the current Rorschach pro- 
cedure. 

The importance of these new revisions for 
factor analysis is considerable. To the extent 
that the current Rorschach is inadequate as 
a quantitative test the use of factor analysis 
will yield questionable results. If factor analysis
is to be utilized more fully in the future,
the Rorschach will have to be superseded by 
a more appropriate test. The newer variations 
with increasing reliance on quantification and 
reliability should more readily justify the use 
of factor analyses and increase our confidence 
in the results obtained from such studies.


RELATION OF RORSCHACH MOVEMENT AND COLOR
RESPONSES TO COGNITIVE INHIBITION 1


ROBERT E. FAGER
State University of New York


Recent research has suggested that Rorschach 
human movement (M) may be considered both 
as a product and as a measure of inhibition 
ability. Levine, Glass, & Meltzoff (1957) pro- 
posed that failure to make a mirror-image N on 
the digit symbol subtest of the Wechsler-Bellevue 
Intelligence Scale is a function of insufficient 
delay of the overlearned N response tendency. 
They hypothesized that the Ss who fail to reverse 
the N (experimentals) should produce fewer M 
responses than Ss who make the reversal (controls).
They found that (a) a greater proportion of
experimentals than controls produced fewer than
two M responses, (b) experimentals more frequently
than controls had Sum C = 0, and (c) experimentals
had an experience balance characterized
by the Low M-Low Sum C pattern, while controls
usually fell into the High M-High Sum C group.
The purpose of the study reported here is to 
replicate Levine’s work. 

Whereas Levine used only male psychiatric 
veterans, this study employed male and female 
Ss from both psychiatric and nonpsychiatric 
hospital wards. There were no differences in sex 
distribution between experimental and control 
groups, although they did differ in age and edu- 
cation (controls older and better educated). 
Psychiatric and nonpsychiatric Ss were combined 
because no significant differences were found for 
intelligence, total Rorschach responses, age or 
education. The IQ-M correlation for the total 
sample (+.22, p < .01) was of the order re- 
ported by Levine et al. (1959) for other studies 
(+.26). Ninety-nine of the 124 controls and 58 
of the 81 experimentals were psychiatric patients 
representing a wide variety of disorders. All Ss
were 16 years or older and made 10 or more
Rorschach responses. Experimentals were identified
by noting errors in copying the mirror-image
N; and controls were selected by taking
the next protocol in alphabetical order which did
not have this error.

1 An extended report of this study may be obtained
without charge from Robert E. Fager, Department
of Psychiatry, Upstate Medical Center,
Syracuse 10, New York, or for a fee from the
American Documentation Institute. Order Document
No. 6288, remitting $1.25 for microfilm or $1.25 for
photocopies.

Results show no group differences (a) in production
of fewer than two M responses; (b) in
proportion of Sum C = 0; or (c) in type of
experience balance. Moreover, differences in
neither mean IQ (experimentals 97.9, controls
101.9) nor mean total responses (experimentals
23.1, controls 24.8) reached significance.

Since Levine had significant IQ differences in 
favor of controls (p= .0001), an effort was 
made to replicate this feature by comparing high 
IQ controls (mean 117.7) with low IQ experi- 
mentals (mean 80.0). Again, neither the propor- 
tion making less than two M responses nor the 
frequency of Sum C=0 discriminated between 
groups. 

Failure to replicate Levine’s results raises the 
question of the stability of his findings. Sam- 
pling differences are an unlikely explanation for 
the variant findings since no demonstrable bias 
was introduced by including females and non- 
psychiatric Ss. An interaction between intelli- 
gence and inhibition is also a doubtful explana- 
tion since differences in M and Sum C could not 
be demonstrated in this study even when alter- 
nate experimental and control groups were se- 
lected to maximize intellectual differences. Con- 
sequently, the relation of Rorschach human 
movement and color responses to the type of 
cognitive inhibition employed by Levine remains 
unclear.


A FURTHER INVESTIGATION OF BODY-CATHEXIS AND THE SELF 1, 2


JON R. WEINBERG 3
State University of Iowa 


The present study essentially replicated an 
experiment reported by Secord and Jourard 
(1953), in which attitudes toward one’s body 
were related to other aspects of the self. The 
relationships between a number of pairs of vari- 
ables were again investigated by correlational 
analysis. The following measures were employed: 
the homonym test, a word association procedure 
from which a score on somatic concern is de- 
rived; the body-cathexis scale, composed of self- 
ratings of satisfaction with parts of the body; 
the anxiety indicator, a subscore consisting of 
body-cathexis items given the lowest average 
ratings; the self-cathexis scale composed of self- 
ratings of satisfaction with personal attributes; 
and the Maslow test of psychological insecurity, 
a questionnaire instrument. 

The Ss, students in an introductory psychology 
course, were tested in groups of 30 to 40. Con- 
ditions of complete anonymity were insured, and 
honesty in responding was strongly emphasized 
in the instructions. The scores of 108 men and 
104 women were employed in the analysis. 


1 An extended report of this study may be ob- 
tained without charge from Jon Weinberg, C 605 
Mayo Hospital, Minneapolis 14, Minnesota, or for 
a fee from the American Documentation Institute. 
Order Document No. 6289, remitting $1.25 for micro- 
film or $1.25 for photocopies. 

2 This paper is based upon a master’s thesis sub- 
mitted to the State University of Iowa. The author 
is grateful to A. L. Benton for his guidance in this 
research. 

3 Now a United States Public Health Service Post- 
doctoral Fellow at the University of Minnesota 
Medical School. 


The primary interest of the study was re- 
assessment of the correlations between pairs of 
measures. The results indicated that some of the 
significant relationships found previously were 
replicable, while others were not. Briefly, it was 
again found that body-cathexis and self-cathexis 
were significantly related; but in the present 
study, the correlation for males was significantly 
higher than that for females. In contrast to the 
Secord-Jourard findings, the homonym test was 
not significantly related to the anxiety indicator 
nor, for women, to body-cathexis. The Maslow 
insecurity test was again significantly related to 
both body-cathexis and self-cathexis, but to the 
anxiety indicator for men only. A significant
correlation for men was also found between the 
Maslow test and the homonym test, a relation- 
ship not investigated earlier. 

In general, then, it was again found that feel- 
ings about the body are related both to feelings 
about the self and to feelings of psychological 
insecurity. Both of these relationships, however, 
tended to be of greater magnitude for men than 
for women in the present study. The finding by 
Secord and Jourard that negative feelings about 
the body are related to somatic concern was not 
upheld. In addition, the present study found that,
for men, somatic concern was related to psychological
insecurity.


THE TEMPORAL STABILITY OF MPI SCORES IN NORMAL AND
PSYCHIATRIC POPULATIONS 1


JOHN B. KNOWLES 
Medical Research Council, Clinical Psychiatry Research Group, Chichester, Sussex 


The Maudsley Personality Inventory (MPI) 
constructed and standardized in Britain by 
Eysenck (1959) gives measures of Neuroticism 
(N) and Extraversion (E) as defined within his 
personality theory. The present study was de- 
signed to assess the test-retest reliability of the 
scale and, by comparing the retest correlations 
for psychiatric and normal Ss, to test hypotheses 
relating to its validity. 

Eysenck postulates that the relative predomi- 
nance of either cortical excitation or inhibition 
predisposes individuals towards introverted or 
extraverted patterns of behavior. If differences 
in excitation and inhibition are properties of the 
nervous system, then measures of personality 
based on these differences should remain rela- 
tively stable over time (Storms & Sigal, 1958). 
Further, it was predicted that normal and psy- 
chiatric populations would show an equivalent 
degree of stability over time, as measured by the 
test-retest correlation. Since neuroticism is de- 
fined as a liability to breakdown under stress, 
neither the appearance of, nor fluctuation in,
clinical symptoms should affect the temporal
stability of N scores. Thus, it was predicted that 
the test-retest correlations for N in neurotic and 
normal populations would not differ significantly. 

Ninety-three surgical patients, and 63 psychi- 
atric patients with a clearly neurotic diagnosis, 
who had completed the MPI one year previously, 
were contacted through the post and asked to 
complete a further MPI. Fifty-two surgical pa- 
tients, used as the normal group in subsequent 
comparisons, and 40 neurotic patients replied. 

1 An extended report of this study may be obtained
without charge from John B. Knowles, Medical
Research Council, Clinical Psychiatry Research
Group, Graylingwell Hospital, Chichester, Sussex, or
for a fee from the American Documentation Institute.
Order Document No. 6291, remitting $1.25 for
microfilm or $1.25 for photocopies.


The retest correlations for E were 0.77 and 
0.79 for the normal and neurotic patients respec- 
tively. Since these values were almost identical, 
and virtually the same as the split-half reliability 
of the scale (approximately 0.80) it was concluded
that the predictions for E were confirmed.

The results appeared to disfavor the prediction 
regarding the N factor. The retest correlation 
for the neurotics (0.70) was below that for the 
normals (0.84). However, when tested by Fish- 
er’s z transformation the difference just failed to 
reach significance at the 5% level. 
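
The comparison of the two N retest correlations can be reworked as a check. The sketch assumes that the 52 surgical and 40 neurotic respondents are the sample sizes behind the two correlations, and it reports both one- and two-tailed probabilities because the abstract does not say which was used. The function name is chosen for this example.

```python
import math


def compare_correlations(r1, n1, r2, n2):
    """Fisher z test for the difference between two independent correlations."""
    z1, z2 = math.atanh(r1), math.atanh(r2)  # Fisher's r-to-z transformation
    standard_error = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    z = (z1 - z2) / standard_error
    p_two_tailed = math.erfc(abs(z) / math.sqrt(2))
    p_one_tailed = p_two_tailed / 2
    return z, p_one_tailed, p_two_tailed


# Normals: r = 0.84 (assumed n = 52); neurotics: r = 0.70 (assumed n = 40)
z, p_one, p_two = compare_correlations(0.84, 52, 0.70, 40)
print(f"z = {z:.2f}, one-tailed p = {p_one:.3f}, two-tailed p = {p_two:.3f}")
```

On these assumptions the statistic falls slightly below the one-tailed 5% critical value of 1.645, which agrees with the statement that the difference just failed to reach significance.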

The present results agree with those given for 
normals by Eysenck, and for two mixed normal 
and neurotic groups by Bartholomew & Marley 
(1959). When the groups studied in the latter 
investigation are compared, it is found that the 
group which was predominantly neurotic also 
gave a lower retest correlation on neuroticism 
whereas the correlations for E were similar. 

From the studies undertaken to date it is con- 
cluded that the MPI has high reliability, that 
E scores of normal and neurotic populations show 
an equivalent stability over time, but that there 
is some evidence that fluctuations in clinical 
status influence the stability of N scores.


A high correlation has been repeatedly
noted between the judged desirability of a 
personality characteristic denoted by an in- 
ventory item and the probability of the item’s 
endorsement (cf. Edwards, 1957b; Wahler, 
1958). Edwards (1953) has pointed out that, 
while this correlation suggests the possibility 
that traits judged to be desirable are preva- 
lent or dominant in the group under study, 
it might also reflect a tendency to create a 
good impression or to respond, either deliber- 
ately or unintentionally, in a socially desir- 
able manner. Instruments designed to meas- 
ure this set to respond desirably were found 
to be substantially correlated with several 
clinical and personality scales (Edwards, 
1954; Edwards, 1957b; Fordyce, 1956; Han- 
ley, 1957) thus documenting the pervasive 
influence of social desirability on personality 
measurement. In view of this accumulating 
evidence, Edwards (1954) and others (cf. 
Heineman, 1953; Sisson, 1948) attempted to 
control for social desirability in constructing 
personality measures by pairing, in a forced- 
choice format, items with approximately 
equal desirability scale values. 

This forced-choice method of control as- 
sumes that social desirability is a single uni- 
dimensional continuum, along which state- 
ments may be spaced and in terms of which 
items may be equated. If, however, judgments 
of social desirability were found to be multidimensional,
then controls in terms of a
single average scale might prove inadequate. 
Edwards (1953) obtained his social desira- 
bility scale values by the method of successive 
intervals (cf. Edwards, 1957a; Torgerson, 
1958), a technique which combines individual 
ratings of desirability to produce a single 
scale describing, on the average, the pooled 
judgments of the particular group used. Since 
the subjects’ (Ss’) own opinions about the so- 
cial desirability of traits were elicited in the 
rating task, one might expect various cultural 
and social background factors to produce dif- 
ferences in the scales obtained from dif- 
ferent judging groups. However, there may 
also exist within a single group consistent 
individual differences with sufficient structure 
to warrant their consideration as different 
“points of view” with respect to desirability.
The purpose of the present paper is to investi- 
gate the dimensionality of social desirability 
judgments and to explore the nature of any 
dimensions or “points of view” uncovered. 
Edwards (1957b) and others (cf. Klett & 
Yaukey, 1959) have investigated the compa- 
rability of social desirability judgments ob- 
tained from different groups and have found 
generally high correlations among the various 
scales. On a college sample, for example, Ed- 
wards (1953) found no essential differences 
in desirability values as a function of age, 
sex, or education, and Klett (1957a), on a 
high school sample, concluded that there were 
no significant differences among socioeconomic
groups, or between grades or sexes.
The correlation between desirability scale 
values obtained from the college and high 
school samples was .94. Klett (1957b) also 
obtained desirability ratings from a sample 
of mental hospital patients: he found no 
essential differences in the desirability ratings 
of psychotic and nonpsychotic subgroups, and
scale values for his total hospital sample cor- 
related .88 with the college values and .87 
with the high school values. In evaluating 
this striking similarity in group opinions 
about the desirability of personality traits, it 
must be remembered that successive intervals 
scale values represent a kind of average rat- 
ing for the group involved. Within-group 
ratings would need to be very homogeneous 
and between-group differences quite marked 
before a low correlation would be expected 
between such averages. Rosen (1956), for 
example, reported highly similar mean profiles 
for groups given the MMPI under self-ap- 
praisal and desirability instructions, but at 
the individual level the correlations between 
normal and desirability profiles were low and 
sometimes negative. 
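
The point that group averages can agree closely while individual judgments diverge can be illustrated with a small simulation. This is a sketch under stated assumptions, not a reanalysis of any data cited here: the ratings are synthetic, and the function name, the noise level, and the group size are all hypothetical choices made for the example.

```python
import numpy as np


def simulate(n_items=42, n_raters=100, noise_sd=3.0, seed=0):
    """Compare the correlation of group means with that of individual profiles."""
    rng = np.random.default_rng(seed)
    consensus = rng.normal(size=n_items)  # shared desirability of each item

    def group():
        # Each rater = shared consensus + a large idiosyncratic component
        return consensus + rng.normal(scale=noise_sd, size=(n_raters, n_items))

    a, b = group(), group()
    # Correlation between the two groups' mean item ratings
    mean_r = np.corrcoef(a.mean(axis=0), b.mean(axis=0))[0, 1]
    # Average correlation between one rater from each group
    pair_r = np.mean([np.corrcoef(a[i], b[i])[0, 1] for i in range(n_raters)])
    return mean_r, pair_r


mean_r, pair_r = simulate()
print(round(mean_r, 2), round(pair_r, 2))
```

Averaging over raters removes most of the idiosyncratic variance, so the mean profiles correlate highly even though any two individual raters agree only weakly.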

Despite high correlations between scale 
values obtained from different groups, evi- 
dence suggesting the inadequacy of a single 
global view of desirability has been noted. 
Klett (1957a) found that matched item pairs 
for the Edwards Personal Preference Sched- 
ule (EPPS) were less adequately matched in 
terms of the high school values. French 
(1958) noted that EPPS scores for basic air- 
men were significantly higher than published 
norms on traits presumably valued by the Air 
Force, suggesting that recruits were able to 
identify responses that get military approval 
and that the Air Force view of desirability 
may differ from that expressed in the college 
ratings. Klett (1957b), noting that some of 
the differences between hospital and college 
ratings were systematically related to scale 
content, found that EPPS mean scores for a 
sample of schizophrenics were significantly 
higher than college normative scores for needs 
judged to be more desirable by the hospital 
group. Klett (1957c) also divided the mental 
hospital sample into several dichotomous 
groups in terms of high and low scores on 
scales derived from the MMPI. For each 
breakdown, correlations were computed be- 
tween median desirability ratings obtained 
from high and low scorers; and some of the 
coefficients, although still high (.68 and .71),
were significantly smaller than those previ- 
ously reported. 


The present study carries the investigation 
of consistent differences in judging desirability
back to the level of the individual ratings.
The technique applied is essentially a factor 
analysis of intercorrelations among ratings of 
personality items. This approach, used pre- 
viously by Morris and Jones (1955) in their 
study of value dimensions, yields factors 
which presumably reflect different points of 
view with respect to the attribute rated, in 
this case social desirability. 

An alternative approach to this same prob- 
lem would be to analyze “Q correlations” 
between people to isolate groups of similar 
raters and examine their median patterns. 
This use of Q correlations, however, depends 
upon the existence of relatively pure types of 
raters; whereas for the present approach, 
which seeks consistent judgmental tendencies, 
it is an empirical question whether or not 
such types exist, i.e., whether there is a sim- 
ple structure among factor scores for the 
raters. In the present analysis dimensions of 
desirability can emerge even though raters 
are “complex,” i.e., have high factor scores 
on more than one dimension, which is the 
usual case in the area of mental abilities. In
this question of research strategy, it was con- 
sidered more fruitful to deal with a structured 
sample of items than to depend upon the 
isolation of a typology of raters. In any 
event, factor spaces obtained from the two 
approaches are closely related, and the dis- 
tribution of people in the space may be 
examined in the present method by the sub- 
sequent determination of factor scores. 


METHOD 
Subjects 


The present study is a reanalysis of desirability 
judgments collected by Klett (1957b) from 118 
manifestly disturbed mental hospital patients. For 
the present analysis 10 Ss were eliminated from the 
sample because of incomplete data. 


Procedure and Analysis 


The Ss had been asked to rate on a nine-category 
scale the desirability of 140 items purportedly representing
14 psychological needs. Categories ranged
from one for “extremely undesirable” through five 
for “neutral” to nine for “extremely desirable.” These 
items had been previously scaled by Edwards (1954) 
in constructing measures for the EPPS. 

In Klett’s (1957b) analysis of this material, the
scaling procedure of successive intervals (cf. Torgerson,
1958) was used, which in effect simultaneously
normalized the ratings of the 140 items on a common
base line. The scale values of the items were
essentially the means of these normalized ratings as
measured on the common scale. In the present analysis,
on the other hand, the original ratings of each
item were intercorrelated and factor analyzed. In a
sense the two analyses complement each other in
that the first one dealt with means and variances
to obtain an average scale for the group, and the
second dealt with intercorrelations to obtain the
structure of consistent individual differences in
judgments.


TABLE 1

PERSONALITY ITEMS SELECTED FOR FACTOR ANALYSIS

Item  EPPS Item No.  Scale            Median Rating

 1    17A            Deference        7.13
 2    178A           Endurance        8.28
 3    114A           Dominance        5.97
 4    2A             Deference        8.00
 5    68A            Order            8.21
 6    152A           Change           8.20
 7    41A            Achievement      5.97
 8    45A            Autonomy         4.10
 9    14A            Exhibition       6.80
10    18A            Succorance       7.82
11    221A           Nurturance       6.74
12    5A             Autonomy         8.41
13    214A           Heterosexuality  7.97
14    11A            Achievement      7.56
15    222A           Change           6.31
16    98A            Succorance       6.30
17    39A            Exhibition       5.44
18    82A            Intraception     6.47
19    94A            Dominance        4.55
20    224A           Heterosexuality  5.60
21    168A           Endurance        7.07
22    42A            Deference        6.09
23    91A            Affiliation      7.49
24    169A           Heterosexuality  5.78
25    146A           Affiliation      6.81
26    33A            Order            7.32
27    51A            Achievement      8.40
28    190A           Aggression       5.24
29    102A           Intraception     7.64
30    180A           Aggression       6.32
31    70A            Autonomy         5.67
32    65B            Endurance        7.09
33    93A            Succorance       5.36
34    76A            Affiliation      8.24
35    49A            Exhibition       4.47
36    206A           Nurturance       8.08
37    185A           Aggression       3.28
38    18A            Order            7.21
39    166A           Nurturance       7.42
40    104A           Dominance        7.34
41    197A           Change           6.60
42    112A           Intraception     6.17


To make the factor analysis of feasible size, 42 
items were selected from the 140 available, three 
from each of the 14 needs represented. By examining 
the distribution of ratings for each statement, one 
item was chosen with a high median desirability 
value for each need, one with an intermediate value, 
and one with a low value. No attempt was made 
to select in terms of item content, except to require 
three items for each need. Since the needs differed 
in their judged desirability, the median values for 
items representing one need overlapped with those 
of another to cover a wide range of desirability. This 
variation in judged desirability within each need 
favored the appearance of a general desirability fac- 
tor, but at the same time the choice of three items 
from each scale permitted the occurrence of cluster- 
ing according to needs. EPPS item numbers and 
median desirability ratings are presented in Table 1 
for the statements selected. Product-moment corre- 
lations among the ratings of these 42 items were 
factor analyzed by the grouping method (Thurstone, 
1947), with communalities estimated by the largest 
correlation in each column. 
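
The preparatory step described above, intercorrelating the item ratings and placing a communality estimate in each diagonal cell, might be sketched as follows. This is an illustration only: the ratings are random placeholders rather than Klett's data, the function name is hypothetical, "largest correlation" is assumed to mean the largest absolute off-diagonal value, and the grouping-method extraction itself is not reproduced.

```python
import numpy as np


def reduced_correlation_matrix(ratings):
    """Correlate items (columns) and estimate communalities on the diagonal."""
    r = np.corrcoef(ratings, rowvar=False)   # product-moment correlations among items
    off = np.abs(r - np.diag(np.diag(r)))    # absolute correlations, diagonal zeroed
    communalities = off.max(axis=0)          # largest correlation in each column
    reduced = r.copy()
    np.fill_diagonal(reduced, communalities)  # replace unities by the estimates
    return reduced, communalities


# Synthetic stand-in: 108 raters by 42 items on a nine-category scale.
rng = np.random.default_rng(0)
ratings = rng.integers(1, 10, size=(108, 42))
reduced, h2 = reduced_correlation_matrix(ratings)
print(reduced.shape, h2.shape)
```

The reduced matrix, rather than the matrix with unities in the diagonal, is what a common-factor extraction would then operate on.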


RESULTS 


After the extraction of nine factors, the 
residuals were found to be symmetrically distributed
about zero with 93% of the coefficients
included between .10 and -.10. Since
further factor extraction would have gener- 
ated small doublets of doubtful interpreta- 
tion, only nine factors were retained for 
rotation. Although one of them (Factor IX) 
was quite large, accounting for approximately 
42% of the common variance and 22.6% of 
the total variance, a general factor of desira- 
bility did not seem indicated. For comparison 
purposes, the first centroid factor was also 
extracted; it accounted for 22.8% of the total 
variance. The factors were rotated analyti- 
cally to oblique simple structure using the 
oblimax criterion (Pinzka & Saunders, 1954), 
and some minor additional rotations were 
performed graphically. The final rotated load- 
ings are given in Table 2. In the orthogonal 
solution obtained by the grouping method 
the nine factors accounted for the following 
percentages of total variance, respectively: 
4.03, 3.57, 3.39, 9.05, 3.19, 2.48, 2.57, 2.53,
22.65. These orthogonal factors were naturally
modified, sometimes considerably, in
the course of rotation, but they correspond 
roughly to the oblique factors in the order 
listed in Table 2. The intercorrelations among 
TABLE 2


FACTOR LOADINGS ROTATED OBLIQUELY


Factor
Variable I II III IV V VI VII VIII IX

1 .22 12 06 01 12 .26 15 35 3t 
-2 —.10 —.04 —.06 —.09 21 .03 01 16 39 
3 31 .06 03 A9 08 —.01 03 03 10 
4 01 —.00 —.18 AL —.06 08 19 10 40 
5 —.23 —.00 —.00 01 02 25 —.03 14 34 
6 05 —.00 27 —.13 03 —.18 55 = Si 
7 56 —.04 18 =.09 .06 07 —.01 —.05 08 
8 —.13 .03 .28 04 0 —.00 .08 03 —.27 
9 04 —.04 -26 -04 .25 —.05 —.04 —.04 .20 
10 —.04 .02 14 —.11 .02 —.00 01 AS 31 
11 43 04 39 —.11 .04 -00 —.14 —.13 .03 
12 —.19 —.04 02 —.02 .09 08 18 —.01 46 
13 —.22 — 13 19 04 sid. 07 —.00 08 12 
14 23 =—17 23 02 .08 —21 -10 04 36 
15 OL 06 35 07 —.03 16 18 16 —A7 
16 10 06 67 06 —.08 —.05 —.07 —.01 —.08 
17 —.03 25 48 07 -06 —.03 —.10 —.06 12 
18 64 =12 04 Al —.05 —.01 <A, 4 02 
19 10 04 25 22 ai 02 —13 —.02 —.19 
20 —.03 01 06 64 .00 02 —.05 05 —.11 
21 =21 15 — 04 —.11 AZ 27 —.18 -10 13 
22 —.02 24 —.09 -13 —.06 05 —.01 43 —.06 
23 03 03 14 16 =.17 31 04 03 29 
24 —.05 —.05 10 59 —.02 —.02 10 —.04 —.01 
25 —.00 01 .00 Mi OL 67 —.01 03 —.11 
26 —.01 —.02 —.02 —.12 03 .02 —01 52, 02 
27 —.12 —.01 —.02 —.04 —.03 14 10 06 56 
28 —.20 30 AS 23 07 03 05 —A6 07 
29 —.04 08 04 —.05 12 01 25 -05 44 
30 —.05 .09 =01 —.04 -56 —.07 —.01 05 O04 
31 —.16 .06 =07 as 44 .09 —.10 01 —.08 
32 .21 .23 —.22 .25 .00 —.06 —.16 —.14 61 
33 06 .29 ao 45 —.07 —.07 —12 .01 —.01 
34 —.05 —.05 ll —.14 —.02 .20 20 —.02 AS 
35 .02 58 3i —.01 .05 —.05 09 —.08 .07 
36 27 —.10 ll 7 = 12 .03 =11 04 AS 
37 —.16 62 .08 —.03 02 Atl .07 07 —.10 
38 06 —.08 .08 22 10 09 =i 35 —.03 
39 = 13 —.04 Be .08 +01 Al 07 -13 08 
40 05 —.14 —.00 —.02 54 —.10 10 —.10 .29 
41 —.01 .02 —.04 07 —.02 -10 69 —.00 —.02 
42 .03 13 —.06 .01 36 19 —.04 —.00 05 


primary factors (Table 3) are generally positive
and indicate the presence of a second-order
structure, but enough coefficients are sufficiently
low to suggest that a general factor of desirability
does not emerge at the second-order level either.

These factors summarize certain consistencies
among the original ratings and may be interpreted
in terms of different viewpoints as to what is
desirable. Factor loadings .25 or greater will be
emphasized in the interpretations. In general the
items do not cluster according to the specific scales
they were written to represent, but it should be noted
that there is no necessary relationship between
these factors of judged desirability and the scales
or item clusters determined from responses to the
inventory. The fact that the items do not cluster
according to specific needs when rated for
desirability does not reflect upon the homogeneity
of scales from Edwards’ inventory.
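
As an arithmetic check on the variance figures reported earlier in this section, the nine percentages of total variance for the orthogonal factors can be summed to give the common variance, and the share of the last factor computed from it. The variable names are illustrative; the numbers are those given in the text.

```python
# Percentages of total variance for the nine orthogonal factors, as reported.
pct_total = [4.03, 3.57, 3.39, 9.05, 3.19, 2.48, 2.57, 2.53, 22.65]

common = sum(pct_total)               # common variance as a percentage of total
share_ix = pct_total[-1] / common     # Factor IX as a proportion of common variance

print(f"{common:.2f}", f"{100 * share_ix:.1f}")  # prints: 53.46 42.4
```

The result agrees with the statement that Factor IX accounts for approximately 42% of the common variance and 22.6% of the total variance.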
TABLE 3
INTERCORRELATIONS AMONG PRIMARY FACTORS


Factor I II III IV V VI VII VIII IX

I 1.00 AL + —.03 11 15 <19 16 Se 03 
I lt 1.00  —.05 16 J: =A =03 =21 =—36 
Ill —=03 =,05 PLO 25 22 28 08 48 30 
IV At 16 35 1.00 49 20 33 24 12 
v 15 10 22 49 1.00 29 27 24 29 
VI 16 =i 28 20 29 100 17 49 58 
VII 16 —.03 08 33 27 17 1.00 30 AS 
VIII 7 =A AS 24 4 49 30 1.00 48 
IX 03  —.36 30 12.29 58 AS 48 1.00 


Factor I

18. To like to put yourself in someone else’s place and to imagine
    how you would feel in the same situation. (Int.)  .64
 7. To like to write a great novel or play. (Ach.)  .56
11. To like to have your friends confide in you and tell you their
    troubles. (Nur.)  .43
 3. To like to settle arguments and disputes between other people.
    (Dom.)  .31
36. To like to treat other people with kindness and sympathy.
    (Nur.)  .27


The common behavioral element viewed as 
desirable on this particular dimension might 
be labeled Interpersonal Concern or Inter- 
personal Sensitivity. The determination of 
factor scores would presumably uncover indi- 
viduals in the sample analyzed with consist- 
ent tendencies to rate these items highly 
desirable, even though the scale values for 
the total group are only moderate (see 
Table 1). 


Factor II

37. To like to make fun of people who do things you regard as
    stupid. (Agg.)  .62
35. To like to use words which other people often do not know the
    meaning of. (Exh.)  .58
28. To like to criticize someone publicly when they deserve it.
    (Agg.)  .30
33. To like to have your friends make a fuss over you when you are
    hurt or sick. (Suc.)  .29
17. To like to be the center of attention in a group. (Exh.)  .25


This dimension might be labeled Aggressive
Arrogance or Criticalness. The presence
of Items 33 and 17, however, suggests that
arrogant and critical behavior might possibly
be considered desirable in this instance because
of the social attention thereby derived
and the attendant gratification of certain narcissistic
and exhibitionistic needs. An alternative
and perhaps more descriptive factor
label, then, might be Narcissistic Exhibitionism.


Factor III

16. To like to have friends who feel sorry for you when you are
    sick. (Suc.)  .67
33. To like to have your friends make a fuss over you when you are
    hurt or sick. (Suc.)  .51
17. To like to be the center of attention in a group. (Exh.)  .48
11. To like to have your friends confide in you and tell you their
    troubles. (Nur.)  .39
15. To like to participate in new fads and fashions. (Chg.)  .35
35. To like to use words which other people often do not know the
    meaning of. (Exh.)  .31


8, 6, 39, 9, and 19 also have loadings above .25. 


This dimension, which appears to be some- 
what related conceptually to Factor II as a 
different aspect of ego-supporting needs, 
might be labeled Emotional Dependence or 
perhaps Succorance. In this connection Item 
15 is probably not viewed so much as an 
indication of the need for change as of the 
need for attention and support, so that in this 
case, as on some of the other factors, one 
need may be instrumental to the gratification 
of another. Thus the need for change may 
here be activated in the service of the need 
for succorance by subsidiation (Murray,
1938, p. 86), a process not considered in the
development of scales for one need at a time. 


Factor IV

20. To like to listen or to tell jokes in which sex plays a major
    part. (Het.)  .64
24. To like to read books and plays in which sex plays a major
    part. (Het.)  .59
32. To like to be able to put in long hours of work without
    distractions. (End.)  .25


This factor appears to be a doublet repre- 
senting Sexual Interests. The appearance of 
Item 32, and perhaps even the next three 
items in order (28, 19, 38), is interesting 
from a psychoanalytic viewpoint. The ab- 
sence of Item 13 dealing with “love for the 
opposite sex” suggests a possible differentia- 
tion in the desirability of love relationships 
and substitute sex interests. 


Factor V

30. To like to attack points of view that are contrary to yours.
    (Agg.)  .56
40. To like to argue for your point of view when it is attacked by
    others. (Dom.)  .54
31. To like to do things your own way without regard to what
    others may think. (Aut.)  .44
42. To like to judge people by why they do something—not by what
    they actually do. (Int.)  .36
 9. To like to tell other people about adventures and strange
    things that have happened to you. (Exh.)  .25


This factor might represent something like 
Intellectual Aggressiveness or Active Intellec- 
tual Independence. 


Factor VI

25. To like to write letters to your friends. (Aff.)  .67
39. To like to show a great deal of affection toward your friends.
    (Nur.)  .41
23. To like to have strong attachments with your friends.
    (Aff.)  .31
21. To like to stick at a job even when it may seem as if you are
    not getting anywhere. (End.)  .27

1 and 5 also have loadings above .25.


This dimension, which might be labeled
Interpersonal Involvement, reflects a viewpoint
in which affiliative and affectionate behavior
is judged desirable.


Factor VII

41. To like to move about the country and live in different
    places. (Chg.)  .69
 6. To like to travel and see the country. (Chg.)  .55
29. To like to analyze your own motives and feelings. (Int.)  .25


This factor appears to be a highly specific
doublet reflecting Interest in Travel. The 
need for change may be involved in this di- 
mension, but it is of interest to note that 
Item 15, dealing with participation “in new 
fads and fashions,” received a low loading. 


Factor VIII

26. To like to have your meals organized and a definite time set
    aside for eating. (Ord.)  .52
22. To like to be in groups where someone else takes the lead in
    deciding what you are going to do. (Def.)  .43
38. To like to have your life so arranged that things run smoothly
    and without any change in plans. (Ord.)  .35
 1. To like to conform to custom and avoid doing things that
    people you respect might consider unconventional. (Def.)  .35


This dimension might be labeled Compul- 
sive Conformity. It reflects a viewpoint in 
which conforming deference, a kind of de- 
tached disinterest and orderliness are con- 
sidered desirable. 


Factor IX

32. To like to be able to put in long hours of work without
    distractions. (End.)  .61
27. To like to do your very best in whatever task you undertake.
    (Ach.)  .56
 6. To like to travel and see the country. (Chg.)  .51
34. To like to be loyal to your friends. (Aff.)  .48
12. To like to be able to come and go when you want to. (Aut.)  .46
36. To like to treat other people with kindness and sympathy.
    (Nur.)  .45
29. To like to analyze your own motives and feelings. (Int.)  .44
 4. To like to learn what great men have thought about various
    problems in which you are interested. (Def.)  .40
 2. To like to start in and keep working on some assignment until
    it is completed. (End.)  .39
14. To like to be recognized as an authority in some job,
    profession, or field of specialization. (Ach.)  .36

5, 1, 10, 40, 23, and 8 also have loadings above .25.


This large factor appears to reflect an
Achievement-Oriented, Middle-Class Stereotype
of desirable behavior or a kind of Protestant
Ethic. This popular conception favors
a hard-working (Items 32, 2), success-oriented
(27, 14), self-broadening (6, 4),
friendly (34, 36, 23, 10), personally independent
(12 and perhaps 40), socially conventional
(1), orderly (5), responsible (8)
orientation.


The Factors in Relation to the Group 
Desirability Scale 


Median desirability ratings computed for 
the total sample are presented in Table 1 
for each item. For correlational purposes, 
these median ratings are very similar to the 
more refined successive intervals scale values 
used by Klett (1957b). This median desira- 
bility scale was correlated with each of the 
nine factors in turn, and the coefficients were 
found to be -.04, -.65, -.30, -.33, -.11,
.12, .19, .15, and .76, respectively. The multiple
correlation for the nine factors in predicting
the median values was .96; the B-weights
were -.05, -.44, .15, -.03, .24, .06,
.29, and .78, respectively. Thus, the vector
of medians can be oriented in the nine- 
dimensional factor space in such a way as 
to account for almost all of its variance. This 
is not a necessary finding, since the 42 
median values represent averages of 42 fre- 
quency distributions, and the means of these 
distributions were essentially eliminated in 
the present analysis of intercorrelations. 
However, one might expect the average de- 
sirability scale for the judging group to be 
related to the component dimensions, par- 
ticularly to the larger or dominant factor, 
since a point of view prevalent in the sample 
under study should make a larger contribu- 
tion to the average scale. 
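
The multiple correlation described in this section, in which the 42 median ratings are predicted from the nine columns of factor loadings, can be sketched as an ordinary least-squares problem on standardized variables. This is a minimal illustration under stated assumptions: the loadings and medians below are random placeholders rather than the values of Tables 1 and 2, and the function name is hypothetical.

```python
import numpy as np


def multiple_correlation(loadings, medians):
    """Standardized regression weights and multiple R for medians on loadings."""
    # Standardize each predictor column and the criterion across the items
    x = (loadings - loadings.mean(axis=0)) / loadings.std(axis=0)
    y = (medians - medians.mean()) / medians.std()
    # Least-squares weights; no intercept is needed after centering
    beta, *_ = np.linalg.lstsq(x, y, rcond=None)
    fitted = x @ beta
    r_mult = np.corrcoef(fitted, y)[0, 1]  # multiple correlation
    return beta, r_mult


# Synthetic stand-in for a 42-item by 9-factor loading matrix and item medians.
rng = np.random.default_rng(0)
loadings = rng.normal(scale=0.3, size=(42, 9))
medians = 6.5 + loadings @ rng.normal(size=9) + rng.normal(scale=0.2, size=42)
beta, r_mult = multiple_correlation(loadings, medians)
print(beta.shape, round(r_mult, 2))
```

A multiple correlation near unity, such as the .96 reported, means that the vector of medians lies almost entirely within the space spanned by the factor loadings.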


Response Biases and Possible Artifacts 


In the analysis of ratings based on a fixed 
number of categories, there are several possi- 
ble sources of bias which could produce arti- 
facts in the factor space. For example, ratings 
of items with medians near either extreme of 
the scale must be skewed, and this differen- 
tial skewness has a small but consistent effect 
on the corresponding correlation coefficients, 
which could generate spurious dimensions 
similar to the “difficulty” factors in mental
test analysis. Such factors should be consist- 
ently related to extreme median values, as 
are Factors II and IX in the present analy- 
sis; the other seven factors, however, involve 
mostly items from the same upper-middle 
range (Categories 5 to 7). Factor II includes 
some, but not all, items near the undesirable 
extreme and correlates -.65 with the medians;
Factor IX involves mostly desirable
items and correlates .76 with the medians. 
However, if a rotation is made in the plane 
of Factors II and IX, essentially equating 
IX with the medians, the new Factor II 
would then include primarily Items 35, 37, 
and 32, which are fairly consistent psycho- 
logically with the old Factor II but which 
involve both extremes of the rating scale. 
This leaves Factor IX highly related to the 
medians, and possible spurious influences 
should be considered in interpreting it. How- 
ever, it is doubtful that differential skewness 
would generate the largest factor, especially 
when skewness was not extreme in the pres- 
ent data: the rating distributions for the 42 
items selected had no modes in either extreme 
category and only 14 modes in either Cate- 
gory 2 or 8. 

Another source of potential bias stems from 
consistent rater preferences for specific cate- 
gories, such as a predominant use of the 
extremes or the middle range in making judg- 
ments. Most of the raters in the present sam- 
ple, however, appeared to use a wide range 
of categories, with only about a dozen Ss 
showing noticeable preferences for specific 
categories or restricted ranges. Although no 
more than two of these Ss appeared to have 
markedly similar response patterns, consistent 
but slight category biases operating throughout
the sample could distort the factor space
and should be considered in the interpretation.
In this connection, it is interesting to
note that, when Factors II and IX are rotated
as mentioned above, the new Factor II has
high loadings for items from both extremes 
and could reflect intensity of rating. 

Such rating biases thus offer alternative 
interpretations for Factors II and IX, but
since there is little evidence for their occur- 
rence in the present data, the simple struc- 
ture orientation is preferred. 


DISCUSSION


The finding of divergent viewpoints about 
social desirability is obviously highly depend- 
ent upon the sample of individuals studied. 
For example, one could probably assemble 
a group of people with sufficient homogeneity 
that their ratings of desirability would reflect 
only one consistent point of view. The issue 
in question, however, is not whether a single
scale can be obtained for some particular 
population, but whether or not such an aver- 
age scale is of sufficient generality to use in 
personality measurement to control for the 
tendency to respond in a socially desirable 
way. The present finding of several different 
dimensions of desirability suggests that such 
an average scale would not provide an ade- 
quate control in individual assessment and 
would have inconsistent effectiveness from 
subject to subject. Equating in terms of a 
large dimension which seemed to reflect a 
popular set of values for Western culture, 
such as Factor IX in the present analysis, 
might prove adequate for studying inter- 
relations among variables at the group level; 
but for individual assessment the controls 
could be quite ineffective, particularly for 
people whose perceptions of desirability dif- 
fered markedly from the group consensus. 
The multidimensionality of social desira- 
bility ratings, the major finding of the pres- 
ent study, naturally must be cross-validated 
on normal populations. The stability and gen- 
erality of the particular nine factors found 
here would be of interest, but it might be 
expected that several new and different points 
of view would be uncovered by varying the 
type of subjects sampled and items rated.
However, it must also be considered that 
wide group differences could occur with only 
minor attendant changes in a factor structure 
which spanned several points of view. Morris 
and Jones (1955), for example, found that, 
although the ratings of various value orien- 
tations differed considerably across cultures, 
the factors obtained proved to be very similar. 
In a society with widely differing social 
and subcultural influences, it is not surprising 
that there should be several dimensions of 
social desirability. However, the present find- 
ing of as many as nine factors, on a fairly 
restricted sample of 42 items and 108 cases, 
suggests the possibility that the consistent 
variation is not so much among clearly defined
social groups, such as high school students
or basic airmen, as among individual
points of view. It is true that this diversity 
in viewpoint might be related to peculiarities 
in the judging sample, but in that case the 
factors could be important in their own right 
in relation to psychopathology. However, as 
Edwards (1957b, p. 48) has noted, his instructions
for rating social desirability are
similar to Rosen’s (1956) for personal desir- 
ability, in that the S’s own judgments of the 
desirability of traits in others are emphasized. 
It might be expected that individual differ- 
ences in such personal views of desirability 
would generate many factors, even for rela- 
tively homogeneous, normal populations. 
Thus, since the term “social desirability” ap- 
pears somewhat misleading, even when dis- 
cussing Edwards’ (1957b) results, it would 
seem appropriate to substitute another label 
(cf. De Soto, Kuethe, & Bosley, 1959; Rosen, 
1956) which emphasizes differences in indi- 
vidual viewpoints of desirability. 

The discussion thus far has emphasized the 
reduction of error variance in personality 
measurement by controlling the effects of the 
tendency to respond in a socially desirable 
way. This response tendency, however, may 
also be viewed as a stylistic personality vari- 
able (Jackson & Messick, 1958), and in this 
connection, rather than trying to eliminate 
the effect, attempts might instead be made 
to measure the individual’s position on the 
different dimensions. If, for example, meas- 
ures of differential tendencies to respond in 
various desirable ways can be formulated, 
they may be used on the one hand as person- 
ality variables in their own right and on the 
other as multiple control measures to partial 
differential desirability biases out of content 
scales. Since a reliable tendency to respond 
in desirable ways might in itself reflect cer- 
tain traits or needs in the respondent, much 
research is needed to develop control tech- 
niques (cf. Helmstadter, 1957), which do not 
require, as partial correlation does, that the 
corrected content scale be uncorrelated with 
the response tendency. It must also be re- 
membered that the factors discussed in the 
present paper represent dimensions of judged
desirability, which may or may not be related 
to dimensions of the tendency to respond in 
desirable ways, if such exist. 

The proposed treatment of desirability di- 
mensions as personality variables is similar 
in many ways to the diagnostic use of “ideal-self”
descriptions, which have been widely
employed in evaluating outcomes of psycho- 
therapy (Rogers & Dymond, 1955). Osgood, 
Suci, and Tannenbaum (1957, Ch. 6) have 
also utilized ratings on the Semantic Differential,
which has a large evaluative component,
nent, to compare conceptual structures in 
different personalities and to study person- 
ality change during psychotherapy. Inciden- 
tally, this study of differences in individual 
conceptions appears fruitful even though 
comparisons at the group level show nearly 
identical factor structures for different cul- 
tures and pathologies (Osgood, Suci, & Tan- 
nenbaum, 1957, pp. 170-175). Thus, the 
presence of consistent individual differences 
in the perception of desirability offers the 
possibility that the seemingly objective task 
of rating the desirability of a trait may be 
useful in making inferences about the person- 
ality of the rater. 


SUMMARY 


Attempted controls in personality inven- 
tories for the tendency to respond in a so- 
cially desirable way usually assume that the 
domain of desirability is unidimensional. The 
present study attempted to evaluate this 
assumption and to explore the nature of any 
multidimensional structure uncovered. Rat- 
ings of the desirability of personality items 
from the Edwards Personal Preference Sched- 
ule were intercorrelated and factor analyzed. 
Nine factors were extracted and rotated to 
oblique simple structure. The factors were 
labeled and interpreted in terms of different 
“points of view” as to what is desirable. The 
finding of several dimensions of desirability 
suggests that the equating of items in terms 
of a single average desirability scale would 
leave much of the consistent variance of the 
ratings uncontrolled. Such a control method 
would provide inconsistent effectiveness for 
individual assessment, depending upon the S’s 
conformity to the group consensus of desira-
bility.


The concept of mental health defies definition
despite efforts to render it operational
and objective. The intent of this paper is to 
suggest an alternative set of concepts, which 
may prove more useful to the growing field 
of research and technology committed to 
mental health pursuits. The need for such 
concepts is clear. The urge of man to mold 
and perfect life has in the past two decades 
shifted, particularly in Western lands, from 
an attack on sociopolitical and economic 
forces to the intrapsychic and interpersonal 
forces believed to prevent the attainment of 
“the good life” (Duhl, undated). An example
of this is given in the following quotation
from a brochure issued by a mental health 
association in one of the northeastern states: 
“IF EVERYONE ENJOYED MENTAL 
HEALTH, there would be no crime, no in- 
sanity, and no deepseated unhappiness born 
of a feeling of frustration or of a sense that 
one does not belong.” It would appear that 
we have by now badly overloaded the concept 
of mental health to the point where it has 
become virtually useless either for action or 
research. 

Much of the early epidemiological work in 
this field has been concerned with cases of 
demonstrable pathology, i.e., patients accepted 
for hospital or outpatient care. From the 
viewpoint of such studies, mental health is 
the absence of mental illness, but even here 
there is the problem of determining generally 
acceptable criteria for deciding what is or is 
not an instance of genuine pathology. Problems
multiply when we turn our attention
from hospitals and clinics to the general
population. The few statistics regarding
distribution of problems in the general popu-
lation seriously challenge our present in-
adequate concepts of mental health and emo-
tional illness. For example, the “psycho-
pathology of everyday life” has been demon-
strated recently by Leighton (1956), who
found that psychiatric appraisal of a sample
of the population of a Canadian town re-
vealed psychiatric impairment among at least
37% of the citizens.


1 While maintaining responsibility for the asser-
tions contained herein, the author wishes to express
appreciation to Erich Lindemann, whose perspective
and great vision have led to this attempt at clarification.
Gratitude is also given to Frank Babbott
for assistance with epidemiological concepts, to
Frank and Joan Babbott for editorial help, and
for their critical comments and suggestions to L.
Borstelmann and W. Hollister.

Jahoda (1955) and Lindemann (1953), 
among others, have laid down explicit criteria 
for positive mental health. Their definitions 
describe the mental health of the individual 
in a social context. They seek to avoid enclos- 
ing him in the strait jacket of statistical nor- 
mality or requiring him to acquiesce to 
malign environmental circumstances. In their 
definitions, adequate perception of reality, 
ability to adjust to inevitabilities and to 
master resolvable difficulties, and—for Linde- 
mann—attainment of personal satisfactions 
and tranquility without impairment of others’ 
functioning are viewed as prime character- 
istics. Thus, mental health is defined not as 
the absence of symptoms, but rather as the 
presence of three qualities in sufficient abun- 
dance: good reality testing, active mastery 
of the environment, and a feeling of content- 
ment not based on victimization of others. 
At this point we are left with many ques- 
tions regarding the interrelations of the three 
qualities. Does a deficiency in one imply that 
an individual no longer enjoys good mental 
health? Or must all three be reduced for ill 
health to ensue? Is any one—reality testing, 
for instance—a prerequisite for the other? 

Such questions arise, in part, because three 
different basic meanings have been attached 
to the concept of mental health. In order 
not to perpetuate such confusion, the bal-
ance of this paper will concern itself with
the three connotations of mental health, using
a different term for each. First, there is that 
meaning attached to the term “mental health” 
which appears to refer to the more enduring 
condition of the personality (hereafter re- 
ferred to as “soundness”); second, there is 
that meaning which appears to refer to the 
more immediate state of health, equilibrium, 
or current effectiveness of the person in terms 
of the reaction of the organism to a stressor 
agent (hereafter referred to as “well-being”);
third, there is that meaning which appears 
to refer to the ability of the individual to 
avoid illness even when exposed to illness 
producing agents (hereafter referred to as 
“emotional stability”). 


SOUNDNESS


By “soundness” is meant the level of inte- 
gration of the general, more enduring person- 
ality structure. It is assessed in the perspec- 
tive of the culturally shared ideal of the 
optimal in growth for any age level or other 
significant role dimension. Thus, in our cul- 
ture it might include such attributes as 
adaptability, well developed social skills, a 
high degree of initiative, a sense of social 
responsibility, and a general perfusion of 
modest optimism in one’s perception of man’s 
struggle with nature and himself.² It is believed
that such ideal attributes, though culture
bound, tend to vary inversely with
severity of psychiatric pathology. Perhaps 
psychiatric nosology is itself culture bound. 
Possibly such ideal characteristics are indeed 
hallmarks of emotional maturity, reflecting 
the superiority of our culture over others. 
It seems more probable, however, that in this 
society—as in most, if not all, others—ideal 
types have emerged which exemplify one or 
more of the several sound integrations of 
which man is biologically capable as he 
attempts to satisfy needs, pattern behavior, 
form social groupings, and find secondary 
satisfactions in interpersonal contacts. 

Certain possible universal attributes of 
integration suggest themselves. The writer 
believes the attributes of an individual with
a high soundness level can be expressed in his
own terms pretty much as follows: “I feel
I belong; I am a worthwhile person; I know
where I stand.” The external observer might
choose to express the first attribute in terms
of the clarity of the social role of the indi-
vidual and the value ascribed to that role
by the individual and others. He might then
speak of the personal sense of worth of the
individual and add an assessment of the ap-
propriateness of role behavior as compared
with the expectancies of the significant refer-
ence group.³


2 A somewhat similar conception is presented by
Shoben (1957) in a model of integrative adjust-
ment. He also includes self-control which, as will be
seen, this writer believes is more appropriately con-
sidered separately as one characteristic of stability.


WELL-BEING 


The concept “well-being” refers to the cur- 
rent state of equilibrium between the indi- 
vidual and the social-emotional environment 
impinging upon him. It is postulated that 
almost all individuals exhibit virtually con- 
tinuous fluctuations in well-being. It is 
further believed that nearly all individuals 
undergo repeated states of ill-being and that 
such states are manifested in one or more of 
the commonly recognized patterns described 
in articles on combat fatigue, disaster fatigue, 
or psychological shock reactions observed in 
air raids and other mass stress situations 
(e.g., Janis, 1951). Finally, it has been noted 
in work with individuals undergoing emo- 
tional crises that states of ill-being tend to 
be time limited, the tendency being to return 
to a state of equilibrium whether or not treat- 
ment is available. The time period appears 
to depend upon the nature, severity, and 
duration of the stress. 

Those who cannot arrive at an equilibrium 
following a state of ill-being are apt to suffer 
a serious decrement in soundness. This ob-
servation is supported by experience with
combat fatigue cases during World War II,
who developed a higher proportion of chronic
emotional problems when removed perma-
nently from combat units as compared with
similar cases which were quickly returned to
their units after a brief respite during the
Korean conflict. Therefore a mental health
program concerned with prevention and
health maintenance is justified in devoting
itself to the brief care of those suffering
temporary impairment of well-being in an
effort to forestall reductions in soundness
levels.


3 Some reinforcement for such speculations is of-
fered by Schutz (1958), who suggests, on the basis
of a large number of studies of group behaviors,
that three core tasks in almost any interpersonal
context relate to the questions: (a) Do I have or
can I gain membership? (inclusion); (b) What is
the structure or pattern of relationship into which
I must fit? (control); and (c) To what extent can
I safely express my personal needs in this relation-
ship? (affection). Schutz further presents a schema
in which each of the three interpersonal need areas
is seen to vary according to whether it is handled
in an “ideal” fashion, is “deficient” or “excessive” to
a moderate degree, or is dealt with pathologically.
Thus a pathological underdevelopment is believed to
accompany schizophrenia.

When is a condition of ill-being in a popu- 
lation serious enough to warrant attention? 
This is one of the most challenging questions 
facing the field of community mental health. 
It can be satisfactorily answered only when 
base or “endemic” rates of prevalence and 
incidence for specific states of ill-being have 
been established among major population 
groupings. Meanwhile, many of those work- 
ing with more or less closed communities, 
such as mental hospitals, schools, and prisons, 
are usually very clear in their own minds 
when institutional tensions affecting the well- 
being of large numbers in their care have 
reached “epidemic” proportions. 

In order to establish base rates, it is neces- 
sary for there to be something to count. In 
one study, which led ultimately to a clearly 
defined preventive program, it was possible 
to count the number of withdrawals of stu- 
dents from nursing training. Generally ac- 
ceptable indices of emotional distress are not 
always so readily available. And yet common 
sense and current societal preoccupations 
suggest that they do exist. Thus, for exam- 
ple, acts of violence, such as murder or 
assault, should justifiably be counted as in- 
stances of poor mental “health” (i.e., emo- 
tional ill-being) whether or not they occur 
in individuals of high or low soundness. Rates 
of occurrence of such behavior may also be 
taken as indications of the general mental 
“health” (well-being) of a population. Such 
indices may be of greater import in the 
evaluation of mental health programs than 
figures concerning cases treated in and out 
of the mental hospital. So it is important to 
study acute states of ill-being or crises on 
their own merits (irrespective of degree of 
more central personality dysfunction) in 
order to learn something about their etiology
in the individual and their frequency and
effects in the population. 


EMOTIONAL STABILITY 


The distinction between general soundness
and well-being, the latter viewed as response
to environmental stress, is derived from the 
ecologic notion that health or illness are 
states of equilibria between an organism and 
its environment (Burnet, 1953). The ability 
of the individual to cope with stress deter- 
mines whether or not well-being is affected. 
While emotional well-being can be maintained 
under either ordinary or especially stressful 
circumstances, ill-being can result only from 
untoward reaction to stress. Stability thus 
becomes the ability of the individual to cope 
with specific environmental stresses while 
maintaining a state of emotional well-being. 
Stability levels of individuals can be com- 
pared inferentially by noting their behaviors 
under similar stressor conditions. Ultimately 
it may be possible to estimate stability levels 
by psychological laboratory tests, which have 
been found to correlate highly with adequacy 
of response to the specific stress or class of
stresses in question.

As Whitehorn (1956) emphasizes, indi-
viduals differ markedly from one another in
respect to situations they perceive as stress-
ful. Given such variability, a phenomenologi-
cal approach to the concept of stress suggests
itself. Nevertheless, it is believed preferable
for present purposes to consider stress in
terms of forces impinging upon the indi-
vidual, without regard to his reaction. Such
a differentiation of stress from the organism’s
reaction permits clear-cut identification and
ultimate categorization of psychological stressor
agents. In preventive work the presence
or absence of the stress in a population is
worthy of study, apart from the reaction in
any particular case.

Borrowing from the field of epidemiology,
we have attempted to distinguish between
four types of emotional stability.⁴


General Stability 




The “general stability level” refers to the
capacity of the individual to maintain equi-
librium or well-being under conditions of
stress without regard to the nature of the
stress itself. A “specific stability area,” on
the other hand, refers to the ability to with-
stand a specific kind of emotional stress. Any
specific area of stability, therefore, must be
defined in terms of some specific stressor
agent. Specific stability is discussed more
fully below.


4 In epidemiology it is usual to think of immunity
in four categories: inherited, general immunity; ac-
quired, general immunity; inherited, specific immunity;
and acquired, specific immunity.

Much work needs to be done in order to 
arrive at an adequate appraisal of the ingre- 
dients of the general stability level. In this 
regard, it should be worthwhile to examine 
the possible general “immunizing” effects of 
certain kinds of prolonged, insight-oriented 
psychotherapies. It may ultimately be dem- 
onstrated, for example, that classic psycho- 
analysis represents not only a therapy, aimed 
at relieving a specific state of emotional ill- 
being, or psychological exercise, which raises 
the general soundness level, but also a form 
of “immunization,” which helps the indi- 
vidual ward off a wide variety of potentially 
noxious stimuli by increasing general stability 
or supporting specific stability areas. 

Evidence has been advanced (Eysenck, 
1952), which suggests that recovery or im- 
provement rates are roughly 60% regardless 
either of the nature, depth, or length of the 
treatment or of whether it is carried out by 
psychoanalysts or family physicians (who 
may prescribe a few weeks in the general 
hospital for rest and recuperation). Such 
comparisons obviously suffer from the fact 
that uniform criteria of successful treatment 
do not exist. Nevertheless, once such criteria 
are available, it is not outside the realm of 
probability that so-called insight therapies 
will be found to resemble many other pallia- 
tives in the degree to which they restore 
emotional well-being, but will be proved sig- 
nificantly better able to render patients more 
generally stable under subsequent stress con- 
ditions, with the result that such patients are 
less apt to suffer recurring states of emotional 
ill-being. 

The distinction between general stability 
and soundness is fuzzy at best. It is believed 
that soundness has a moderately high positive 
correlation with general stability but that the 
relationship is not univocal. It is assumed
that high general stability can exist in an
individual of only moderate soundness, es-
pecially when the personality defenses of the
individual are such as to insulate him from
the full impact of stress. Such defenses may 
not always be considered to reflect a high 
soundness level though they may be very 
effective aids to general stability. They in- 
clude such reactions as repression, denial, 
selective inattention, and possibly even an 
ability under some circumstances to resort to 
the use of phantasy when reality becomes too 
bleak. Examples of apparently effective use 
of both denial and phantasy have been ob- 
served in some adult polio patients in respira- 
tors. These individuals appear to avoid the 
full impact of their predicament, whereas 
other patients not capable of so doing dis- 
play depression and other signs of being 
overwhelmed by the all-too-real catastrophe. 


Specific Stability 


Studies of populations undergoing marked 
shifts in sociocultural environments may 
shed increasing light on the relationship be- 
tween general and specific stability levels. It 
seems probable that individuals with high 
general stability will demonstrate areas of 
low specific stability when exposed to certain 
forms of stress. Careful review of soldier and 
civilian populations in wartime, of forced mi- 
grations of whole peoples, and of groups 
known to be either upwardly or downwardly 
mobile from a socioeconomic standpoint may 
suggest fruitful lines of future inquiry. 

In order to study areas of specific stability, 
it is necessary ultimately to arrive at some 
categorization of types of stress conditions 
to which large segments of the population are 
routinely exposed. Some suggestions are to 
be found in the literature. Birth trauma is 
given some emphasis by certain psychoana- 
lytic theorists. Perhaps here is the biologic 
prototype of the entry of the individual into 
a new, unfamiliar, confusing, and potentially 
harmful setting. Reactions to frustration or 
interference with goal-directed behavior have 
also been studied in the laboratory and find 
their counterparts in clinical patients, who 
become anxious or depressed when advance- 
ment up the economic or social ladder seems 
no longer possible. Lindemann’s (1944) ob- 
servations of reactions to bereavement have 
led to a formulation of emotional hazard 
which emphasizes sudden alterations or dis-
ruptions in key role relationships.


Inherited Stability


The distinction between inherited and ac- 
quired immunity—however useful for public 
health—may seem simply another way to 
raise the hackneyed controversy of nature vs. 
nurture, heredity vs. environment. It may be 
possible, nonetheless, to make some meaning- 
ful distinctions without promoting further 
useless controversy. At the outset, it may be 
helpful to remind ourselves that behavior 
does not exist in the absence either of the 
organism or its environment. 

There are those (e.g., Erikson, 1950) who 
suggest that children are born with tempera- 
mental response tendencies (heredity, if you 
will) which affect parental responses to them 
(i.e., environment) in various ways, depending
in part upon what the mother herself
brings to the relationship. If the mother’s 
initial attempts at mothering are discouraged 
by certain response tendencies in the child, 
permanent damage may be done to the affec- 
tional ties between them. A recent study by 
Klebanoff (1959) supports this viewpoint. 
He found that mothers of children with brain 
damage and of schizophrenic children re- 
sponded similarly to questions on a child 
rearing attitude scale and that both, in turn, 
showed significantly more pathological atti- 
tudes toward child rearing and the family 
than did mothers of normal children. 

If some children are born with response 
tendencies rendering them highly vulnerable 
to maternal rejection, they may be described 
as having inherited a kind of emotional in- 
stability and therefore as being susceptible 
to subsequent emotional maladjustment. Im- 
pairment of the mother-child relation as a 
result of the way in which the mother han- 
dles the child’s inadequacy will result in a 
child with an extremely low soundness level 
(Fries, 1944; Mittelman, 1954). Obviously
such speculations regarding inherited stability 
and instability must be tested ultimately by 
the detailed observations of newborn children, 
such as those of Fries (1944) who correlated 
response tendencies of neonatal infants with 
the specific mothering patterns required by 
each type of child to insure healthy psycho- 
logical development. 


Acquired Stability 


In epidemiology the concept of acquired 
immunity implies that the organism has had
some experience with the stressor agent and,
in the process, has developed successful (i.e., 
nonillness producing) ways of coping with it 
for some period of future time. During the 
period of immunity, the organism and the 
agent are able to coexist without undue harm 
to the former, or the invader is destroyed. 

In this area, as in so many instances where 
we are concerned with the relationship be- 
tween sickness and health, it is desirable to 
begin first with the observations of groups 
of people who have become sick under demonstrable
stress conditions. Having noted
the reactions of those who become markedly 
ill, it should become possible to shift our 
attention to those whose emotional well-being 
is only slightly or temporarily impaired by 
similar stress. Finally, it may be possible to 
discover situations in which small amounts 
of stress render individuals increasingly able 
to cope with more massive doses of similar 
stresses in the future. The following state- 
ment by Frank (1957) from a national maga- 
zine expresses the belief that sibling rivalry, 
known in its virulent form to play a part in 
disturbing emotional equilibrium, may under 
certain conditions act so as to confer a kind 
of stability in a specific area: 


In families where each child feels he has reasonable
access to his parents’ attention and affection,
sibling rivalry is not only inevitable but desirable.
It is inevitable because every child, at least initially,
wants to be his parents’ favorite. It is desirable
because every child is bound to meet conflict sooner
or later, at school if not [...] better place in which to
[...] for both parties. Sibling rivalry is one
of the major ways by which children can develop
a sense that the world is, within limits, a trustworthy
place in which they can feel reasonably
secure (p. 59).


This point of view must be examined and 
carefully tested within the framework of spe- 
cific responses to specific stress conditions. 
Otherwise, there is the possibility that it will
gain popular support prematurely and join
the growing list of changing fashions in child 
[rearing]. The danger of such uncritical acceptance
is readily seen after consideration
of the probability that, in some circum-
stances, small stresses do not immunize, but
rather cumulate and eventually overwhelm
the individual. 


In the search for specific areas of acquired
stability, initial priority must go to the search
for stressor agents, a search that is well 
underway. Ultimately, however, it becomes 
necessary to relate major stress areas, which 
have been shown to produce casualties, to 
prototypical situations in the early life de- 
velopment of the organism that occasion mild 
stress without disabling the child. For ex- 
ample, while all of us learn in a variety of 
ways a multiplicity of social skills, relatively 
little has appeared in the literature regarding 
such skills and how they are established.⁵
In this search, it may be desirable to examine
a wide variety of disturbances of parent-child
relationships noted in the child guidance
literature because each may have its counter- 
parts in the repertoire of role interactions of 
most, if not all, parents and children. Such 
a search might have two desirable results: 
first, it might suggest conditions under which 
stressor agents exceed a safe tolerance level; 
second, it might point to conditions under 
which individuals are not exposed to sufficient 
amounts of stress, thus rendering them less 
stable in similar situations in later life when 
more enduring personality patterns have been 
developed. 


SUMMARY 


This discussion has put forth the sugges- 
tion that the search for definitions of mental 
health be abandoned, at least temporarily, 
in favor of a differentiated approach to the 
problem of human adjustments and malad- 
justments. It has been suggested that current 
concepts of mental health and illness are 
primarily concerned with the long standing 
emotional or psychological condition of the 
individual, for which the term “soundness”
is proposed. The immediate state of “well-” 
or “ill-being” of the individual at any par- 
ticular time is proposed as suitable for more 
intensive study, both because of its possible 
effect on the soundness level in some in- 
stances, and because it should be considered 
in and of itself as a period of acute illness 
or malfunctioning and worthy of note in 
population studies. The vulnerability of the 
individual to environmental stress has been
discussed in terms of a possible “general sta-
bility factor,” related to but not identical
with soundness level, and of areas of “spe-
cific stability” which offer a promising field
for further study in terms of possible impli-
cations for child rearing and educational
practices. Perhaps by following this more
differentiated approach to mental health, we
may find that apparently conflicting doc-
trines of today may be viewed as much less
contradictory in a more adequate viewpoint
of tomorrow.


5 One notable exception to this statement is Cam-
eron’s (1947) work on the social origins of mental
illnesses, which suggests that emotional breakdown
is essentially founded upon inadequate or inappropriate
social perceptions and skills.


REFERENCES 


Burnet, F. M. The natural history of infectious
disease. (2nd ed.) Cambridge: Cambridge Univer.
Press, 1953.

Cameron, N. The psychology of behavior disorders.
Boston: Houghton Mifflin, 1947.

Duhl, L. Psychiatry and the community. Unpub-
lished manuscript, National Institute of Mental
Health, undated. (Mimeo.)

Erikson, E. H. Childhood and society. New York:
Norton, 1950.

Eysenck, H. J. The effects of psychotherapy: An
evaluation. J. consult. Psychol., 1952, 16, 319-324.

Frank, J. D. Are you a guilty parent? Harper’s,
1957 (April), 214, 56-59.

Fries, M. Psychosomatic relationships between
mother and infant. Psychosom. Med., 1944, 6,
159-162.

Jahoda, Marie. Toward a social psychology of
mental health. In A. M. Rose (Ed.), Mental
health and mental disorder. New York: Norton,
1955.

Janis, I. Air war and emotional stress. New York:
McGraw-Hill, 1951.

Klebanoff, L. B. Parental attitudes of mothers of
schizophrenic, brain injured and retarded, and
normal children. Amer. J. Orthopsychiat., 1959.

Leighton, D. C. The distribution of psychiatric
symptoms in a small town. Amer. J. Psychiat.,
1956, 112, 716-723.

Lindemann, E. Symptomatology and management
of acute grief. Amer. J. Psychiat., 1944, 101,
141-148.

Lindemann, E. Mental health—fundamental to a
dynamic epidemiology of health. In I. Galdston
(Ed.), The epidemiology of health. New York
Acad. Med., 1953.

Mittelman, B. Motility in infants, children, and
adults. The psychoanalytic study of the child,
1954, 9, 142-177.

Schutz, W. FIRO: A three-dimensional theory of
interpersonal behavior. New York: Rinehart, 1958.

Shoben, E. Toward a concept of normal person-
ality. Amer. Psychologist, 1957, 12, 183-189.

Whitehorn, J. C. Stress and emotional health.
Amer. J. Psychiat., 1956, 112, 773-781.




(Received July 7, 1959) 




Journal of Consulting Psychology
1960, Vol. 24, No. 4, 294-298


EGO STRENGTH AND CONFLICT DISCRIMINATION:
AN EXPERIMENTAL CONSTRUCT VALIDATION
OF THE EGO STRENGTH SCALE¹


MAURICE KORMAN 
University of Texas Southwestern Medical School 


Barron’s Es scale was validated against 
the predictive criterion of successful outcome 
in psychotherapy (Barron, 1953). Because of 
the method of its construction, its content 
and correlates, it is suggested that it meas- 
ures ego strength, a presumably latent trait 
often obscured by the vividness of psycho- 
pathology. Research with the Es scale has 
tested its criterion validity by examining its 
predictive and postdictive power (Wirt, 1955, 
1956) and investigated its construct validity 
principally by computing its correlation with 
indices based on other psychometric instru- 
ments such as the Rorschach and the Bender- 
Gestalt (Tamkin, 1957; Tamkin & Klett, 
1957). This type of “psychometric” construct 
validation presents a number of difficulties.
First, such tests frequently can marshal but 
little empirical or theoretical evidence for 
being acceptable operational specifications of 
the construct in question, a drawback not 
remedied by a plethora of low but positive 
correlations (see Jenkins & Lykken, 1957). 
Secondly, as Campbell and Tyler (1957) 
have pointed out, intertest correlations may 
represent common but construct-irrelevant 
features (e.g., response sets). They add: 
“Construct validation is epitomized by the 
correlation between two or more specifica- 
tions of a construct maximally different in 
apparatus or method” (p. 91). In Cronbach’s 
terms (1957), the present study attempts to 
incorporate two basically divergent modes of 
approach, the correlational and the experi- 
mental. 


1 This research was carried out at the Psychopathic
Hospital of the State University of Iowa.
Carl Bickert’s help in running subjects is gratefully 
acknowledged. 

A version of this paper was given at the Annual 
Meeting of the APA, Cincinnati, 1959. 


Because the Es scale is empirically defined,
it is especially desirable for other specifica- 
tions of ego strength to be linked to a prom- 
ising theoretical network. A congruence be- 
tween such measures would then go beyond 
indicating that certain face valid experimen- 
tal procedures relate to the construct inter- 
pretation of the test; it would serve as a 
bridge relating the psychometric device to 
an existing theoretical network. Dollard and 
Miller’s explication of ego strength in terms 
of higher mental processes (1950) was taken 
as the starting point. The components of 
higher mental processes are many; princi- 
pally, they include the ability to appropri- 
ately mediate discrimination, generalization, 
transfer, learned drives, and reinforcement. 
In this study the capacity of an individual 
to resolve a discrimination conflict was chosen 
as the experimental index of ego strength. 
The reasons for this choice were twofold: 
first, there is an extensive literature beginning 
with Pavlov (1927) on the conflictful characteristics
of difficult discriminations; secondly,
since the increased ability to solve conflicts 
and to make appropriate discriminations are 
both among the commonly accepted goals of 
psychotherapy, a more felicitous parallel be- 
tween the experimental procedures and the 
Es scale is achieved. In addition, the parameters
of discrimination problems are sensitive
to experimental manipulation; one can readily
increase the conflict loading of a discrimination
task by placing the subject (S)
under instruction to be as fast as he can
while at the same time informing him of the
irrevocable nature of his decision once it
is made.
The hypothesis was therefore formulated
that a high Es group should be able to resolve
discrimination conflict more speedily than a
low Es group. The differences between the 
Es groups should be greatest for the condi- 
tion of extreme discrimination difficulty (the 
most conflict provoking) and decrease as the 
discrimination becomes easier. In the case of 
easiest discrimination some general disruption 
of performance might still be expected because
of the generalization of conflict
(Brown, 1942), although the prediction here 
is less clear. 


PROCEDURES 


A discrimination situation involving the method 
of constant stimuli was used, patterned after Fest- 
inger (1943) and Block and Petersen (1955). The Ss 
sat in a darkened room approximately 4½ feet away
from a black box, behind which were two lights 
placed about 10 inches apart and shining through 
holes 9 of an inch. The top light (standard stimulus)
was a 7½-watt opaque bulb, while the variable
light was a regular 40-watt white bulb. The voltage 
applied through the variable light at 7 different rheo- 
stat positions was 60, 70, 78, 80, 82, 90, and 100 
volts, respectively. In addition the voltages were 
chosen in such a way as to make the two lights 
nearly equal in intensity when 80 volts were applied 
at the variable light. The S sat behind a table with 
his preferred hand on a toggle. He was instructed to 
make a judgment as to whether the variable light 
was brighter or dimmer than the standard light and 
to indicate his choice by pushing the toggle in the 
appropriate direction. In order to increase the diffi- 
culty of the task, a .5-sec. interval occurred between 
the standard and the variable light, at the end of 
which a clock was activated which ran until the S 
pushed the toggle to the left or to the right. He was 
further told: “I’d like you to make up your mind 
as soon as it’s possible for you but keep in mind 
that once you’ve pushed the lever up or down you 
can’t change your mind again.” 

Each of the seven intensities of the variable stimulus
was presented a total of 15 times by using three
successive random orderings of the stimuli and by 
presenting this complete set of 21 trials five times. 
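
The presentation schedule just described can be sketched in code. This is only a minimal illustration under stated assumptions, not the author's procedure: the function names, the fixed seed, and the use of Python's `random` module are hypothetical choices made for the example. Only the seven voltages, the three successive random orderings per 21-trial set, and the five presentations of that set come from the text.

```python
import random
from collections import Counter

# The seven voltages applied to the variable light (from the text).
VOLTAGES = [60, 70, 78, 80, 82, 90, 100]


def build_set(rng):
    """Build one 21-trial set: three successive random orderings of the 7 voltages."""
    trials = []
    for _ in range(3):
        order = VOLTAGES[:]      # copy so the master list is left untouched
        rng.shuffle(order)       # one random ordering of the seven stimuli
        trials.extend(order)
    return trials


def build_session(seed=0, presentations=5):
    """Present the same 21-trial set `presentations` times in succession."""
    rng = random.Random(seed)    # the seed is an assumption, used for reproducibility
    trial_set = build_set(rng)
    return trial_set * presentations


session = build_session()
counts = Counter(session)        # number of presentations per voltage
```

Under these assumptions the session contains 105 trials, and every voltage appears exactly 15 times, which matches the count given in the text.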


Subjects 


Forty-seven recently admitted psychiatric in- 
patients were used as Ss. Excluded were patients 
with a diagnosis or suspicion of organic brain dis- 
ease, patients undergoing some type of radical thera- 
peutic interference such as electroshock or insulin, 
and patients too uncooperative, agitated, or withdrawn
to participate. The Ss (31 women and 16
men) were administered the MMPI and their Es 
scores obtained through the Hathaway and Briggs 
norms (1957). The entire range of Es scores rather 
than the extremes was used in this study so as to 
make it a more severe test of the scale’s power. 
Because of the sex-linked speed elements of the 
task, the male and female Ss were considered
separately. A cutting score (T score) of 41, the 
median, was used to separate high and low female 
Es groups. Since the Es scores for the smaller sample 
of males appeared to be markedly skewed, the same 
cutting score was adopted for them.2


RESULTS 


Prior to any analysis of the data, rules
were laid down for the identification of the
“easy,” “intermediate,” and “difficult” conflict
situations based on Ss’ individual performances.
The criterion used involved each
S’s consistency in labeling any one of the 
seven intensities of the variable stimulus 
brighter (or dimmer) than the standard 
stimulus. Those intensities of the variable 
stimulus which were most consistently seen 
as brighter (or dimmer) and which were 
furthest away from the point of objective 
equality (the middle intensity) were chosen 
as representing the least difficult discrimina- 
tion for that S. Likewise, the most difficult 
discriminations were those which were most 
inconsistently seen as brighter (or dimmer) 
and which were nearest to the point of ob- 
jective equality. The third category, that of 
intermediate difficulty, was defined by choos- 
ing those intensity positions which fell be- 
tween the easy and difficult discriminations 
with reference to the criteria outlined above. 
Each of these three difficulty levels was repre- 
sented by two intensities, 15 judgments being 
given to each. The mean of the 30 prejudg- 
ment latencies at each difficulty level was the 
basic data of this study. That no distortion 
of the data occurred through these procedures 
is shown by the fact that extreme positions 
were chosen as easy discrimination cells while 
the center positions tended to become diffi- 
cult discrimination cells; the high and low 
Es groups did not differ in this respect. 

As can be seen from Table 1 the high Es
group holds a few more psychotics and is
slightly older than the low Es group; these
differences are far from significant (for diagnosis,
chi square = .18; for age,
t = .83 with 43 df).


2 Due to circumstances beyond control, fewer subjects
than desired were run. The statistical analyses
were performed on females and males separately,
the latter group serving essentially as a small replication
sample.

TABLE 1
CHARACTERISTICS OF HIGH AND LOW Es GROUPS

Variable                    High Es    Low Es
Diagnosis a
  Psychotics                  40%        33%
  Neurotics                   28%        33%
  Personality Disorders       32%        33%
Age
  Mean                       31.85      28.32
  SD                         13.90      13.39
Es
  Mean                       52.25      31.10
  SD                          7.75       5.76

a Based on N = 43; diagnosis on four patients was deferred.


The mean latency scores of the high and 
low Es groups on easy, intermediate, and dif- 
ficult discriminations are shown in Fig. 1. 
The low Es group has a longer latency 
throughout, although the difference becomes 
more marked at the intermediate and difficult 
levels. Figure 1 combines male and female 
data because of the remarkable similarity of 
their separate curves within each Es group. 

In order to test the significance of the 
differences between the curves of Fig. 1, ex- 
clusive of the easy discrimination category, 
the log transformed latency scores were sub- 
jected to separate analyses of variance for 
males and females after it had been con- 
cluded, by means of the Bartlett test, that 

Fig. 1. Mean prejudgment latencies of Es groups for
increasing levels of discrimination difficulty (ordinate: log latency × 10;
abscissa: discrimination difficulty, from easy through intermediate to difficult).

the assumption of homogeneity of variance 
was not being violated. In order to compen- 
sate for the correlation introduced by re- 
peated measurements on the same Ss, the 
analysis suggested for such cases by Edwards 
(1951) was used. Table 2 reports these re- 
sults: in the case of both males and females 
there is a significant main effect between Es 
groups (p = .03 for the females; p =
.06 for the males). The low ego strength
patient takes longer in resolving the discrimi- 
nation conflict than does the high ego 


TABLE 2

ANALYSES OF VARIANCE FOR REPEATED MEASURES OF LOG TRANSFORMED LATENCY SCORES
FOR HIGH AND LOW Es GROUPS

                               Females                       Males
Source                   df   Mean Square    F        df   Mean Square    F
Between groups            1     .5618      4.94**      1     .3298      4.34*
Between Ss same group    29     .1138                 14   [illegible]
Between trials            2     .0674      5.14**     [illegible]
Trials × Groups           2     .0075                 [illegible]
Pooled Ss × Trials       58     .0131                 [illegible]

* p = .06.
** p = .03.


TABLE 3

RESULTS OF t TESTS BETWEEN HIGH AND LOW
Es GROUPS AT THREE LEVELS OF
DISCRIMINATION DIFFICULTY

                              t
Discrimination      Females      Males
Difficulty          (29 df)     (14 df)
Easy                 1.05        1.78*
Intermediate         2.13**      1.87*
Difficult            2.10**      2.07*

* p < .10.
** p < .05.


strength patient. The differences between the 
two levels of discrimination difficulty were 
significant at the .03 level for the females 
but did not differ significantly for the males. 

As Fig. 1 suggests, there is a disproportion- 
ate increase in latency for the low Es group 
as the discrimination difficulty moves from 
easy to intermediate and difficult as compared 
to the nearly linear relationship exhibited by 
the high Es group; the differences between 
easy and intermediate discriminations are 
significantly different for the two groups 
(p < .02) as are the differences between easy
and difficult discriminations (p < .03). The 
groups do not differ, however, in their incre- 
ment from intermediate to difficult discrimi- 
nations. These findings generally received 
support from a comparison of the Es groups’ 
performance on the three levels of discrimi- 
nation difficulty. Table 3 reports the obtained 
t’s for males and females separately. In the
case of the females, while the high and low 
Es groups do not differ on the easy discrimi- 
nation, they differ significantly on both the 
intermediate and difficult levels of discrimina- 
tion difficulty (p < .05). This trend is ob- 
scured somewhat in the case of the males 
because of the smaller N; the male high and 
low Es groups differ at the .10 level only with 
reference to all three levels of discrimination 
difficulty. 

DISCUSSION 


This study corroborates the findings of 
Tamkin (1957) and Tamkin and Klett 
(1957) with regard to the Es scale’s inability 
to distinguish degrees of psychopathology as 
discriminated by psychiatric diagnosis. Contrary,
however, to Tamkin and Klett’s view
that this is a “serious challenge to its valid- 
ity,” the present writer would suggest that 
this lack of association with overt psycho- 
pathology is a predictable and necessary by- 
product of the Es scale’s construction which 
ignored the initial psychiatric status of the 
patients in favor of their treatment outcome. 
The difficulty, of course, lies in the surplus 
meanings of the term “ego strength” which 
lead to paradoxical conclusions. 

The hypotheses concerning the relationship 
of the Es scale to discrimination conflict were 
generally borne out. In two separate samples, 
patients with low Es scores (a measure of 
ego strength, psychometrically defined) took 
longer than high Es patients to resolve a dis- 
crimination conflict (a measure of ego 
strength, experimentally defined). The la- 
tency between stimulus and response had 
been chosen as the dependent variable in this 
study because of its sensitivity to conflict in- 
structions. The question arises as to whether 
the low Es group also made more errors in 
the successive discrimination judgments than 
the high Es group. Comparing the two groups 
on their total number of errors for the six 
intensities of the variable stimulus (leaving 
out the objectively “equal” center set) yields 
a nonsignificant t of .14 with 45 df. Since
there is no a priori reason for suspecting per- 
sonality variables to be related to the judg- 
ments in a psychophysical exercise, the null 
hypothesis can reasonably be accepted. 

The Es scale’s empirical derivation in the 
context of psychotherapeutic outcome had 
previously suggested a tentative identification 
of its underlying construct with ego strength, 
independently of any consideration of ego 
strength as one of the higher mental processes 
in the Dollard and Miller sense. The results 
of this study suggest that the “theory” under- 
lying the Es scale is one which appropriately 
includes the construct ego strength as it is de- 
fined in the Dollard and Miller framework. 
Reciprocally, it is also possible to look upon 
these data as lending validity to an explica- 
tion of ego strength patterned after Dollard 
and Miller and couched in terms of appropri- 
ateness of discrimination and conflict resolu- 
tion. This can be maintained because part 
of the variance in such behaviors can be accounted
for by the variance of Es scores
which are in turn associated with personality 
characteristics that make for a better “ther- 
apy bet.” 

With reference to the postulated relation- 
ship between Es scores and levels of discrimi- 
nation difficulty, the results are likewise in a 
predicted direction although not in all re- 
spects. As the discrimination becomes more 
difficult, the low Es S’s performance becomes 
disproportionately worse, while the high Es 
S’s latency increases in a more stable, linear 
fashion. However, as Fig. 1 shows, the low 
Es S’s latency on the most difficult discrimi- 
nations is not very much greater than his 
latency for discriminations of intermediate 
difficulty. A number of possibilities present 
themselves. Low Es Ss possibly make only the 
distinction between quite easy and all other 
discriminations; this would suggest another 
type of inappropriate generalization on their 
part. It is also not improbable that their 
latency on discriminations of intermediate 
difficulties is close to the asymptote they can 
reach under the particular instructions given, 
i.e., to be both speedy and correct, and that 
a further increase in the difficulty of the dis- 
crimination has little effect. 


SUMMARY 


As a test of the construct validity of Bar- 
ron’s Es scale, 47 psychiatric inpatients were 
placed in a discrimination situation based on 
the method of constant stimuli. The hypothe- 
ses were (a) that a high Es group should be 
able to resolve a discrimination conflict more 
speedily than a low Es group and (b) that 
group differences should increase as the diffi- 
culty of the discrimination becomes greater. 

The hypotheses were generally supported. 
The construct which the Es scale measures 
is related to ego strength as defined in the
Dollard and Miller framework in terms of 
capacity for appropriate discrimination and 
conflict resolution. 


THE DURATION OF THE ARCHIMEDES SPIRAL
AFTERIMAGE IN THE DIAGNOSIS OF
BRAIN DAMAGE 1


JEROME D. SCHEIN 
Florida State University 


Since Price and Deabler (1955) introduced 
the Archimedes spiral afterimage as a tech- 
nique for differentiating brain damaged pa- 
tients from other neuropsychiatric patients, 
there has been an increasing accumulation 
of evidence suggesting that the diagnostic 
efficiency of this test is far below that which 
clinicians would find useful as a routine 
screening device (Berger, Everson, Rutledge, 
& Koskoff, 1958; Gilberstadt, Schein, & 
Rosen, 1958; Goldberg & Smith, 1958; 
Philbrick, 1959). However, these investiga- 
tors have relied on a pass-fail criterion for 
each trial, requiring only that the subject 
report whether or not he sees the negative 
afterimage (henceforth referred to as AIR 
for “afterimage report”). On the other hand, 
attempts to explain the failure of brain in- 
jured patients to report this phenomenon 
have implied that damage to the cortex re- 
duces the duration of the afterimage (hence- 
forth referred to as AID for “afterimage 
duration”) with complete failure to report 
the afterimage representing an endpoint on 
the duration continuum. 

Three theories have recently been proposed 
to account for the tendency of brain damaged 
individuals to report the negative afterimage 
induced by visual stimulation less frequently 
than patients with intact cerebrums. Saucer 
(1953) and Saucer and Deabler (1956) con- 
sider the perception of the afterimage to be 


1 This research is a portion of a PhD thesis submitted
to the Department of Psychology, University
of Minnesota. The author is indebted to William
Schofield, Ephraim Rosen, Paul E. Meehl, and David
T. Lykken of his dissertation committee for guidance
and criticism. Appreciation is also due the staff of
the Minneapolis VA Hospital for their unfailing cooperation
and encouragement.


a function of a global effort of the entire 
cortex. Applying a principle of mass action, 
they conclude that brain injury would fore- 
shorten AID. Klein and Krech (1952) use 
the Pavlovian principle of inhibition to sup- 
port their position that brain injury reduces 
AID, though their own research involved 
tactual aftereffects. Shapiro (1954), dealing 
directly with the perception of apparent 
movement, also applies the concept of inhibi- 
tion in predicting that brain injury will fore- 
shorten AID. 

Though the structures of their theories differ
somewhat, these theorists are unanimous
in predicting the effect of brain damage on
AID. They provide the basis for avoiding the
embarrassment of an all-or-none explanation 
of the earlier studies; i.e., one can account 
for brain damaged individuals who do give 
an AIR by assuming that they have a reduced 
AID, albeit not reduced below their percep- 
tual limen. Thus, because duration is reduced, 
brain injured individuals should give fewer 
AIRs. 

The empirical evidence on spiral AID is
mixed. Eysenck, Holland, and Trouton 
(1957) asserted that some depressant drugs 
have effects isomorphic with brain injury, in 
that both increase inhibitory potentials in 
the brain. When they administered a placebo, 
a stimulant, and a depressant drug to 6 nor- 
mal subjects (Ss), they found that the de- 
pressant drug (sodium amylobarbitone) sig- 
nificantly reduced AID, though the stimulant 
did not increase it. They felt that this result 
supported those theories of brain functioning 
which make use of the concept of cortical 
inhibition. 

Three studies of brain injured patients 
have included some observations on AID.
Gallese (1956) noted that the false negatives 
(brain damaged patients who did report the 
afterimage) seemed to experience a less 
marked and apparently shorter AID than 
control patients. Page, Rakita, Kaplan, and 
Smith (1957) stated that they found no dif- 
ferences in AID between brain damaged and 
schizophrenic patients for two of the trials 
on which it was measured. In complete con- 
tradiction to these two studies, Spivack and 
Levine (1957) found that their brain dam- 
aged subjects reported longer AIDs than did 
a group of neurotic controls. 

It is apparent that, while the theorists are 
in complete agreement, the small amount of 
evidence available is not. It was the purpose 
of this study to gather further data to help 
clarify the situation. Furthermore, if the 
theorists cited were correct, measuring the 
duration of the afterimage should increase the 
diagnostic efficiency of the test by identifying 
those patients whose brain injury does not 
reduce the afterimage to zero duration. These 
false negatives would be identifiable by their 
markedly reduced AID, even though they 
could report an afterimage. 


METHOD 
Subjects 


An ongoing sample of all male veterans admitted 
to the Neurology and Psychiatry services of the 
Minneapolis Veterans Administration Hospital were 
tested within five days of their admission. This 
sampling procedure was adopted to control for 
chronicity of illness, eliminate patients undergoing 
shock therapy, and reduce the number of patients 
on a drug regimen. This method of selecting the 
sample also had the advantage of reducing errors 
which might have arisen had the experimenter had 
knowledge of the patient’s diagnosis at the time
of testing. 

Of 129 cases admitted to the Neurology service 
31 could not be tested: 3 expired, 17 were confined 
to their beds, 8 were too confused, 3 were discharged 
before testing. Seventeen additional cases were lost 
due to apparatus failure which occurred during the 
time the sample was being drawn. Of 50 admissions 
to the Psychiatry service, 5 could not be tested: 
4 were confined to their beds, and 1 was discharged 
before testing. 


Diagnosis 


All diagnoses were final diagnoses arrived at by 
staff action. All cases having a diagnosis indicating 
central nervous system damage rostral to the foramen
magnum were further classified by a boarded
neurologist 2 who determined whether or not the
injury included damage to the cerebral cortex; if
it did not, the case was classified under subcortical brain
damage. The psychiatric patients were divided into 
those having a psychotic diagnosis and those having 
a nonpsychotic diagnosis. Three cases who were 
tested left the hospital before diagnostic work-up 
could be completed. They are listed as “equivocal” 
in the section to follow. The remaining patients, 
being neither brain damaged nor psychiatrically ill, 
constituted the normal hospitalized control sample. 


Drugs 


Each patient’s medication at time of testing was 
noted from the order sheet in his hospital chart. 
Two physicians and a pharmacologist 3 then sorted 
the various drugs administered to patients in this 
study into four categories in accordance with their 
effect on the central nervous system within the range 
of therapeutic dosage. Drug Group 1 consisted of 
drugs having no effect on the central nervous system. 
Drug Group 2 contained all central nervous sys- 
tem depressants. Drug Group 3 included all central 
nervous system stimulants. Drug Group 4 consisted 
of any combination of drugs in Categories 2 or 3 
which were being simultaneously administered to a 
patient. 


Apparatus 


An Archimedes spiral of 920° or 2½ turns about
its center was constructed by painting a black spiral 
on a white disc 6 inches in diameter. The black and 
white lines were each ½ inch in width. The wheel
was affixed to a reversible motor with a variable 
speed control which was set at 100 rpm. 


Procedure 


The S was seated 8 feet from the spiral. The 
illumination at the surface of the spiral was 16 foot 
candles, measured by a GE Exposuremeter, Model 
8DW. 

Two trials, one clockwise and one counterclock- 
wise, were administered in accordance with Price 
and Deabler’s (1955) instructions. Scoring was done 
in accordance with the rules established by Gilber- 
stadt, Schein, and Rosen (1958). If the patient 
failed on one or both of these trials to report the 
negative afterimage, he was given two additional 
trials. At the end of stimulation on these trials he 
was told to look at an orange wooden block placed 
next to the apparatus and to describe what he saw.
If he then reported a negative afterimage correctly
on two successive trials (i.e., the block appeared
to move, grow, or change), he was administered the


2 Grateful acknowledgement is made to Zondal
Miller, consulting neurologist, for sorting these patients.

3 Grateful acknowledgement is made to Henry A.
Johnson, Fred Miller, and A. Kleven for their assistance
in classifying the drugs.


trials measuring duration. If he did not, he was
given two further trials using a black circle drawn 
on white cardboard as the projection device. A pa- 
tient who failed to report a negative afterimage on 
two successive trials of the six was considered to be 
diagnosed “brain damaged” by the test and no further
testing was done.

As soon as two consecutive AIRs were given, the 
patient was instructed as follows: 


What you just saw is called an afterimage; i.e.,
the line (block, circle) seemed to be moving even
though you knew it wasn’t. People with normal
vision see these afterimages, so, in this sense your
eyes are normal. Now I want to know how long
the afterimage lasts. To find this out, I will run
the wheel several times just as before. But this
time when it stops I want you to look at this
block (circle). If you notice anything change, tell
me when the change stops. If it doesn’t change
in any way, tell me that, too. I have this very
accurate stop watch here, so I can tell just how
long your afterimage lasts. Any questions? All
right now, keep your eye on the wheel until I
turn it off. Then immediately look at the block
(circle). As soon as the block (circle) comes to
a complete stop, you say “Now.”


Eight trials were administered in the following
order:


Direction           C         CC
Stimulation Time    15 sec.   30 sec.
Direction           CC        C
Stimulation Time    15 sec.   30 sec.
Direction           C         CC        C         CC
Stimulation Time    30 sec.   15 sec.   15 sec.   30 sec.


(C = clockwise; CC = counterclockwise)


The variations in time and direction were intro- 
duced to discourage a fixed set to say “Now” at 
the same time following each stimulation, regardless 
of what may have been perceived. 

At the conclusion of the fourth trial patients 
whose first afterimages had been passed off on the 
block were instructed to look at the circle following 
stimulation and vice versa. Whether a patient was 
exposed to the block or the circle on his first four 
trials was determined by a letter which had been 
written alternately on each scoring sheet prior to 
testing. 

Two test objects were used (a) to provide variety 
in the situation to help keep the patient’s attention 
and (b) to test the hypothesis that brain damaged 
individuals would have longer AIDs for the three- 
dimensional block than for the two-dimensional 
circle. The latter notion was suggested by the fact 
that brain damage is thought to reduce the indi- 
vidual’s ability to deal with the world in an “ab- 
stract” fashion, thus the three-dimensional block 
might be more logically capable of motion than the 
two-dimensional circle.

Duration was measured by allowing a split- 
hundredth watch to run continuously from the start 
of the trial until the patient indicated the afterimage 
had stopped. The AID was determined by subtract- 
ing the stimulation time from the total time elapsed. 
This method reduced watch-reading and “snap- 
back” errors. 
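
The subtraction used to obtain each AID can be illustrated with a short sketch. It rests on stated assumptions: the function names and the sample watch readings are hypothetical, and the conversion of the summed durations to minutes follows the unit in which Table 1 reports total AID. It is not a record of the author's actual scoring sheet.

```python
def afterimage_duration(total_elapsed_sec, stimulation_sec):
    """AID for one trial: total watch time minus the stimulation time."""
    if total_elapsed_sec < stimulation_sec:
        # The watch runs from the start of the trial, so it cannot read
        # less than the stimulation period itself.
        raise ValueError("elapsed time is shorter than stimulation time")
    return total_elapsed_sec - stimulation_sec


def total_aid_minutes(trials):
    """Sum the AIDs over (elapsed, stimulation) pairs and convert to minutes."""
    total_sec = sum(afterimage_duration(t, s) for t, s in trials)
    return total_sec / 60.0


# Hypothetical watch readings for two trials (15-sec. and 30-sec. stimulation).
example_trials = [(22.5, 15.0), (38.0, 30.0)]
example_total = total_aid_minutes(example_trials)
```

In this illustration the two trials give durations of 7.5 and 8.0 sec., so the total is 15.5 sec. expressed in minutes.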

The interval between trials was maintained at 15 
sec. from the time the patient indicated the afterimage
had ceased. On the AIR trials, intertrial time
was 30 sec.

Patients who failed entirely to report an afterimage
or who reported less than 5 AIDs during the
8 trials measuring AID were not included in the
calculations of the durations, on the grounds that
their perception was so incomplete (or their ability
to report so poor) that the time values they gave
were of little value. The prior decision had been
made to classify these patients as having been diagnosed
brain-damaged by the test.


RESULTS 


The distribution of total AID scores (the 
sum of the durations reported for the eight 
timed trials) is shown in Fig. 1. Only those 
patients who reported five or more AIDs for 
the eight timed trials were included in the 
analysis. The number of cases who were excluded
for this reason is listed in Table 1
under the heading “Less than Five Reports.” 
Only 7 of the 123 cases failed to give an 
AIR in two consecutive pretrials, and only 3 
of the remaining 116 cases did not report 
negative afterimages for at least five of the 
eight timed trials. The 3 equivocal (undiagnosed)
cases are not included in the following
analyses since they could not contribute any
information to the diagnostic ability of the
spiral test.

Fig. 1. Distribution of total AID (8 trials)
by diagnostic groups.

TABLE 1
MEAN AGE, INTELLIGENCE, AND TOTAL AID (IN MINUTES) FOR ALL DIAGNOSED CASES

                                                              Less than
Diagnosis               N      Age       IQ        AID      Five Reports
Brain damaged           53    42.78    106.36     .9911          8
  Cortical              45    43.15    105.33     .9918          6
  Subcortical            8    40.33    113.00     .9867          2
Psychiatric             46    37.85    110.64     .9758          1
  Neurotic              34    37.06    110.76     .9309          1
  Psychotic             12    40.00    110.33    1.0992          0
Hospitalized Normals    24    50.35    106.78     .9152          1
Total                  123    42.35    108.15     .9696         10

Table 1 summarizes the mean total AIDs, 
ages, and IQs for each diagnostic category. 
An analysis of variance of IQs indicated that 
the groups did not differ significantly on this 
variable. The product-moment correlation
between total AID and IQ was .034, which
was not significantly different from zero. The
analysis of variance of age, however, did show
that the diagnostic groups differed significantly
(p < .01) on this variable. The product-moment
correlation between total AID
and age was found to be −.092 (p > .05),
thereby making a covariance adjustment for 
age unnecessary. 

The analysis of variance of differences in 
total AID score by diagnostic groups con- 
firmed what was apparent from inspection of 
Fig. 1. The three diagnostic categories do 
not differ significantly in the reported dura- 
tion of the afterimage. 


An analysis of variance in which the groups
. . . significant difference between
. . . Since only Drug Groups 1
and 2 were represented in each of the diagnostic
categories, only these two groups were
included in the analysis. None of the sources
of variation contributed a mean square
greater than that for individuals (within
variance) which served as the error term
(McNemar, 1955, pp. 304-306).

The lack of differences between diagnostic 
groups on total AID as a function of drugs 
might have resulted from too gross a grouping 
of the various depressant medications. Drug
Group 2 was, therefore, further subdivided 
into the more potent drugs (sparine, thora- 
zine, trilafon, vesprin, and compazine) and 
the less potent drug, meprobamate. Only the 
psychiatric group was included in this analy- 
sis, because there was no certain way to 
assess the effects of the anticonvulsant medi- 
cation which provided the main group of 
drugs prescribed for the brain damaged pa- 
tients. The latter patients’ AID scores were 
separately analyzed. 

The analysis of variance of AID scores for 
the psychiatric patients, comparing those who 
received central nervous system depressants 
with those receiving meprobamate and those 
receiving no centrally acting drug, yielded a 
nonsignificant F. The AID scores for the 28 
brain damaged patients diagnosed as having 
seizures were compared on the basis of 
whether or not they were on anticonvulsant 
medication at the time of testing. Again, the 
analysis of variance failed to indicate a significant
difference attributable to this factor.


The diagnostic groups had differed on yet 
another variable, order of presentation of the 
test object; i.e., whether the block or circle 
was used as a test object for the first four
duration trials. Table 2 presents the results
of the analysis of variance of order of presentation
by diagnostic groups. Again, diagnosis
was not significant, while order of presentation
of the test object did significantly affect
AID. The circle tended to be associated with
longer duration times than the block for all 
diagnostic groups. For those Ss to whom the 
circle was presented first, the mean ratio of
their last four to their first four duration
times was .76; for those receiving the oppo- 
site presentation the mean ratio was .96. 
Therefore, when the circle was used as a test 
object on the last four trials, there was a 
mean decrement of only 4% in AID reported; 
while, when the block was used on the last 
four trials, the mean decrement was 24%. 
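
The percentages follow directly from the two mean ratios. A minimal sketch of the arithmetic, assuming each ratio is the last-four duration divided by the first-four duration (the reading consistent with the reported decrements); the function name is illustrative and not part of the original study.

```python
def percent_decrement(ratio_last_to_first: float) -> float:
    """Percent drop in afterimage duration from the first four to the last four trials."""
    return (1.0 - ratio_last_to_first) * 100.0


# Mean ratios reported in the text for the two orders of presentation.
block_on_last_trials = percent_decrement(0.76)   # circle presented first
circle_on_last_trials = percent_decrement(0.96)  # block presented first

print(f"block on last four trials:  {block_on_last_trials:.0f}% decrement")
print(f"circle on last four trials: {circle_on_last_trials:.0f}% decrement")
```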

The possibility remained that order of 
presentation of the test objects might have 
affected the total AID; and, since the num- 
ber in each diagnostic category receiving the 
two orders of presentation were not equally 
distributed, this might have contributed to 
the failure of the AID to distinguish between 
the groups. However, an analysis of variance 
showed that the total AID did not differ 
significantly between those presented with the 
block for the first four trials as against those 
presented with the circle for the first four 
trials. 

Another factor which might have been of 
value in distinguishing between the diagnostic 
groups was stimulation time. A briefer stimu- 
lation time might reduce AID more for brain 
damaged than for other patients. The analysis 
of variance for each of the stimulation times 
showed that neither the 15-sec. nor 30-sec. 
period was successful in significantly differentiating
the diagnostic groups.

Stimulation time itself, however, was sig- 
nificantly related to the AID. The mean total 


TABLE 2

ANALYSIS OF VARIANCE OF EFFECT OF ORDER
OF TEST PRESENTATION BY DIAGNOSTIC
GROUPS ON TOTAL AID

Source         df    F
Order           1    9.9396*
Diagnosis       4     .5202
Interaction     4     .0896
Individuals   101    (1,096.77)a

* Significant at less than .01 level.
a Error variance.

TABLE 3

DISTRIBUTION OF NUMBER OF FAILURES
TO REPORT AFTERIMAGES

Frequency     Brain Damaged   Psychiatric   Normal
of Failure    (N=53)          (N=46)        (N=24)
6+             5               1             1
5              0               0             0
4              3               0             0
3              1               1             0
2              1               1             0
1             10               1             2
0             33              42            21


AID following 15-sec. stimulation was .4306 
min. and following 30-sec. stimulation was 
.5388 min. The critical ratio of this difference
was significant at less than the .001 level. 
Turning from the analysis of AID, the 
number of zero responses was examined for 
possible diagnostic significance. The distribu- 
tion of number of times that patients reported 
no afterimage is given in Table 3. All cases 
except the three having no diagnosis are listed. 
The category “6+” contains those patients 
who either reported no afterimage for six or 
more trials determining the duration of the 
afterimage or who did not report two con- 
secutive afterimages in the pretrials. When 
the number of zero reports was dichotomized 
between none and one or more, a chi square 
of 13.69 was obtained (p < .01, df = 2). In 
percentage terms, 37.7% of the brain dam- 
aged cases reported one or more failures to 
perceive the afterimage on either the initial 
or duration trials. This figure compares with 
8.7% of the psychiatric group and 12.5% 
of the hospitalized normals who failed to 
report an afterimage one or more times. 
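
The dichotomized comparison can be checked from the Table 3 counts. The sketch below is an illustrative recomputation, not the authors' procedure: it assumes an ordinary Pearson chi square without continuity correction, and it takes the normal group's count of 3 failures from the reported N of 24 and the 12.5% figure.

```python
def chi_square(table):
    """Pearson chi square for a rows x columns table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, df


# Counts per group: [no failures, one or more failures].
observed = [
    [33, 20],  # brain damaged (N = 53)
    [42, 4],   # psychiatric (N = 46)
    [21, 3],   # hospitalized normal (N = 24)
]

statistic, df = chi_square(observed)
failure_rates = [100 * fails / (none + fails) for none, fails in observed]

print(f"chi square = {statistic:.2f}, df = {df}")
print([f"{rate:.1f}%" for rate in failure_rates])
```

Under these assumptions the statistic comes out close to the reported 13.69 with 2 degrees of freedom, and the three failure rates match the percentages given in the text.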


Discussion 


The results of this investigation fail to 
support any of the theories which reason that 
brain damage causes a decrease in cortical 
efficiency or an increase in cortical inhibition 
—either of which would result in the reduced 
duration of the afterimage. The failure of 
the results to satisfy the predictions of such 
theories cannot be laid to differences between 
the criterion groups in age, intelligence, drug 
status, or gross differences in diagnosis (i.e., 
psychosis vs. neurosis, cortical vs. subcortical 
brain damage). Neither age nor IQ correlated
significantly with AID, nor did depressant 
drugs have a different effect on AID than 
no drug when patients with similar diagnoses 
were compared. 

The finding of Eysenck et al. (1957) that depressant
drugs would foreshorten the AID
was not confirmed by this study. However, 
their experimental Ss were without any 
known pathology, and the drug’s effect on 
the S’s behavior was measured shortly after 
a single large dose had been administered. 
The patients in this investigation were se- 
lected because they were ill in one way or 
another, and those who received depressant 
drugs were given them in doses spaced over 
a period of time, with the elapsed time be- 
tween drug ingestion and test administration 
being uncontrolled. In addition, the depres- 
sant drug used by Eysenck et al. (1957) was 
a barbiturate, whereas in this investigation 
almost all of the patients in the depressant 
category were given ataractic or anticonvul- 
sant medication. That these factors are im- 
portant in assessing the effects of drugs is 
attested to by Lehmann and Csank (1957) 
who write: 


It is probable that the same drugs would produce 
differing results after the subjects had taken them 
regularly for a time because of tolerance or sensiti- 
zation . . . . It is also possible and even probable 
that severely disturbed patients will show reactions 
that differ considerably from the pattern observed 
in healthy control subjects (p. 231). 


According to Shapiro’s (1954) theory of 
cortical inhibition, the differences between the 
diagnostic groups should have been most 
marked following the longer stimulation pe- 
riod, since cortical inhibition would be 
aroused more quickly in the brain injured, 
depriving them of the increase in AID which 
would follow longer stimulation. Instead, it 
was found that differences in diagnostic status 
did not contribute to the significant difference 
in AID found between the 15-sec. and 30-sec. 
stimulation periods. The fact that amount of 
stimulation did affect AID, but not differently
for the diagnostic groups, leads to a further 
question as to the adequacy of cortical in- 
hibition and efficiency theories as now pro- 
posed. 

The average duration following 15-sec.
stimulation was 6.5 sec. and following the
30-sec. stimulation was 8.0 sec. Considering 
this brief period of time, the possibility of a 
sizeable error of measurement caused by indi- 
vidual differences in interpretation and exe- 
cution of the task is manifest. Some Ss might 
be expected to delay slightly before report- 
ing cessation of the afterimage, in order to be 
certain that they were reporting accurately. 
Others might report cessation even before it 
had occurred in hopes of bringing the testing 
to a close more quickly. With the duration 
period per trial being so small, idiosyncrasies 
in reporting would have a substantial effect 
on the results. With this thought in mind, it 
is interesting to note that the correlation 
between the AID following the four 15- 
sec. trials with those following the four 
30-sec. trials was .838, indicating a fairly high
degree of consistency in reporting the after- 
image. It would seem that whatever factors 
underlie the reporting of the AID they are 
relatively constant for each individual. 

That there was difficulty in interpreting 
the task is attested to by the fact that the 
AID associated with the circle as test object 
was significantly longer than that associated 
with the block. This finding suggests that two 
afterimages were involved, one resulting from 
stimulation by the rotating spiral and one 
due to the contrast of the black circle on the 
white ground. It is well established that pro- 
longed exposure to a highly contrasted, sta- 
tionary stimulus will result in a negative 
afterimage. It is likely that most of the pa- 
tients interpreted this afterimage as also 
being a part of the same phenomenon they 
were requested to report. The orange block 
against the neutral ground either did not 
produce this type of afterimage or did so to 
a lesser extent. 

The conclusion that duration of the afterimage
does not differ between brain damaged
and other diagnostic groups is consistent with 
the previously reported study by Page et al. 
(1957) and inconsistent with that of Spivack 
and Levine (1957). The former study involved
adults, while the latter sampled adolescents.
Page et al., unfortunately, did not
report their data on AID, other than to note 
that it was not significant. Spivack and Le- 
vine, however, report the mean durations for 
their brain damaged and control groups,
which were 20.2 and 14.8 secs., respectively. 
Either group has substantially longer AIDs 
than the 8.0-sec. average for the four 30-sec. 
stimulation periods of this study. One ex- 
planation for this difference can be found in 
the fact that they seated their Ss five feet 
from the spiral. At this distance, more rod 
vision is involved which should decrease the 
AID (Granit, 1928; Wohlgemuth, 1911). At 
the same time, however, it has been noted 
(Granit, 1927, 1928) that the duration of an 
afterimage produced in the periphery of the 
retina is less distinct and more difficult to 
measure even by highly trained Ss. It is pos- 
sible, then, that differences in sampling and 
in procedure may account for the inconsist- 
ency of the results found by Spivack and 
Levine and those obtained in this study. 

One further limitation of this study as well 
as the two mentioned above is that differ- 
ences, or lack of differences, in AID found 
between brain damaged and other groups 
may be the result of slowed reaction time 
in the brain damaged. The brain damaged 
patient may perceive the afterimage as ceasing
sooner but be slower to report this fact.
A method which circumvents verbal report 
would be necessary to resolve this question. 
One suggestion, currently being investigated, 
would be to use electroencephalographic re- 
cordings of occipital potentials as the measure 
of afterimage duration. If the stimulation by 
a rotating spiral results in changes in the 
occipital cortex, then it is likely that this 
change will result in a suppression of occipital 
alpha for a period following stimulation 
which is closely related to the perceived dura- 
tion of the afterimage. The use of such a 
measure would circumvent verbal report and 
permit a more direct study of this phe- 
nomenon. 

Turning to another explanation of these 
findings, it must be noted that, while AID 
did not differentiate the diagnostic groups, 
AIR did significantly separate them. Signifi- 
cantly more brain damaged individuals failed 
to report a negative afterimage one or more 
times than did the psychiatric or normal 
groups. Spitz (1958) has argued that failure 
to report the illusion is evidence for lack of 
an “abstract attitude.” Certainly there is 
considerable ambiguity in the question: What
does the line appear to be doing now? It is 
conceivable that some patients structure this 
query as referring to the objective condition 
of the spiral when the experimenter has 
turned it off. In other cases, failure to report 
the afterimage may be due to a suspicion on 
the part of the patient that the perception of 
the illusion is indicative of some kind of 
pathology. Thus, denial of the afterimage 
may be a symptom related to anosognosia, 
another symptom fairly common in brain 
damaged individuals (Tobis, Lowenthal, & 
Maringer, 1957). 

Since only 10 of the 123 patients tested did 
not report a negative afterimage on half or 
more of the trials, there is no question that 
the use of a test object increases the likeli- 
hood of AIR. That this is true suggests that 
some of the failures to report the afterimage 
may be due to confusion about the task. We 
may look upon this finding as lending further 
support to the alternative hypothesis that 
brain damage does not affect the perception
of the afterimage, though it may differentially
affect the frequency with which this phenomenon
is reported.


SUMMARY 


An ongoing sample of male veterans admitted
to the neurology and psychiatry services
of a general hospital was tested with the
Archimedes spiral. In addition to determining 
whether or not the patient could report the 
negative afterimage, the duration of the nega- 
tive afterimage was also measured. To de- 
crease the ambiguity of the task, two test 
objects were used on which the patient was 
instructed to pass off his afterimage. This 
technique greatly increased the number of Ss 
who reported the afterimage. Neither age, 
intelligence, drugs, nor diagnosis differenti- 
ated the patients with regard to the duration 
of the afterimage. However, when the num- 
ber of failures to report the afterimage on 
any of the 10 trials was used, it was found 
that the brain damaged patients could be 
separated from the psychiatric and hospital- 
ized normal patients at a highly significant 
level. It was suggested that these findings call 
into serious question theories of brain func- 
tioning which depend upon a cortical effi- 


A STUDY OF IDENTIFICATION IN MALE HOMOSEXUALS¹

In discussing the etiology of male homosexuality,
Freud (1952) has written:


The typical process . . . [in homosexuality] is
that a few years after the termination of puberty,
the young man, who until this time has been
strongly fixated to his mother, turns in his course,
identifies himself with his mother, and looks about
for love-objects in whom he can re-discover himself,
and whom he wishes to love as his mother loved
him (p. 240).


According to Freudian theory, therefore,
homosexuality in males is a manifestation of
an identification with the mother figure rather
than the usual identification with a father
figure. As a consequence of his twisted identification,
the homosexual seeks narcissistic
rather than anaclitic love-objects.

The purpose of the present study is to test
a consequence of this notion, namely, that
male homosexuality involves both a turning
toward the mother as a figure with whom to
identify and the turning away from the father
as an identification figure. Specifically, the
major hypothesis of this investigation was
that frankly homosexual males have a [...]
of parental identification such that they are
more strongly identified with their mothers
and, conjointly, more strongly disidentified
with their fathers than are males in a relevant
contrast group. This conjoint hypothesis of
course can be further partitioned into two
separate hypotheses of interest, [...]
comparison of the homosexual and nonhomosexual
subjects (Ss) with regard to their absolute
degree of identification with mother and
with father figures. The two subhypotheses
that follow from the major hypothesis are:
(a) overt homosexual males, when compared
to nonhomosexuals, will show a stronger identification
with their mothers, and (b) overt
homosexual males will show less identification
with their fathers than will members of
the control group.

1 This investigation, part of a larger research project,
was supported by research grant [...] from
the National Institute of Mental Health of the
United States Public Health Service.

The meaning of the Freudian term iden- 
tification has been the subject of some con- 
troversy (Sanford, 1955; Stoke, 1950). For 
research purposes, various operational trans- 
lations of the concept have been employed. 
The operational meaning of identification 
adopted for the purposes of the present study 
is based upon the following statement by 
Freud (1933): “. . . when a boy identifies 
with his father he wants to be like his father” 
(p. 90). According to Freud, at least in this 
late statement by him on the matter, identifi- 
cation with an object is said to have occurred 
when an individual strives to take on attri- 
butes which are those of that object as the 
individual perceives that object; this object 
is no more and no less than one’s ego ideal— 
the object “. . . towards which it . . . [the 
ego] strives” (Freud, 1933, p. 92). In this 
study, then, a measure of the extent of iden- 
tification with one’s mother or father follow- 
ing directly from this conceptual definition 
was derived by finding the degree of corre- 
spondence between the attributes of one’s 
ideal self, i.e., the person one would like to 
be, and the attributes of one’s mother or 
father, as described by the identifier. The 
higher the degree of correspondence between 
one’s ego ideal and one’s parent, the greater 
the identification with that parent. 


METHOD 


Two groups of Ss were used. One group, the 
experimental group, consisted of 20 overt homo- 
sexual males. These Ss were engaging in homosexual 
practices, i.e., most of them were involved in homo- 
sexual marriages. None of these Ss were receiving 
or, with the exception of one, had ever received 
psychotherapy. Thus, all of these Ss appeared to
have accepted their homosexuality, a most important
and perhaps unusual property of this sample. Their 
cooperation to serve as Ss was obtained through 
personal and necessarily confidential channels. These 
Ss were aware that they had been asked to participate
in the experiment because of their homosexual
practices but had not been informed of the purpose 
of the experiment. A second group of 20 males from 
the same urban community and equivalent with 
respect to age and education was used as a contrast 
group. Eight of these Ss were married, and twelve 
were still attending college. The mean age of the 
homosexual males was 31.9 years with a range from 
22 to 39, that of the contrast group, 29.7 years with 
a range from 22 to 39. The mean educational level 
was 14.5 years for the homosexual group and 14.9 
years for the contrast group. 

The measure of identification with parent was 
obtained through the use of a list of 79 adjectives 
selected to permit a comprehensive personality description.²
Each S was instructed to describe his
“ideal self” by the use of the adjectives, marking 
an X if he felt that the adjective was particularly 
characteristic of his ideal self or an O if it was 
particularly uncharacteristic. The S was allowed to
make only and exactly 30 X’s and 30 O’s in order 
to avoid the confusions engendered by response sets 
(Cronbach, 1950). Following this description, S was
required to describe, in the same manner, “your
mother,” “your father,” and “yourself (as you see 
yourself).” In order to facilitate independence of 
the several descriptions, each object description was 
made on a separate page of the experimental booklet. 

Of 35 booklets circulated into the available homo- 
sexual population, only 22 were returned. Of these, 
only 20 were found to be usable in the sense of being 
complete or correctly filled out. Data for the con- 
trast group then was collected by distributing book- 
lets among individuals who were similar to the 
homosexual group with respect to age and educa- 
tional level. Of 31 individuals requested to partici- 
pate in the experiment, only 21 were willing to do 
so. One of these 21 control Ss was arbitrarily elimi- 
nated from the subsequent data analysis in order to 
equalize the sample size. In view of the fractional 
returns from the populations sampled, the possibility 
of some kind of selective bias in the results should, 
of course, be recognized. 

For each S, a mother-identification score and a
father-identification score were obtained by comparing
his description of his ideal self with that of his
mother and with that of his father, respectively. For 
a given adjective, a score of one was given if it
received the same marking (X, O, or unmarked) on both the
ideal-self and parent descriptions. An identification
score was obtained by summing the number of these 
correspondences between the ideal-self-description 
and the parent description. The separate mother- 
and father-identification scores thus derived were 
used to test the two subhypotheses. For the major
hypothesis, that male homosexuality represents simultaneously
an idealization of mother together with
a disparagement of father, a derived score was obtained
by subtracting for each S his father-identification
score from his mother-identification score.
The score distributions for the mother-identification
and father-identification variables had quite equivalent
means and standard deviations (40.7 and 11.3;
41.0 and 11.5, respectively). Accordingly, it was legitimate
to calculate the derived score from the
original data distributions without going to the
trouble of standardizing the separate score distributions.
Thus, this derived score would be high if an
S’s mother-identification score was high and/or his
father-identification score was low. In addition, a
self-acceptance score was obtained by comparing the
ideal-self-description with the self-description, scoring
for correspondence in the manner described
above. These four scores provided the basic data
for analysis.

2 A copy of this list of adjectives is available by
writing to the authors.

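
The scoring rule described above can be sketched in a few lines. This is an illustration under stated assumptions, not the authors' materials: the six adjectives stand in for the 79-item list (which is not reproduced in the text), the four descriptions are invented, and the 30-X/30-O constraint is not enforced.

```python
# Hypothetical stand-in for the 79-adjective list.
ADJECTIVES = ["ambitious", "dependent", "frank", "relaxed", "restless", "sympathetic"]


def correspondence_score(description_a, description_b, adjectives=ADJECTIVES):
    """Count adjectives given the same marking in both descriptions.

    A description maps an adjective to "X" (characteristic) or "O"
    (uncharacteristic); an adjective missing from the mapping is unmarked.
    Agreement on X, on O, or on being unmarked each scores one point.
    """
    return sum(description_a.get(adj) == description_b.get(adj) for adj in adjectives)


# Invented descriptions from one hypothetical S.
ideal_self = {"ambitious": "X", "frank": "X", "dependent": "O", "restless": "O"}
mother = {"ambitious": "X", "frank": "O", "dependent": "O", "sympathetic": "X"}
father = {"ambitious": "O", "frank": "X", "relaxed": "X", "restless": "X"}
perceived_self = {"ambitious": "X", "frank": "X", "dependent": "X", "restless": "O"}

mother_identification = correspondence_score(ideal_self, mother)
father_identification = correspondence_score(ideal_self, father)
derived_score = mother_identification - father_identification  # high = closer to mother
self_acceptance = correspondence_score(ideal_self, perceived_self)

print(mother_identification, father_identification, derived_score, self_acceptance)
```

Because unmarked adjectives count as agreements, two descriptions that ignore the same adjectives gain points for doing so, which is one reason the fixed number of X's and O's matters for comparability across Ss.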
For each comparison of interest, the Mann-Whit- 
ney Test (1947) was used to calculate the signifi- 
cance of the difference between the scores obtained 
by the experimental and control groups. With the 
exception of the analysis of self-acceptance scores,
all the statistical results reported in this study are 
based on one-tailed tests of significance because the 
direction of difference was predicted in advance. 
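
A one-tailed Mann-Whitney comparison of this kind can be sketched as follows. The scores are hypothetical (the individual scores are not reported), and the p value uses the large-sample normal approximation without tie or continuity correction, which is an assumption of this sketch rather than a documented detail of the original analysis.

```python
from math import erf, sqrt


def mann_whitney_one_tailed(experimental, control):
    """Return U for the experimental group and a one-tailed p value.

    The alternative tested is that experimental scores tend to exceed
    control scores. The p value comes from the normal approximation to U.
    """
    n1, n2 = len(experimental), len(control)
    # U counts the pairs in which the experimental score is larger; ties count one half.
    u = sum(
        1.0 if x > y else 0.5 if x == y else 0.0
        for x in experimental
        for y in control
    )
    mean_u = n1 * n2 / 2
    sd_u = sqrt(n1 * n2 * (n1 + n2 + 1) / 12)  # no tie correction
    z = (u - mean_u) / sd_u
    p_one_tailed = 0.5 * (1 - erf(z / sqrt(2)))  # upper-tail area of the standard normal
    return u, p_one_tailed


# Hypothetical derived scores, for illustration only.
homosexual_group = [12, 5, 9, -2, 7, 3]
contrast_group = [-4, 1, -6, 2, -1, 0]

u, p = mann_whitney_one_tailed(homosexual_group, contrast_group)
print(f"U = {u}, one-tailed p = {p:.3f}")
```

With groups as small as the six illustrative scores used here, exact tables would be preferable to the normal approximation; with 20 Ss per group the approximation is more reasonable.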


RESULTS 


For the homosexual group, the derived 
scores expressing the primary variable, i.e., 
distance between mother and father on S's 
identification continuum, are significantly 
greater (at the .001 level) than the derived scores of the
contrast group. Some indication of the separation
between the two groups on this variable
is provided by the finding that 14 of
the 20 homosexuals have positive scores
where only 4 of the 20 nonhomosexuals are
on the positive side of zero. The primary
hypothesis, that overtly homosexual males are 
relatively more strongly identified with their 
mothers than with their fathers, is therefore 


groups is made, 
S show a significantly 
identification with the!” 


significant but still appreciable trend in the 
direction of less identification with father by 
the homosexual group. The one-tailed test 
results in a p value of .07, almost reaching 
the conventional level of significance. From 
these separate and rather equivalent findings, 
it is clear that the striking separation between 
the two groups in the test of the first hy- 
pothesis is a joint reflection of the two con- 
stituent scores rather than being dominated 
by but one of these. This kind of result is
both a subtler and a more interesting one.


Self-Acceptance 


The homosexual and control groups did 
not differ significantly in their extent of self- 
acceptance, where self-acceptance is defined 
as the degree of correspondence between the 
perceived self and ideal self. It should be 
especially noted that there is no perceptible 
trend for the contrast group to be relatively 
more self-accepting, as indicated by the two- 
tailed p value of .64 for the obtained differ- 
ence between the sums of ranks. This finding 
is a rather interesting one for it calls atten- 
tion to the special nature of this homosexual 
sample as one not psychiatrically disturbed 
and apparently well-accepting of their homo- 
sexual activities. 


Analysis of Ideal-Self-Descriptions 


The results in the preceding analyses might 
have been due to differences between the 
kinds of ego ideals of the homosexual and 
control groups. For this reason, an item anal- 
ysis of the adjective descriptions of the ideal 
selves of the two groups of Ss was done. Only 
2 of the 79 adjectives discriminated (at the 
.05 level) between the ideal-self-descriptions 
of the two groups. The conclusion is made, 
therefore, that the differential valuations by 
the homosexual Ss and control Ss of their 
parents are not due to different ego ideals. 
For the sake of completeness, it may be 
recorded that nonhomosexual Ss considered 
“sympathetic” as an attribute of their ideal 
self more frequently than did homosexual Ss. 
Homosexual Ss, more than control Ss, tended 
to deny the characteristic “dependent” as an 
attribute of the ideal self. No additional ad- 
jectives were found to discriminate when the 
threshold of significance was lowered to the
.10 level.

Analysis of Self-Descriptions


Since the homosexual and control Ss did 
not differ in their degree of self-acceptance 
and in their ideal-self-descriptions, one might 
expect that they would also not differ in their 
self-perceptions. But if this expectation were 
to be realized, then some of our preceding 
findings become questionable. The reasoning 
behind this remark is that homosexuals and 
normals are quite obviously rather different 
sorts of people and, consequently, these dif- 
ferences should, if only on a priori grounds, 
be manifest if a reasonably sensitive instru- 
ment is employed. Failure to find these dif- 
ferences would be presumptive evidence of 
the technique’s inadequacy, a conclusion that 
might also explain away the failure to find 
differences with respect to self-acceptance and 
ego ideals. On the other hand, it is possible, 
despite the failure of the self-acceptance and 
ideal-self-analyses to show differences, for the 
self-description analyses to separate the 
groups. This possibility arises because the 
self-acceptance for neither group is so high 
as to require (because of equivalent ideal 
selves) that the content of the self-descrip- 
tions be equivalent. With moderate corre- 
lations of self- with ideal-self-descriptions 
the different dimensions of self-description are 
free to operate and, hence, be seen. 

Item analysis of the self-descriptions of 
homosexual and control Ss finds six adjectives 
discriminating at the .05 level, with an additional
six adjectives meeting the .10 criterion; these
results cannot be accounted for by a proper statement
of chance significance.³ Moreover, the
discriminating adjectives are consonant with 
the concept of the homosexual male. Com- 
pared to control Ss, the homosexual male sees 
himself as “affected, dependent, determined,
personally charming, restless, and not tactless”
(all of these significant at the .05 level).
The picture can be extended by noting that
the homosexual tends to see himself as “impulsive”
but not “frank” or “easily embarrassed”
or “ambitious.” Our homosexual males
are less often “friendless or introspective or
self-aware” than the control Ss, all of these
last differences reaching the .10 level.

3 By reference to “a proper statement of chance
significance,” we are alluding to a recently developed
recognition that conventional expectations of chance
significance have been appreciably wrong rather
often. In the situation where the response distributions
of both groups being compared are highly
skewed (or, in the dichotomous case, are extremely
unbalanced, as in the present study), far fewer than
5% of the findings will be significant at, for example,
the .05 level when chance alone is operating.
The logic underlying this assertion and some
empirical findings are detailed in a manuscript available
from the second author.
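
For orientation, the conventional chance expectation against which these item counts are set can be computed directly. The sketch below shows only that nominal calculation, assuming the 79 adjectives behave as independent tests with a false-positive rate equal to the stated level; it also reads the six additional adjectives as giving 12 in all at the .10 criterion. Both are assumptions of the sketch.

```python
from math import comb


def binomial_upper_tail(n_items, alpha, k_observed):
    """P(at least k_observed of n_items are significant at level alpha by chance).

    Assumes independent items, each with a chance-significance rate of alpha.
    """
    return sum(
        comb(n_items, k) * alpha**k * (1 - alpha) ** (n_items - k)
        for k in range(k_observed, n_items + 1)
    )


N_ADJECTIVES = 79

# (significance level, number of discriminating adjectives in the self-descriptions)
for alpha, observed in [(0.05, 6), (0.10, 12)]:
    expected = N_ADJECTIVES * alpha
    tail = binomial_upper_tail(N_ADJECTIVES, alpha, observed)
    print(
        f"level {alpha:.2f}: {expected:.2f} expected by chance, "
        f"{observed} observed, P(>= observed) = {tail:.3f}"
    )
```

The nominal figure is an upper reference point only: where item distributions are highly unbalanced, the true rate of chance-significant items is lower than the nominal level, so the nominal tail probability overstates how easily the observed counts could arise by chance.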

Although the discriminating characteristics 
cluster and portray someone with the person- 
ality traits usually associated with homo- 
sexuality in this culture, it is especially note- 
worthy that such adjectives as “confident, 
psychologically secure, relaxed, dissatisfied, 
helpless, worried” and the like failed to dis- 
criminate between the self-perceptions of the 
two groups. 


DISCUSSION 


The major hypothesis of this study, that 
frankly homosexual males more than control 
males show a positive identification with their 
mothers together with a disidentification with 
their fathers, has been supported by the re- 
sults of this study. Subsidiary hypotheses 
have also been supported. What now shall 
we make of this finding? 

Clearly, no one study is able to test the 
Freudian hypothesis, in all its complexities, 
about homosexuality. The present research 
does, however, in its limited way, support the 
psychoanalytic position. Additional studies, 
by the present and alternative methods, are 
required to investigate the multifaceted 
Freudian theory that the relatively strong 
identification with mother found in homo- 
sexual males has been preceded by and is the 
result of a strong fixation upon the mother, 
that the starting point of homosexuality 
comes quite discretely in , and 
that homosexuality represents a narcissistic 
object-choice. Such research represents an
undertaking of a much larger dimension than 
the present one. 

With respect to the present study, the un- 
usual nature of the experimental sample used 
—self-accepting homosexuals—should espe- 
cially be noted. Our results may well be an 
expression of the special character of this 
group. In more conflicted homosexuals, where 
identifications have been less decisively made,
the presently obtained relationships can be
expected to exist less strongly. One should
find, among individuals with more ambiguity
of identification, that there is less self-acceptance
and more personal despair than was
observed among the effectively functioning
and well-received homosexuals studied in the
present research.


SUMMARY 


The present study was concerned with a 
test of the Freudian notion that homosexual- 
ity in the male is based upon an over-identi- 
fication with the mother figure together with 
an under-identification with the father figure. 


Two groups of Ss, equivalent with respect
to age, education, and socioeconomic level,
were used: a group of 20 males engaging,
apparently successfully, in homosexual practices,
and a group of nonhomosexual males.
Each S described his ideal self, his mother,
his father, and himself, by the use of a list
of adjectives. On the basis of an “ego ideal” 
conception of identification, various identifi- 
cation scores and a self-acceptance score were 
obtained for each S, by comparing his de- 
scription of his ideal self with that of the 
other individuals he described. 

As tested, the hypothesis was supported,
for it was found that overt homosexual males
were more identified with their mothers and 
more disidentified with their fathers than 
were the nonhomosexual males with whom 
they were compared. In addition, it was 
found that the two groups of Ss did not 
differ significantly in their degree of self- 
acceptance nor in regard to the kind of ego 
ideal toward which they aspired. The findings 
were briefly discussed.


THE AFTEREFFECT OF SEEN MOVEMENT
AND BRAIN DAMAGE 


R.H. 


University 


If a moving stimulus object which has 
been visually fixated for about one minute is 
stopped suddenly, there occurs a character- 
istic illusory movement in a direction oppo- 
site to that of the real movement. This after- 
effect of seen movement is experienced fol- 
lowing fixation of various types of moving 
stimulus objects including a rotating spiral 
pattern, a sectored disc, or a pattern of black 
and white bands (waterfall illusion). The 
phenomenon is not restricted to the stimulus 
object but can be observed in the field of 
the closed eye and on other textured surfaces. 
If one eye is used during fixation and that 
eye closed on the cessation of real movement, 
the aftereffect can be observed using the
nonstimulated eye.1

Recent studies have drawn attention to a 
relationship between this aftereffect and brain 
damage. So far only the rotating spiral pat- 
tern has been used as a stimulus object 
(Aaronson, 1958; Davids, Goldberg, & Lau- 
fer, 1957; Freeman & Josey, 1949; Gallese, 
1956; Garrett, Price, & Deabler, 1957; Hol- 
land & Beech, 1958; Price & Deabler, 1955; 
Spivack & Levine, 1957). Results from these
studies have indicated that organically impaired
subjects (Ss) can be discriminated
from normal or functionally disordered controls
with a high degree of certainty. Generally
it has been found that, whereas normal
individuals experience a vivid and unmistakable
aftereffect, cases of brain damage either
report no aftereffect or one which is reduced
in its duration or vividness. Some studies,
however, have not confirmed this and have
reported that little or no differentiation can
be made between brain damaged Ss and normal
controls (Goldberg & Smith, 1958; Johnson,
Bauer, & Brown, 1959; Page, Rakita, Kaplan,
& Smith, 1957; Philbrick, 1959; Pickersgill
& Jeeves, 1958).

1 The apparent interocular transfer of an aftereffect
from the stimulated to the nonstimulated eye
cannot be interpreted as evidence for its central
(cortical) locus (Day, 1958).

It is intended here to direct attention to 
issues which appear to have been overlooked 
and which have an important bearing on re- 
search in this area. It is believed that failure 
to consider certain behavioral outcomes of 
brain damage as well as insufficient attention 
to methodology and sampling have resulted 
in the lack of agreement between the results 
from independent studies. 


VISUAL FIXATION AND MOVEMENT 
AFTEREFFECTS 


The aftereffect of seen movement occurs 
when the eye is fixated on a stationary point 
during the period of stimulation by the mov- 
ing pattern (Holland & Beech, 1958; Wohlge- 
muth, 1911). If the eye follows the moving 
pattern, little or no aftereffect occurs. In order
to confirm this, 12 normal Ss were required
to follow one of the sectors on a black and
white sectored disc on one trial and to fixate
the center of the disc on another. On the
following (tracking) trial eight Ss failed to
report an aftereffect and four reported a slight
but brief aftereffect. All Ss experienced a
strong aftereffect (apparent movement in the
reverse direction) following fixation of the
center of the disc.

When the eye follows a point on a moving
stimulus pattern, the retinal image of the pattern
either remains stationary or moves less
relative to the retina. When a fixed point such
as the center of the disc is fixated, the retinal
image moves with respect to the retina. An
essential condition, therefore, for the occurrence
of a clear aftereffect of seen movement
is motion of the retinal image of the stimulus
pattern. This condition is achieved by steady
fixation of a stationary point in or near the
stimulus object.
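
The relation between eye movement and retinal image motion described above can be put in a minimal numerical sketch. It assumes, for illustration only, that image motion on the retina is simply stimulus velocity minus eye velocity, and that following can be summarized by a single hypothetical `gain` parameter (0 for steady fixation, 1 for perfect following). Neither the function names nor the velocity values come from the studies cited.

```python
def pursuit_eye_velocity(stimulus_velocity: float, gain: float) -> float:
    """Eye velocity when the eye follows the pattern with a given gain.

    gain is a hypothetical summary parameter: 0.0 means steady fixation of a
    stationary point, 1.0 means the eye follows the moving element perfectly.
    """
    return gain * stimulus_velocity


def retinal_slip(stimulus_velocity: float, eye_velocity: float) -> float:
    """Velocity of the retinal image relative to the retina (same units)."""
    return stimulus_velocity - eye_velocity


if __name__ == "__main__":
    stimulus = 10.0  # illustrative stimulus velocity, arbitrary units
    for gain in (0.0, 0.8, 1.0):
        slip = retinal_slip(stimulus, pursuit_eye_velocity(stimulus, gain))
        print(f"gain={gain:.1f}  retinal image motion={slip:.1f}")
```

Under these assumptions the retinal image moves at the full stimulus velocity only when the eye is held on a stationary point; as following approaches a gain of 1 the image motion, and with it the condition for the aftereffect, disappears.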


FIXATION AND BRAIN DAMAGE 


In an investigation of the cortical integra- 
tion of ocular movements it has been shown 
that, whereas some organics experienced diffi- 
culty in voluntarily changing the point of 
fixation, their eyes followed a moving object 
involuntarily (Holmes, 1938). In an account 
of the effects of frontal brain damage on eye 
movement and fixation Holmes has stated: 


This condition at first shows all the appearance 
of loss or paralysis of movement, but that this is
not present becomes obvious when the subject is 
asked to follow with his eyes a slowly moving ob- 
ject; then, providing the rate of movement is uni- 
form and not too fast a full range of deviation may 
be obtained . . . when the patient looks at a rotat- 
ing drum on which lines or figures are displayed the 
eyes also deviate in the direction in which the fig- 
ures move . . . We therefore find that though vol- 
untary effort fails to produce movement the eyes 
follow a moving point the images of which fall on 
the retina (p. 106). 


In discussing the work of Holmes (1938) 
Sherrington (1940) has presented a similar 
view: 


But if asked to follow with his eyes a slowly 
moving object, his eyes do so. That is, though willed 
effort fails to produce the act, the eyes follow ‘of 
themselves’ a moving point whose image is fixated 
by them. The eyes remain anchored to that object
by the ‘fixation reflex.’ Those afflicted with this con- 
dition when they want to look at something fresh, 
close the eyes for a moment, or toss the head, or 
place a hand between the eyes and the object fixing 
their gaze. By breaking the gaze a moment they 
free themselves from the fixation reflex; though by 
willed effort they cannot break it (p. 226). 


Thus in those cases in which voluntary eye 
movements and visual fixation have become 
difficult or unattainable, but in which optical 
fixation reflexes are preserved, the eyes re- 
main fixated on the moving object. This 
symptom of frontal brain damage has been 
called “following movement” by Walsh 
(1957). It would be expected then that the
moving elements of a stimulus pattern would
tend to be followed, rather than a stationary
point fixated. It has been pointed out that
this method of observation does not result
in a persistent or vivid aftereffect. It seems
probable that the absence, weakness, or short 
duration of the aftereffect of seen movement 
observed in certain cases of brain damage 
derives in some measure from an inability to 
meet the conditions of visual fixation neces- 
sary for its occurrence. Even if fixation of 
a stationary point is achieved during part of 
the period of stimulation, the aftereffect would 
be reduced as compared with that resulting 
from steady fixation during the full period of 
stimulation. The length of the period of 
stimulation is an important determinant of 
the aftereffect (Wohlgemuth, 1911). 

A possible relationship between the well
known distractibility of organically impaired
Ss and reduced aftereffects of seen movement
has already been pointed out (Holland &
Beech, 1958). It has been suggested that
such distractibility could result in inadequate
fixation and thus contribute to the weakness
of the aftereffect. The view proposed here,
however, is that the weakness of the aftereffect
is based on an inability on the part of
the organic S to fixate rather than on a
tendency to inattention.


MOVEMENT AFTEREFFECT AND THE LOCUS
OF BRAIN DAMAGE


The control of eye movements is not confined
to a single cortical center. Stimulation
of the frontal region (Brodmann’s Area 8),


It would be expected, therefore, that the de- 
gree to which the aftereffect of seen move- 
ment is affected would depend upon the na- 
ture, locus, and extent of brain damage, the 
most marked deterioration occurring with 
damage to those centers controlling voluntary 
fixation. There have been no studies so far 
in which the locus and magnitude of damage 
have been considered in relation to the after- 
effect. The most common practice has been 
to compare the aftereffect reported by a 
heterogeneous organic group with that re- 
ported by normal and functionally disordered 
controls. Since the constitution of the brain 
damaged groups has varied widely between 
the numerous investigations, it is not surpris- 
ing that inconsistencies between results have 
occurred. 

It seems probable that the discrepancies
between independently collected data are due
in part to a failure to consider the locus,
chronicity, and extent of brain damage.
Considering the wide variety of behavioral symptoms
manifested by such diverse conditions
as epilepsy, traumatic brain injury, Parkinson’s
disease, and postsurgical conditions,
there is little justification for regarding brain
damage as a single category characterized by
a common experience (or lack of it) following
visual stimulation. It is suggested here
that the widely differing results derive from
the varying effects of different types of brain
damage on ability to fixate under the conditions
employed.


SCORING METHODS


Two methods for scoring the aftereffect of
seen movement have been used. The simplest
method is that of scoring the reported occurrence
and nonoccurrence of the aftereffect
0 and 1, respectively, and an equivocal report
0.5 (Freeman & Josey, 1949).2 A variant of
this method is to score nonoccurrence 0,
occurrence 1, and to score 0.5 when a single
phenomenal feature only (contraction-expansion,
change in size or approach-recession)
is reported (Garrett et al., 1957; Philbrick,
1959; Price & Deabler, 1955). A second
method requires the S to report the duration
of the aftereffect (Holland & Beech, 1958;
Page et al., 1957; Spivack & Levine, 1957).

2 Whereas Freeman and Josey (1949) score
nonoccurrence of the aftereffect as 1 and occurrence 0,
other investigators have scored 1 for occurrence and
0 for nonoccurrence. The latter would appear to be
a more meaningful method.

It has been found that, whereas some brain 
damaged Ss fail to report the aftereffect spon- 
taneously, they have reported it when the 
nature of the phenomenon has been described 
to them. This has been most noticeable when 
a rotating black and white sectored disc has 
been used as the stimulus object. The aftereffect
from this pattern (apparent reversal
of movement) is much less vivid than from
a spiral pattern. It has been observed also
that organically impaired Ss often report a 
single feature spontaneously but will describe 
another feature following a description of it. 

It is conceivable that with respect to both 
the presence or absence of the aftereffect and 
the number of reported features the differ- 
ences between organic and normal Ss derive 
from the adequacy with which they report. 
If the duration or persistence of the after- 
effect were used as the score after it has been 
established that the S experiences the phe- 
nomenon, the reliability of the scores could 
be readily established. It has been found that 
duration can be reported with confidence by 
most organic Ss with the exception of those 
who exhibit marked deterioration and con- 
fusion. 
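
The scoring schemes discussed in this section can be set side by side in a short sketch. It is offered only as an illustration: the function names, the report labels, and the idea of passing the number of reported features as an integer are assumptions made here, not part of any of the cited procedures.

```python
def score_occurrence(report: str, freeman_josey: bool = False) -> float:
    """Score a report of 'occurrence', 'nonoccurrence', or 'equivocal'.

    By default occurrence is scored 1 and nonoccurrence 0. Setting
    freeman_josey=True reverses the scale (nonoccurrence 1, occurrence 0).
    An equivocal report is scored 0.5 under either convention.
    """
    base = {"occurrence": 1.0, "nonoccurrence": 0.0, "equivocal": 0.5}[report]
    return 1.0 - base if freeman_josey else base


def score_features(n_features_reported: int) -> float:
    """Variant scheme: 0 for no aftereffect, 0.5 when only a single
    phenomenal feature is reported, 1 otherwise."""
    if n_features_reported <= 0:
        return 0.0
    if n_features_reported == 1:
        return 0.5
    return 1.0


def score_duration(reported_seconds: float) -> float:
    """Second method: the score is the duration reported by the S."""
    if reported_seconds < 0:
        raise ValueError("duration cannot be negative")
    return reported_seconds


if __name__ == "__main__":
    print(score_occurrence("occurrence"))                      # default scale
    print(score_occurrence("occurrence", freeman_josey=True))  # reversed scale
    print(score_features(1))
    print(score_duration(12.5))
```

The first two functions depend on what the S chooses to report, whereas the duration score presupposes only that the S has been shown to experience the phenomenon, which is the distinction drawn in the text.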


DISCUSSION


Two explanations of the absence or weakness
of the aftereffect of seen movement reported
by organic Ss have been suggested.
The first postulates a synthesizing or integrating
process in the CNS responsible for
movement perception. It is argued that this
process is either absent or adversely affected
by brain damage (Price & Deabler, 1955).
This view is based upon a recent theory of
motion perception (Saucer, 1953, 1954). A
second explanation derives from a concept of
central nervous inhibition. The effect of brain
damage is to intensify cortical inhibitory
processes and thus to weaken the aftereffect
(Holland & Beech, 1958; Price & Deabler,
1955). Each of these explanations emphasizes
the role of central nervous mechanisms as
determinants of the aftereffect of seen movement.

The view expressed here is that the failure
of some organic Ss to experience the after- 
effect from a moving pattern, or to experience 
it with less vividness than normal Ss, is due 
to a tendency to visually follow the elements 
of a moving pattern. This tendency is con- 
sequent upon damage to those parts of the 
brain initiating and controlling voluntary 
fixation and a replacement of these voluntary 
activities by reflex movements controlled from 
occipital centers. Although visual observation 
of the eye movements of brain damaged and 
normal Ss has shown that the former exhibit 
markedly more frequent and grosser eye 
movements during “fixation,” this finding has 
yet to be confirmed by ophthalmographic 
recording. 

In view of the wide variations in the locus,
extent, and nature of brain damage there 
seems little justification for attempting to 
establish a relationship between a sensory 
aftereffect and brain damage. It is possible 
that this simple test may prove useful in 
differentiating between types of damage 
rather than differentiating it from other con- 
ditions. If the hypothesis outlined here is 
supported by subsequent data, a more effi- 
cient technique would be that of observing 
the magnitude and direction of eye movement 
during the period when S is attempting to 
fixate. Such a method would have the advan- 
tage of dispensing with subjective reports of 
an experience which may be an indirect out- 
come of brain damage. 


SUMMARY 


Recent studies have shown that with brain 
damaged Ss the aftereffect of seen movement 
from a rotating spiral pattern is either absent 
or weaker than that experienced by normal 
Ss. Some studies have failed to confirm this 
observation. 

A necessary condition for the occurrence
of this aftereffect is steady fixation of a
stationary point in or near the moving stimulus
pattern so that the retinal image of the pattern
moves relative to the retina. Voluntary
fixation and eye movement are affected in certain
cases of brain damage so that the individual’s
eyes tend to follow the moving pattern.
It is suggested that the weakness or
absence of the aftereffect from a moving pattern
derives from this failure of voluntary
fixation.

Since the main centers controlling voluntary
eye movement and fixation are located
in the frontal region of the brain, it would be
expected that damage affecting that region
would have the most marked effect on the
aftereffect.

Inconsistencies between the results from 
various studies probably derive in large part 
from a failure to consider the locus, extent, 
and nature of brain damage as well as the 
inadequacies of the methods used to measure 
the aftereffect. 

Some preliminary results in support of the 
eye movement hypothesis have been pre- 
sented. 


Theoretically, and from the standpoint of
research, the therapist appears to be finding
his way back into therapy. This trend is in
keeping with a renewed interest in
countertransference or, more broadly, the therapist’s
contribution to the treatment process. Mensh
(1956) states, “. . . study of the therapist
is now more explicitly indicated, an emphasis
which has appeared with trends toward
introducing experimental design qualities into
research in psychotherapeutic process” (p.
342). Butler (1952) asserts that “. . . exact
observation of the therapist is a necessary 
precondition to understanding the behavior 
of the client” (p. 378); similarly, Gitelson 
(1952), Jackson (1956), and Wolberg (1954) 
stress the importance of therapist variables. 
Perhaps the primary impetus for focusing 
upon the therapist stems from a belief that 
orthodoxy is waning and “Viewing therapy 
as a very intimate relationship and recogniz- 
ing a trend toward less restrictive therapist 
roles, the expectation arises that differential 
therapist characteristics would have system- 
atic effects on process and outcome” (Winder, 
1957, p. 316). 

Previous attempts to explore the relation- 
ships between therapist variables and thera- 
pist performance in treatment have consid- 
ered mainly those factors which most readily 
lend themselves to objective quantification: 
i.e., the therapist’s professional affiliation,
theoretical allegiance, length of experience, 
number of hours of personal analysis, etc. 


1 This paper is a summary of a dissertation sub- 
mitted in partial fulfillment of the PhD degree to 
the Department of Psychology, University of North 
Carolina. 

This research is part of a larger project supported 
by a research grant (M-2171) from the National 
Institute of Mental Health, Public Health Service. 


The potential importance of less tangible per- 
sonality variables, long recognized in Freud’s 
conception of countertransference, is begin- 
ning to attract the attention of investigators 
seeking objective documentation. For in- 
stance, Parloff (1956) found that the thera- 
pist relates most satisfactorily to those pa- 
tients who most clearly approximate his 
“Ideal Patient.” Strupp (1958) has shown
that the therapist’s attitude toward the pa- 
tient was significantly correlated with the 
form of therapy, frequency and length of 
treatment, the kinds of technical problems 
expected in therapy, and certain qualitative 
aspects of the techniques the therapist ex- 
pected to employ. These observations strongly 
support the notion that emotional and atti- 
tudinal factors in the therapist may have an 
important bearing upon his clinical evalua- 
tions. 

The therapist, as a function of his life 
experiences, approaches each initial interview 
with needs, expectations, and wishes of his 
own. It may be hypothesized that, if his ex- 
pectations are sufficiently realized, he will 
consider the situation as “rewarding,” and a 
“warm” attitude toward the patient is likely 
to develop. More specifically, if in an initial
interview the prospective patient approximates
the therapist’s conception of an “ideal
patient,” he may develop a warm attitude
toward the patient. While the conception of
“ideal patient” may differ from therapist to
therapist, in a general sense, it may be assumed
to imply a certain congruence between
the kind of help the patient is seeking and
the kind of help the therapist is able to provide,
as well as an explicit or implicit belief
on the patient’s part that the treatment will
be effective. Patients exhibiting such hopes
and beliefs may be said to be “motivated
for therapy” or “ready for a psychothera- 
peutic experience.” Thus, patients whose be- 
liefs, desires, and expectations are similar— 
in some respects at least—to corresponding 
expectations held by the therapist may be 
said to arouse a warmer attitude in the thera- 
pist. This attitude, it is hypothesized, will be 
sufficiently strong to give rise to a kind of 
“halo effect” overshadowing certain “reality 
factors.” Operationally, this may be evi- 
denced by the character of the therapist’s 
perception of the patient, treatment planning, 
and the like. 

The initial halo effect, it is further hypothe- 
sized, will be self-realizing. This is to say, 
having achieved a “comfortable” or “warm” 
feeling toward the patient, the therapist will 
continue to focus on perceptions which are 
likely to engender further this effect, and he 
will propose a therapeutic regime which will 
reflect this attitude. Moreover, he may be 
more inclined to view the patient as a “good 
patient” and work toward a closer therapeutic 
contact with the patient. The present investi- 
gation aims to explore the relationships be- 
tween the therapist’s attitude toward the pa- 
tient on the one hand and his perceptions of 
the patient and treatment plans on the other. 
The causal direction, of course, remains an
open question.


HYPOTHESES


1. A patient’s verbal or nonverbal expectations
concerning his “motivation for therapy”
or “readiness for therapy” are an important
determinant of the therapist’s attitude toward
the patient. In particular, a more
highly motivated patient will engender in the
therapist a warmer attitude.

2. The therapist’s attitude toward the patient,
whether a function of the patient’s
motivation for therapy or other factors, is 
correlated with the therapist’s perception of, 
and treatment plans for, the patient. More 
specifically, a warmer therapist attitude to- 
ward the patient is associated with more 
favorable perceptions of the patient, includ- 
ing clinical judgments, prognostic estimates, 
and treatment plans implying closer thera- 
peutic contact. 


PROCEDURE 


To test these hypotheses it was planned to present 
a group of therapists with a patient manifesting rela- 
tively high motivation for therapy, and a comparable 
group of therapists with the same patient showing 
relatively low motivation for therapy. Furthermore, 
to lend greater generalizability to the findings, 
therapists were presented with two patients. 

Two written case histories of neurotic patients 
were adapted for experimental purposes. Each case 
history consisted of alternate forms which were 
identical in all respects, except that one form pre- 
sented the patient as highly motivated and the 
second form as poorly motivated for therapy. “Mo- 
tivation” was communicated primarily in terms of 
the patient’s eagerness to seek and accept psycho- 
therapeutic help. Each case history was followed by 
an identical questionnaire of 27 items eliciting the 
respondents’ clinical impressions, treatment plans, 
and attitudes toward the patient. Subjects were 
presented with the case histories of both patients, 
the degree of motivation being varied systematically. 
In addition, several biographical questions were ap- 
pended to the second questionnaire. 


Sample 


Data were collected from medical psychotherapists
actively engaged in the practice of psychotherapy.
Three sources were utilized: (a) physicians
known to the investigators personally, (b) colleagues
of psychologists at selected installations, and
(c) psychiatrists selected at random from the Directory
of the American Psychiatric Association. Of
300 sets of material distributed personally or by
mail, 82 were included in the analysis. One hundred
and eighty-four sets were never returned, 26 respondents
asked to be excused, and 8 respondents
completed and returned the questionnaires after the
analysis of the data was well in progress. Analysis
of the biographical data (age, sex, professional level,
years of experience, number of hours of personal
analysis) supplied by respondents indicated that the
sample was drawn from a relatively homogeneous
universe of experienced medical psychotherapists
(see Table 1), and that in terms of these variables
the various subsamples showed no significant variations.


TABLE 1

BIOGRAPHICAL DATA OF RESPONDENTS

                                                Mean    Mean Hours of         Mean Years
                           N    Male   Female   Age     Personal Analysis a   of Experience
[label illegible]          11   10     1        33.91   140 (2)               3.05
Psychiatric Residents      37   34     3        35.86   487 (26)              7.27
Psychiatrists              34   30     4        44.47   625 (34)              14.31
Total                      82   74     8        39.17   549 (62)              9.62

a Figures in parentheses indicate number of respondents who had personal analysis.
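
The disposition of the 300 distributed sets reported in the Sample section can be checked with simple arithmetic. The sketch below is only a bookkeeping illustration; the dictionary keys and the function name are labels chosen here, while the counts are those stated in the text.

```python
def usable_rate(disposition: dict, distributed: int) -> float:
    """Proportion of distributed sets that entered the analysis.

    Raises ValueError if the disposition categories do not account for
    every distributed set.
    """
    if sum(disposition.values()) != distributed:
        raise ValueError("disposition categories do not sum to the total")
    return disposition["included"] / distributed


# Counts as reported in the Sample section.
DISTRIBUTED = 300
DISPOSITION = {
    "included": 82,
    "never_returned": 184,
    "asked_to_be_excused": 26,
    "returned_too_late": 8,
}

if __name__ == "__main__":
    rate = usable_rate(DISPOSITION, DISTRIBUTED)
    print(f"usable returns: {rate:.1%}")
```

The four categories account for all 300 sets, and a little over a quarter of the distributed material was usable, which bears on how far the sample can be taken to represent the population of medical psychotherapists.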


RESULTS


Position Effects


As a first step in the statistical analysis it 
seemed important to investigate the possible 
influence of order of presentation (patient 
and degree of motivation) upon questionnaire 
responses. Since 24 multiple-choice items were 
answered under four experimental conditions, 
96 comparisons were called for. Chi squares
and t tests of these comparisons yielded 15
results significant at or beyond the .1 level.
These significant position effects appeared to 
be largely due to the fact that one of the 
patients seemed objectively more disturbed, 
so that respondents studying the male (more 
disturbed) patient first tended to attribute 
greater personality strengths and assets to 
the female (less disturbed) patient when her 
case was presented second. 
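
When many comparisons are run, some will reach significance by chance alone, and it can help to set the observed count against that expectation. The sketch below is a rough guide under a strong simplifying assumption, namely that the 96 comparisons are independent, which questionnaire items answered by the same respondents are not. The function names are illustrative, and the significance level is left as a parameter.

```python
from math import comb


def expected_chance_results(n_comparisons: int, alpha: float) -> float:
    """Expected number of comparisons significant by chance at level alpha."""
    return n_comparisons * alpha


def prob_at_least(k: int, n_comparisons: int, alpha: float) -> float:
    """Binomial probability of k or more chance-significant results,
    assuming independent comparisons (an idealization)."""
    return sum(
        comb(n_comparisons, i) * alpha**i * (1 - alpha) ** (n_comparisons - i)
        for i in range(k, n_comparisons + 1)
    )


if __name__ == "__main__":
    n, observed = 96, 15
    for alpha in (0.05, 0.10):
        print(
            f"alpha={alpha:.2f}  expected by chance={expected_chance_results(n, alpha):.1f}  "
            f"P(at least {observed})={prob_at_least(observed, n, alpha):.4f}"
        )
```

Comparing the 15 observed position effects with the number expected by chance at the chosen level gives a sense of how much weight the pattern can bear, subject to the independence assumption noted above.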


Effects of Communicated Patient
Motivation 


Chi square or t tests were employed to
ascertain effects of the patient’s motivation, 
as communicated in the case histories, upon 
responses to the 24 multiple-choice items. 
Five figures significant at or beyond the .05 
level were obtained for the male case, and 
nine figures significant at this level were ob- 
tained for the female case. 
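
A comparison of this kind can be illustrated with a Pearson chi-square computed on a 2 x 2 table. The sketch assumes that an item has been dichotomized (for example, high versus low estimated ego strength) and cross-classified against communicated motivation. The counts are hypothetical and serve only to show the computation; they are not data from this study, and the original items may have had more than two response categories.

```python
def chi_square_2x2(table):
    """Pearson chi-square statistic (no continuity correction) for a 2 x 2 table.

    table is [[a, b], [c, d]] with rows as conditions (e.g., high vs. low
    communicated motivation) and columns as dichotomized responses.
    """
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    return statistic


# Critical value of chi-square for df = 1 at the .05 level.
CRITICAL_05_DF1 = 3.841

if __name__ == "__main__":
    hypothetical_counts = [[30, 11], [18, 23]]  # illustrative only
    stat = chi_square_2x2(hypothetical_counts)
    print(f"chi square = {stat:.3f}, significant at .05: {stat > CRITICAL_05_DF1}")
```

With one degree of freedom the statistic is referred to the tabled critical value; items with more than two response categories would require the corresponding larger table and degrees of freedom.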

When Mr. J. was presented as possessing 
high motivation for therapy, respondents 
tended to attribute to him greater ego 
strength, more anxiety, more insight into his 
problems, greater motivation for therapy 
(check item), and they expressed greater will- 
ingness to accept the patient for treatment. 
Contrary to the hypothesis, therapists’ attitudes
toward the patient did not vary significantly.
However, there was a (statistically
nonsignificant) trend in the predicted direction:
a slight tendency for therapists to claim
a warmer attitude toward the patient when
he was presented as being more highly motivated
for therapy.


Martin S. Wallach and Hans H. Strupp


When Mrs. D. was presented as possessing
high motivation for therapy, respondents
estimated ego strength, insight into her problems,
and motivation (check item) as being
greater. They likewise judged her social ad- 
justment as more adequate, more often rec- 
ommended psychotherapy or psychoanalysis 
without adjunctive forms of treatment (drugs, 
shock), and indicated that they would strive 
to accomplish a more extensive change in the 
patient’s character structure. Presenting Mrs. 
D. with high motivation also accounted for 
a more favorable prognosis with treatment 
and a greater willingness on the part of the 
therapist to take her into treatment. Finally,
as hypothesized, therapists claimed a warmer 
attitude toward the patient when high moti- 
vation was communicated. 


Effects of Perceived Patient Motivation 


The preceding comparisons were made in 
terms of the level of motivation communi- 
cated in the case material which did not 
necessarily coincide with the patient’s motiva- 
tion as actually perceived by the respondents. 
For the male case, there was high agreement 
between the kind of motivation communi- 
cated and the kind of motivation perceived 
by respondents. For the female case, how- 
ever, the disparity between communicated 
and perceived motivation was greater, as evi- 
denced by 13 shifts (16%). Ten of these 
respondents “misread” the case history in- 
tended to convey low motivation by estimat- 
ing the patient’s motivation to be relatively 
high. Consequently, it was thought advisable 
to determine the relationship between per- 
ceived motivation and other variables, and 
chi square tests were performed between the
respondent’s estimate of the patient’s motivation
and each of the other 23 multiple-choice
items.

For the male case, questionnaire items already
cited as being significantly related to
communicated motivation were again significantly
related to perceived motivation. In
addition, five other items were significantly
related to perceived motivation at or beyond
the .05 level. Respondents who perceived Mr. 
J. to be highly motivated also tended to esti- 
mate social adjustment to be more adequate, 
suggested more frequent therapeutic sessions, 
stated that they would be less willing to alter 
the kind of therapy they usually conduct, 
estimated the prognosis to be more favorable 
with treatment, and found it easier to em- 
pathize with the patient. 

For the female case, estimates of greater 
ego strength, more insight, better social ad- 
justment, more favorable prognosis with 
treatment, greater willingness to accept the 
patient in treatment, and warmer therapist 
attitudes accompanied perceived motivation, 
as they did communicated motivation. On the
other hand, perceived high motivation (unlike
communicated high motivation) was not
significantly related to form of treatment or
to the extensiveness of attempted change in 
the patient’s character structure. However, 
respondents who perceived Mrs. D. as highly 
motivated for therapy also tended to judge 
her as being more emotionally mature and 
less seriously disturbed. 


Effects of Therapists’ Attitudes 


In order to test the second hypothesis, the 
respondents’ stated attitudes toward the pa- 
tient were compared, by chi square, with the 
23 remaining multiple-choice items. 

For the male case, eight items varied significantly
with the therapist’s attitude toward
the patient, and only one of these (willingness
of the therapist to alter his usual way of conducting
therapy) was in the opposite of the
predicted direction. Briefly, a positive therapist
attitude was found to be associated with:
a diagnosis of psychoneurosis rather than one
of psychosis or personality disorder, estimates
of greater ego strength and insight,
less willingness to alter his usual therapeutic
procedure, more favorable prognostic estimates
with or without treatment, greater
willingness to accept the patient for treatment,
and greater ease of empathizing with
the patient.

For the female case, six items showed a
significant relationship with attitude at the
.05 level, and all differences were in the predicted
direction. Therapists who claimed a
more positive attitude toward Mrs. D. considered
her to possess greater insight into her
problems, more adequate social adjustment,
and higher motivation for therapy. Warmer
attitudes toward this patient were also accompanied
by estimates of a more favorable
prognosis with treatment and greater ease of
empathizing with the patient.


TABLE 2

SUMMARY OF STATISTICAL COMPARISONS BETWEEN PATIENTS’ MOTIVATION FOR THERAPY,
THERAPISTS’ ATTITUDE, AND OTHER QUESTIONNAIRE ITEMS
(N = 82)

                                            Male Case                  Female Case
                                                       Therapist                  Therapist
Therapists’ Estimates                   Motivation b   Attitude    Motivation b   Attitude

Diagnosis                               NS a           .02         NS             NS
Ego strength                            .001           .01         .05            NS
Anxiety                                 .05            NS          NS             NS
Insight                                 [illeg.]       [illeg.]    .01            .05
Emotional maturity                      NS             [illeg.]    .01            NS
Social adjustment                       [illeg.]       [illeg.]    [illeg.]       .05
Degree of disturbance                   [illeg.]       NS          .01            NS
Similarity to patients in therapy       [illeg.]       [illeg.]    NS             NS
Kind of treatment                       [illeg.]       [illeg.]    NS             NS
Motivation                              —              [illeg.]    [illeg.]       .001
Chances of acting out                   .05            NS          NS             NS
Frequency of [illeg.]                   [illeg.]       [illeg.]    [illeg.]       NS
Extensiveness of change in structure    [illeg.]       [illeg.]    [illeg.]       NS
Length of treatment                     [illeg.]       [illeg.]    [illeg.]       NS
Permissiveness                          [illeg.]       [illeg.]    [illeg.]       NS
Encourage free association              NS             [illeg.]    [illeg.]       NS
Recommendations to patient              [illeg.]       NS          NS             NS
Change of usual therapy                 [illeg.]       [illeg.]    NS             NS
Prognosis without therapy               [illeg.]       [illeg.]    NS             NS
Prognosis with therapy                  [illeg.]       [illeg.]    .001           .05
Willingness to accept patient in Rx.    [illeg.]       [illeg.]    [illeg.]       [illeg.]
Ease of empathizing with patient        .01            .001        NS             [illeg.]
Environmental stress                    NS             NS          [illeg.]       NS
Therapist’s attitude                    NS                         .001

a NS: not significant at .10 level (chi square, two-tailed tests).
b Patient’s motivation as perceived by therapist.

In addition to the statistically significant 
results already reported, more than 75% of 
the nonsignificant comparisons showed trends 
in the predicted direction. The figures for 
these comparisons are given in Table 2. 


Additional Analyses 


The theoretical possibility that basic di- 
mensions might underlie the observed rela- 
tionships among questionnaire items led to 


TABLE 3 
INTERCORRELATION AMONG SELECTED ITEMS 
(N = 75 to 82) 


Ego Social Alter Accept 
Item Strength Insight Adjustment Motivation Rx. Patient Prognosis Empathy 
Male Case 
Insight 30 
Social adjustment 61 28 
Motivation 32 50 27 
Change of usual 
therapy — 30 —.31 —.06 —.34 
Prognosis with 
therapy 46 32 32 .26 —.51 
Willingness to ac- 
cept patient 34 30 .20 40 — 30 40 
Ease of empa- 
thizing with patient 43 14 .25 22 —.02 22 28 
Therapist’s attitude .52 23 .10 -22 —.29 42 55 A4 
Female Case 
Insight 49 
Social adjustment 37 35 
Motivation 40 65 37 
Change of usual 
therapy —.27 —.15 —.07 = 18 
Prognosis with 
therapy Al 37 .28 55 ~.52 fi A 
Willingness to ac- i 
cept patient 19 39 —.08 46 Sigi 68 I 
Ease of empa- 
thizing with pen 08 35 20 25 sü 32 40 
Therapist’s attitude 30 40 31 59 —.07 AS 54 Ad 




Psychotherapists’ Judgments and Attitudes 321 


an attempt to isolate such factors. Since a 
factor analysis employing the intercorrelation 
of all 27 items seemed impractical, only the 
9 items already found to yield significant 
comparisons were utilized. For each case, 
these 9 items were intercorrelated (product- 
moment coefficients of correlation). A cen- 
troid factor analysis was performed for each 
patient separately, three factors were ex- 
tracted in each analysis, and both factor 
analyses were then rotated to an oblique 
simple structure. 

The similarity of patterns of the inter- 
correlation matrices (see Table 3) for each 
case suggests that certain items are consist- 
ently related to other items and that these 
relationships are rather stable. That is, pa- 
tient differences as represented by the two 
cases in this investigation have little effect 
upon the degree to which various therapist 
estimates and judgments are correlated, and 
the observed association between items can 
be assumed to be mainly a function of thera- 
pist variables. 

The three factors extracted for each patient 
are difficult to interpret, and the findings for 
the two patients often failed to parallel each 
other. However, common to both factor anal- 
yses was the emergence of what might be 
termed a “good patient” factor. For Mr. i 
this factor includes high loadings on items 
which give the following description: the 
therapist is willing to accept a well motivated, 
insightful patient for his usual treatment pro- 
cedure. For Mrs. D. high loadings occur on 
such items as: the therapist perceives the 
patient as having insight, high motivation for 
therapy, and is easy to empathize with; the 
therapist has a warm attitude toward the 
patient and is eager to accept her for treat- 
ment. 


DISCUSSION 


Since this investigation is not based on 
observations of psychotherapists in their 
natural setting, questions may be raised about 
the correlation between their behavior re- 
sponding to questionnaires and their behavior 
in actual psychotherapy when they have the 
opportunity for direct acquaintance with the 
patient. As one of the authors (Strupp, 1958) 
pointed out in a somewhat different context, 


It is not contended that the behavior of the psy- 
chiatrists in an initial interview with a “real” pa- 
tient bears a one-to-one relationship to his per- 
formance in the experiment. . . . What is asserted is 
that the evaluative processes, as they were studied 
under experimental conditions, are sufficiently simi- 
lar to the “normal” behavior of the therapist to 
generate important hypotheses about his behavior 
in a first interview. Hypotheses thus derived are 
valuable if they raise further questions for research, 


and if they can be applied to practice and training 
in psychotherapy (p. 64). 


Asch (1952) comes to a similar conclusion 
in evaluating the generalizability of his ex- 
perimental findings which were based on 
descriptions rather than on observation of 
actual persons: “certain processes occurring 
in response to indirect data about persons not 
present are also at work in reaction to actual 
persons” (p. 219). Nevertheless, the possi- 
bility that concrete relations with persons 
introduce additional variables cannot be dis- 
missed lightly. 

Regarding our findings, it is seen that cer- 
tain therapist estimates pertaining to per- 
ceptions of, and treatment plans for, the pa- 
tient are related to the therapist’s professed 
attitude toward the patient. Estimates of 
diagnosis, ego strength, inclination to modify 
therapy, prognosis without treatment, social 
adjustment, and motivation for therapy ap- 
pear to vary with the therapist’s attitude and 
the patient’s degree of disturbance (it will 
be recalled that the female patient was less 
seriously disturbed than the male patient). 

It should be pointed out that only 10 of 
the 24 multiple-choice items yielded results 
which were consistently statistically signifi- 
cant. At the same time, the fact that these 
results were obtained for two patients adds 
weight to the evidence. Furthermore, the fact 
that the vast majority of statistically non- 
significant comparisons were in the predicted 
direction, coupled with the respondents’ fre- 
quently expressed claim that the written case 
histories did not allow them to use all the 
cues they ordinarily employ, suggests that 
many of the relationships are probably under- 
estimates. 

The direction of causation obviously re- 
mains indeterminate. Thus, it is not clear 
whether the therapist’s attitude “causes” the 
character of his clinical estimates, whether 
the therapist feels more favorably disposed 
toward patients who are better therapeutic 
risks to start with, or whether both possibili- 
ties are more complex functions of other 
variables. It seems clear, however, that in- 
sight, or the capacity for insight, is a key 
variable entering into prognostic estimates, 
the therapist’s eagerness in accepting the pa- 
tient for treatment, and his (self-rated) abil- 
ity to empathize with the patient. Therapists 
tend to profess a more positive attitude to- 
wards a patient who meets these criteria. On 
the other hand, the data suggest that the 
patient’s motivation for therapy influences 
the therapist’s attitude in the predicted direc- 
tion only when the degree of disturbance is 
not too great. In other words, a therapist 
“likes” a patient who has the capacity for 
insight and is eager to be helped provided 
the clinical evidence supplies no contraindi- 
cations. 

These findings show some interesting paral- 
lels to the ingenious experiments of Asch 
(1946) on the formation of impressions of 
personality. Among many other findings, 
Asch reports that “a central trait determines 
the content and the functional place of pe- 
ripheral traits within the entire impression” 
but that central characteristics are “them- 
selves affected by the surrounding character- 
istics” (p. 276). In his studies, the “warm- 
cold” dimension appeared to occupy such a 
central position. In this investigation, moti- 
vation for therapy similarly seems to deter- 
mine a number of clinical estimates as well as 
the therapist’s attitude. (For further evi- 
dence on this point, see the study by Strupp 
& Williams, 1960). Our results also seem 
to corroborate Asch’s conclusions that one 
can see another person only as a unit, that 
qualities enter into a particular relation and 
become organized, and that the system of 
the traits points to a necessary center. Alto- 
gether it seems that application of Asch’s 
technique to the investigation of the forma- 
tion of clinical judgments presents a fascin- 
ating research task. 

Regardless of patient’s motivation for ther- 
apy, however, therapists profess a more posi- 
tive attitude towards patients who are less 
severely disturbed. While this investigation 
has nothing to say about therapeutic out- 
comes, it dovetails with the findings of a 
number of such studies summarized by 
Luborsky (1959): “Those who stay in treat- 
ment improve; those who improve are better 
off to begin with than those who do not; and 
one can predict response to treatment by how 
well they are to begin with” (p. 324). We 
may add, on the basis of our findings, that 
therapists are more enthusiastic about treat- 
ing patients whom they view as having a 
better prognosis—perhaps, because they are 
perceived as having greater capacity for 
profiting from psychotherapeutic procedures 
which rely both on the patient’s intellectual 
as well as emotional understanding. Our first 
hypothesis may thus be recast: A “warm” 
therapist attitude toward a patient is more 
likely to be present when the patient is well 
motivated for therapy, provided the severity 
of his emotional disturbance is not too great. 

The one result which ran counter to the 
prediction showed a negative relationship 
between a favorable therapist attitude and 
his willingness to alter his usual method of 
therapy. In the context of other findings here 
reported this disinclination to alter his usual 
method of therapy seems to reflect the thera- 
pist’s conviction that the patient can profit 
from the kind of psychotherapy the therapist 
employs most often and has the greatest 
faith in and that there is no need to press 
into service other techniques and therapeutic 
adjuncts, the use of which may, at times, 
signify a subtle rejection of the patient as 
a person. 

Estimates of the patient’s insight into his 
problems showed the highest degree of asso- 
ciation with the patient’s motivation for ther- 
apy and the therapist’s attitude. This finding 
supports the notion that estimates of insight- 
fulness are highly descriptive of the good 
(potential) learner as far as psychothera- 
peutic achievement is concerned. It also sub- 
stantiates the belief that psychotherapeutic 
success requires, among other things, the 
patient’s ability to achieve insight and the 
therapist’s ability to foster a warm, under- 
standing, and accepting environment. Fiedler 
(1950) found these factors to be mentioned 
prominently in therapists’ descriptions of the 
ideal therapeutic relationship, and he ob- 
served that agreement on these variables was 
higher among experienced therapists regard- 
less of theoretical orientation than among 
experienced and inexperienced therapists of 
like orientation. 

The demonstrated relationships between 
therapists’ attitudes and their clinical evalua- 
tions and treatment plans in general support 
the second hypothesis. This may also be in- 
terpreted as discrediting the conception of 
an impersonal, mirror-like therapist, a view 
which has already become outmoded. By 
drawing attention to these associations, this 
study supports the increasing emphasis given 
to the effects of personality variables in the 
therapist upon his clinical operations. The 
extent to which these variables play a part 
in the therapist’s intricate work remains to 
be explored much more exhaustively, but it 
is clear that they must come to occupy a 
more central place in selection, training, and 
therapeutic practice. From a methodological 
point of view, this study demonstrates the 
feasibility and potential value of quasiclinical 
experimental designs, which permit the intro- 
duction of controls not otherwise possible. 


SUMMARY 


This investigation deals with an explora- 
tion of two hypotheses positing an association 
between therapists’ attitudes toward patients 
and clinical evaluations and treatment plans 
as a function of the patient’s motivation for 
therapy. Eighty-two respondents, all of whom 
were practicing medical psychotherapists, 
read two case histories of a Danena, 

forms of the case histories were em- 
ployed to vary systematically the patient’s 
motivation for therapy. Respondents recorded 
their clinical impressions, estimates, and atti- 
tudes by means of a 27-item questionnaire 
which followed each case presentation. 

The hypothesized association between ther- 
apists’ attitudes toward the patient and their 
perceptions of, and treatment plans for, the 
patient appears in general to be substantiated. 
It is pointed out that the direction of causa- 
tion remains indeterminate and that the de- 
gree of relationship between the therapist’s 
“usual” performance and his performance in 
the present experimental situation is un- 
known. Nevertheless the evidence is believed 
to support the assertion that personality fac- 
tors of the therapist are an integral part of 
his clinical judgments and therapeutic pro- 
cedures. A better understanding of these rela- 
tionships might make an important contri- 
bution to more effective clinical practice as 
well as aid in the selection of candidates for 
training in psychotherapy. 
OPERANT CONDITIONING OF HOSTILE VERBS IN 
RELATION TO EXPERIMENTER AND SUBJECT 
CHARACTERISTICS * 


DONALD C. FERGUSON AND ARNOLD H. BUSS 
University of Pittsburgh 


Virtually all operant verbal conditioning 
studies have used neutral materials. One of 
the exceptions is an experiment by Buss and 
Durkee (1958), who used a modification of 
Taffel’s procedure (1955). A list of past tense 
verbs was compiled, and judges sorted them 
into three categories: neutral, mildly hostile, 
and intensely hostile. Three verbs, one from 
each category, were placed on a card, and 
Ss were instructed to make up a sentence 
using one of the verbs. There were 60 such 
cards (trials). One group of Ss was reinforced 
for using intensely hostile verbs, and another 
group was reinforced for using neutral words. 
It was found that intensely hostile words were 
learned at a faster rate than neutral words. 
There were also sex differences in frequency 
of using intensely hostile words. 

This study was followed up by Binder, 
McConnell, and Sjoholm (1957), who used 
only the neutral and mildly hostile verbs. 
There were two groups of Ss, both reinforced 
for using hostile verbs. One group was run 
by a large, aggressive man and the other by 
a diminutive, mild, nonassertive woman. The 
group run by the “neutral” woman learned 
significantly better than the group run by 
the aggressive man. 

Evidently characteristics of the E affect 
learning of hostile material. However, it is 
not clear from the Binder et al. (1957) study 
which characteristics are important. One E 
was large, male, and aggressive; the other E 
was small, female, and neutral. It is doubtful 
that size of E is a major variable, which 
leaves aggressiveness and sex. Which of these 
variables retarded learning, aggressiveness or 

1 This report is based on a master’s thesis by the 


senior author under the direction of the junior 
author. 


masculinity? Perhaps both affected learning. 
The Binder et al. (1957) experiment does not 
furnish an answer because aggressiveness and 
gender of the E were confounded. The present 
study investigated these variables separately, 
i.e., a male and a female E each played an 
aggressive or a neutral role with Ss. In addi- 
tion, sex of Ss was a variable in an attempt 
to discover interactions between character- 
istics of the E and gender of the S in the 
operant verbal conditioning of hostile mate- 
rial. 


METHOD 
Experimental Design 


There were four groups of Ss. Two groups were 
run by a man and two groups by a woman. The 
male E was the senior author, and the female E was 
his wife.* Each E played a neutral role with one 
group and an aggressive role with the other. Prior 
experimental rehearsal helped to make enactment of 
each role similar for the two Es. 

The neutral E was patient, calm, and courteous 
in an attempt to establish a nonhostile experimental 
climate. The aggressive E was brusque and un- 
friendly. The S was made to wait if early or on time 
and criticized if late. The E was impatient and 
tended to scowl and sneer. Reports from Ss at the 
termination of the experiment revealed that the two 
roles (neutral or aggressive) were enacted appropri- 
ately. 

Procedure 


The procedure was the same as that of Binder 
et al. (1957). The S was presented with a sample 
card on which was typed the verbs WALKED and 
. . . and the pronouns HE, THEY, SHE. He was 
instructed to make up a sentence using one of the 
verbs and one of the pronouns. Then he was pre- 
sented with a series of 140 similar cards, each card 
constituting a trial. On these cards one verb was 
neutral and one was mildly hostile. 


*The writers wish to thank Elsie Ferguson for 
serving as an E. 




The first 20 trials were used to establish an 
operant level of responding. Starting with Trial 21 
the E said “Good” each time a hostile verb was 
used. This procedure was the same for all Ss. 


Subjects 


The Ss were 80 college students recruited from 
psychology classes. The Ss were randomly assigned 
to the four experimental groups, with the restriction 
that half the Ss in each group were men and half 
were women. 
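
The restricted random assignment described above can be sketched as follows. This is only an illustration, not the authors' actual procedure: the function name, the S labels, and the assumption of 40 men and 40 women divided into four equal groups of 20 are hypothetical, inferred from the stated total of 80 Ss and the half-and-half restriction.

```python
import random


def assign_groups(men, women, groups, seed=None):
    """Randomly assign Ss to groups, balancing sex within every group.

    Each sex is shuffled separately and dealt out in equal shares, so
    every group ends up half men and half women.
    """
    rng = random.Random(seed)
    assignment = {group: [] for group in groups}
    for stratum in (men, women):
        if len(stratum) % len(groups) != 0:
            raise ValueError("each sex must divide evenly among the groups")
        shuffled = list(stratum)
        rng.shuffle(shuffled)
        share = len(shuffled) // len(groups)
        for i, group in enumerate(groups):
            assignment[group].extend(shuffled[i * share:(i + 1) * share])
    return assignment


# The four experimental groups: sex of E crossed with role played by E.
groups = [
    ("male E", "neutral"),
    ("male E", "aggressive"),
    ("female E", "neutral"),
    ("female E", "aggressive"),
]

# Hypothetical S labels (assumed: 40 men, 40 women).
men = [f"M{i:02d}" for i in range(40)]
women = [f"W{i:02d}" for i in range(40)]

assignment = assign_groups(men, women, groups, seed=0)
for group, members in assignment.items():
    print(group, len(members))
```

Shuffling within each sex separately is what enforces the restriction; a single shuffle of all 80 Ss would balance the sexes only on average.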


RESULTS 


An examination of the learning curves re- 
vealed no systematic sex differences. There- 
fore the learning curves of male and female 
Ss were combined. The data bearing on ag- 
gressiveness of the E are presented in Fig. 1. 
The neutral E curve is a composite of the 
data from the male and female Es, and the 
aggressive E curve combines groups run by 
the male and the female Es. The neutral E 
curve is smoother and has a steeper slope 
than the aggressive E curve. The superiority of 
learning with a neutral E is especially evident 
during the last 60 trials (Blocks 5, 6, and 7). 

The data bearing on gender of the E, pre- 
sented in Fig. 2, indicate that the sex of the 
E has little effect on learning. The female E 
curve starts at a slightly higher level than 
the male E curve. However, this difference 
disappears during the second block of trials, 
and thereafter the curves are similar. 

None of the learning curves presented in 
the two figures is smooth, but the irregulari- 
ties do not fall into any pattern. In this re- 
spect they resemble the curves obtained in 
most operant verbal conditioning studies, in 
which uncontrolled variables appear to be 
affecting the conditioning process. In this 
context neither the shape of the curve (which 
reflects random irregularities) nor the overall 
height of the curve is of particular impor- 
tance. Overall height is determined not only 
by increments in learning but also by initial 
level of responding. Therefore, the crucial 
measure of conditioning is the increment in 
learning, i.e., the net increase in frequency 
of hostile verbs above the operant level of 
responding. The score used was: frequency 
for last 20 trials minus frequency for the first 
(operant) 20 trials.* 
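
The score just described can be written out as a small sketch. The function name and the example record are hypothetical; the only parts taken from the text are the 20-trial operant block, the final 20 trials, and the subtraction.

```python
def increment_score(responses, block=20):
    """Net increase in hostile-verb use: last block minus first (operant) block.

    `responses` holds one 0/1 flag per trial (1 = a hostile verb was used).
    """
    if len(responses) < 2 * block:
        raise ValueError("need at least two non-overlapping blocks of trials")
    operant = sum(responses[:block])   # frequency in the first 20 trials
    final = sum(responses[-block:])    # frequency in the last 20 trials
    return final - operant


# Hypothetical record for one S over 140 trials (not data from the study):
# 5 hostile verbs in the operant block, 15 in the final block.
example = [1, 0, 0, 0] * 5 + [0] * 100 + [1, 1, 0, 1] * 5
score = increment_score(example)
print(score)
```

Because the operant level is subtracted out, two Ss with very different starting frequencies can receive the same score, which is the property the authors rely on when they set aside overall curve height.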

An analysis of variance of these scores is 
presented in Table 1. The only significant F 
is for aggressiveness of the E. Sex of the E, 
sex of the S, and interactions among the vari- 
ables were not significant. This analysis of 


3 One of the writers (Buss) has conducted several 
experiments on operant verbal conditioning with 
both neutral and hostile materials. These studies 
have yielded learning curves that vary not only 
from one S to the next but from one sample to the 
next. The lack of smoothness in curves has not 
obliterated the presence of group differences, prob- 
ably because such differences were large, but it 
warns against placing much faith in intertrial varia- 
tions. An analysis of trend, such as that used by 
Binder et al. (1957), takes advantage of variations 
among blocks of trials. Since such intertrial varia- 
tions tend to be unreliable, they probably consist 
mainly of error variance. Therefore, trend analysis 
was not used in testing the significance of the present 
data. Rather, an analysis of variance of the net in- 
crease in response frequency was employed. The 
net increase measure seems best because it is the 
most conservative and the most stringent measure 
of learning, since it does not take advantage of ran- 
dom variations within blocks of trials. 
TABLE 1 


ANALYSIS OF VARIANCE OF DIFFERENCE SCORES FOR 
FREQUENCY OF HOSTILE VERBS 


                                                  Mean 
Source                                      df   Square     F 

Aggressiveness of E                          1    66.61   4.24* 
Sex of E                                     1    30.01   1.91 
Sex of S                                     1     1.01    .06 
Sex of E × Sex of S                          1    10.51    .67 
Sex of E × Aggressiveness of E               1     2.11    .13 
Sex of S × Aggressiveness of E               1     2.11    .13 
Sex of S × Sex of E × Aggressiveness of E    1      .32    .02 
Within groups                               72    15.70 

Total                                       79 

* p < .05. 


variance confirms the trends noted in the 
learning curves. Only aggressiveness of the E 
significantly affects learning, an aggressive 
E tending to retard conditioning. 
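
As a check on the arithmetic of Table 1, each F is the mean square for the effect divided by the within-groups mean square. The sketch below recomputes F for the three main effects from the printed mean squares; the variable and function names are illustrative only, and no significance levels are computed.

```python
MS_WITHIN = 15.70  # within-groups mean square, 72 df (Table 1)

# Printed mean squares and F values for the three main effects in Table 1.
table1 = {
    "Aggressiveness of E": (66.61, 4.24),
    "Sex of E": (30.01, 1.91),
    "Sex of S": (1.01, 0.06),
}


def f_ratio(ms_effect, ms_error=MS_WITHIN):
    """F for a 1-df effect: effect mean square over error mean square."""
    return ms_effect / ms_error


recomputed = {name: round(f_ratio(ms), 2) for name, (ms, _) in table1.items()}

for name, (ms, printed) in table1.items():
    print(f"{name}: F = {recomputed[name]:.2f} (printed {printed:.2f})")
```

The recomputed ratios agree with the printed values to two decimals, which supports the reading that aggressiveness of the E is the only effect of any size.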

Since the curves in Figs. 1 and 2 differ 
in overall height, another score was used: 
total frequency of hostile verbs. An analysis 
of variance of these scores yielded no signifi- 
cant F. Evidently none of the variables influ- 
enced total frequency of hostile verbs. 


DISCUSSION 


Binder et al. (1957) found that an aggres- 
sive male E retarded learning in comparison 
to a neutral female E. The results of the 
present study indicate that the effective vari- 
able is not the sex but the aggressiveness of 
the E. Operant conditioning of hostile verbs 
is slower with an aggressive E than with a 
neutral E. In attempting to explain this find- 
ing two possibilities should be considered. 
First, when the E was aggressive, he may 
have been a less potent reinforcing agent. His 
saying “Good” would be less reinforcing than 
the neutral E’s “Good” and would lead to 
slower learning. 

An alternative explanation is that E’s being 
aggressive angered the S. The S could not 
aggress openly against the E, because of his 
higher status as an E. Therefore, it would be 
necessary to repress or suppress aggressive 
tendencies, and such inhibition would gen- 
eralize to the hostile verbs. The S would then 
tend to inhibit verbalizing the hostile choice, 
and this would retard conditioning. Which of 
these two explanations is correct will be de- 
cided by studies in which other kinds of 
material are conditioned and the E assumes 
nonneutral roles other than aggressiveness. 

Two of the groups in the present study 
may be compared with their counterparts in 
the Binder et al. (1957) study: the group 
run by the neutral female E and the group 
run by aggressive male E. The neutral female 
E curves in the two experiments are similar, 
with the curves of the present study starting 
slightly higher and ending slightly higher; the 
slopes appear almost identical. The aggressive 
male E curves start from the same operant 
level of responding, but the curve of the pres- 
ent study has a steeper slope, ending at a 
higher point. Considering the variability in 
these curves from one block of trials to the 
next, it is doubtful that these differences in 
the aggressive E curves are significant. 

In both studies the slope of the neutral E 
curve was significantly steeper than that of 
the aggressive E curve, i.e., the increment in 
frequency of hostile verbs was significantly 
greater for the Ss run by a neutral E. Con- 
cerning the total frequency of hostile verbs, 
the two experiments differ. Binder et al. 
(1957) found a significant difference between 
the neutral E and aggressive E groups in total 
frequency of hostile verbs, whereas the pres- 
ent investigators did not, although the differ- 
ence was in the same direction. However, as 
noted earlier, total frequency is influenced not 
only by increments in learning but also by 
initial level of responding and by irregulari- 
ties in the learning curves. Examination of 
Fig. 1 reveals that the curve of the aggres- 
sive E started higher and because of irregu- 
larities . . . block of trials. These 
factors probably account for the difference 
in total frequency of hostile verbs not being 
significant in the present study. However, 
since the crucial dependent variable is incre- 
ment in frequency of response, not height of 
the curve, this discrepancy between the 
Binder et al. (1957) study and the present 
one seems unimportant. Such minor differ- 
ences must be expected so long as we are 
dealing with curves that are subject to ran- 
dom irregularities. With respect to the major 
finding, that an aggressive E retards condi- 
tioning of hostile verbs, the two studies are 
in agreement. 

SUMMARY 


This experiment investigated operant con- 
ditioning of hostile verbs in relation to ag- 
gressiveness of the E, sex of the E, and sex 
of the S. The Ss were presented with two 
verbs and three pronouns on a card. One 
verb was hostile and the other neutral. The 
Ss made up a sentence using one verb and 
one pronoun. They were reinforced for using 
hostile verbs. It was found that only the 
aggressiveness of the E significantly affected 
conditioning: an aggressive E retards learning 
in comparison to a neutral E. 
THE EFFECT OF A STRUCTURED INQUIRY ON 
RORSCHACH SCORES * 


MELVIN ZAX AND GEORGE STRICKER 
University of Rochester 


In spite of the fact that the inquiry is an 
integral part of the Rorschach examination 
there is little unanimity among experts con- 
cerning the proper way to conduct it, nor is 
much empirical data available on the effects 
of various procedures. All prominent Rors- 
chach theorists agree that the examiner 
should adopt a nonleading approach when 
conducting the inquiry. When they operation- 
alize their recommendations, however, it is 
apparent that they disagree on what consti- 
tutes “leading the subject.” 

Klopfer and Kelley (1946) advocated pro- 
cedures such as offering alternative concepts 
or comparing a previously given percept to 
a later one in order to establish the presence 
or absence of a specific determinant. In prob- 
ing for movement they also sanctioned ques- 
tioning about the positions of the figures 
seen. As a last resort in ruling on the use 
of color they agreed with Beck (1950) in 
employing such questions as, “Suppose this 
were exactly the same shape as now, but 
gray or black, would you still think it Sieh 

Klopfer, Ainsworth, Klopfer, and Holt 
(1954) cautioned against suggesting any 
particular determinant to subject (S) and, as 
a way of avoiding this, advised sticking closely 
to the terms and language employed by S 
himself. They were also critical of the practice 
of comparing the Rorschach plate with the 
small achromatic designs on location charts 
which was recommended by Piotrowski 
(1957). Klopfer et al. (1954), however, did 
advocate questions about the position of the 
percept and how it was seen when probing 
for movement. 


1 The authors are grateful to Dorothea Brayer, 
Florence Budin, A. Cooper, J. H. Weiss, and D. L. 
Wolitzky who served as examiners in this study. 


Among the most conservative of the Rors- 
chach testers with regard to the inquiry are 
Rapaport, Gill, and Schafer (1946) who 
recommended conducting the inquiry after 
each card and with the blot not present. 
Questioning is kept to an absolute minimum. 
Thus, only what is spontaneously perceived 
is scored, and the record is kept free of ra- 
tionalization and secondary elaboration. Sara- 
son (1954) likewise favored a minimum of 
questioning, and apart from a reading back 
of the response with the opening query, 
“What was it in the card that reminded you 
of that?” only the clarification of any ambi- 
guity in the client’s verbalizations was 
allowed. 

Data on the differential effects of such 
procedures is sparse indeed. Gibby and Stot- 
sky (1953) have provided some information 
on the effects of the inquiry on Rorschach 
scores by comparing the scoring of the free 
association on patients’ records to the scoring 
of the free association plus the inquiry. The 
type of inquiry used was not standardized. 
It was found that the indices could be differ- 
entially ordered for amount of change with 
F ranking first and FV, FC, CF, M, YF, V 
and Y following in that order. Of course F 
decreased while all the other indices increased, 
and the changes for all but V and Y were 
statistically significant. 

Baughman (1958) proposed a new method 
of conducting the inquiry whereby the stand- 
ard inkblots are compared systematically to 
a set of especially prepared blots in which 
color, shading, figure-ground contrast, and 
complexity of form are varied. Ss’ responses 
to a standard series of questions reveal which 
of the determinants have been most influen- 
tial. Contrasting the scoring of records elic- 
ited by this procedure with records in which 
the traditional inquiry was done, Baughman 
found that significantly more shading deter- 
minants were given in response to the paired 
comparison technique. However, there were 
no significant differences in the number of 
scorable color responses elicited by either 
technique. 

In a study closely related to the one re- 
ported herein, Klingensmith (1956) investi- 
gated the effects of the amount of structure 
used in the inquiry on the quantity and type 
of indices reported. The scoring of the free 
association alone represented the least struc- 
tured procedure. Next the scoring of the free 
association plus the traditional inquiry was 
computed. Thirdly, Ss were asked to return 
to the cards and give whatever determinants 
besides shape were influential in their re- 
sponses and a separate scoring was done for 
this procedure. Finally, the determinants 
were described to S and he was told to report 
those he used. Again the records were scored 
using information elicited by this technique. 
When the scoring from the various proce- 
dures was compared, it was found that T, 
achromatic color, and V showed the greatest 
increase with the increase in structure while 
M, FM, C, and Fm showed less tendency 
to increase. The increase in M was insignifi- 
cant at all levels. Unfortunately Klingen- 
smith limited his Ss to two responses per 
card and only analyzed for 15 responses from 
each of the records taken. 

This study is essentially a replication of 
that of Klingensmith with a slight procedural 
modification and with the entire record of 
each S being considered. This is looked upon 
as an exploratory study of the extent and 
type of change in scoring which results from 
a frank description of determinants concern- 
ing which the inquiry is being done. It should 
thereby provide data on the reliability of the 
indices as a function of the procedure by 
which they were elicited. 


METHOD 


Ss for this experiment were secured by five exam- 
iners who each administered 15 tests as part of the 
requirement for credit in a graduate course in Rors- 
chach administration. Before any examiner was per- 
mitted to begin actual administration of the test, 
he must have participated in all of the classroom 
instruction and have been observed and supervised 
in at least two practice examinations. Examiners 
were instructed to use only Ss who were between 
16 and 50 years of age and who had never been 
hospitalized for an emotional disorder. In the sam- 
ple which was collected the age ranged from 16 to 
47 years with the mean age being 25.79 years. The 
mean education level of these Ss was 14.07 years 
(about two years of college) and the majority 
(55%) were students when tested. The sample com- 
prised 30 males and 45 females. 

Procedurally, the test administration was intro- 
duced by the following statement: 


I have a set of ten inkblots that I’d like you 
to look at. They are made simply by dropping ink 
on a sheet of paper and folding the paper so that 
the ink spreads out in a chance way. Different 
people can see different things in these blots and 
I’m interested in what you can see in them. So, 
tell me whatever you see as soon as you see it. 
(After the first card has been returned add: 
From now on when you can’t see any more put 
the card down as a signal that you’ve finished.) 


The free association period was followed by the 
inquiry which was conducted in the manner de- 
scribed by Sarason (1954), since he gives the most 
precise directions and also because his approach 
represents one of the least structured in current use. 
Subsequent to this inquiry, a second inquiry was 
done which was prefaced by this statement:


Inquiry II. The first time we went through the 
blots you may not have been certain just what 
I was getting at. The point is there are a number 
of features of the inkblots which may possibly 
help to determine the many things one can see 
in them. The shape or contour of the blot is ob-
viously one of these, but other possibilities are the
color of the blot, the shading within it, the sug- 
gestion of a three dimensional effect or perspective, 
the appearance of a textural quality to the surface 
and the illusion that the figures seen are in motion 
or assuming some position. To ensure that none of 
these possibilities have been missed in going over 
your responses, I’d like to take one more look at 
them and ask you to make certain that I under- 
stand in each case exactly what it was that caused 
the blot to look as it did to you when you first 
saw it. Let’s go over the cards once more and you 
tell me what about the blot led you to see it the 
way you did, when you first looked at it.


In this second inquiry Ss’ original responses and 
their remarks from the first inquiry were read back 
to them and they were invited, but not encouraged, 
to give any additional determinants which applied. 

For each record a separate scoring was done for
the free association, the free association plus the 
first inquiry, and the free association plus both 
inquiries. Approximately half of the records were 
scored by each of the authors, both of whom have 
had extensive experience with the test. Fifteen rec-
ords, three being taken from each of the examiners,
were scored in common to provide a reliability
check. Scoring of determinants was done according
to the method prescribed by Klopfer et al. (1954) 
except that Beck’s indices for shading were adopted. 


RESULTS 


To insure the reliability of the scoring sys- 
tem between the two examiners a series of 
27 rhos was computed. These compared the 
scoring of the authors on nine indices or com- 
binations of indices elicited, for each of the 
three aspects of the testing situation (free 
association and the two inquiries). The in-
dices studied were M, FM, Fm, FC, CF + C,
FY, YF + Y, FV, and FT + TF + T. Seven-
teen rhos were greater than .90, eight fell be-
tween .80 and .90, and only two fell below 
.80, one being .74 and the lowest being .68 
(for the scoring of FC on the free association 
in which case there were relatively few scor- 
ings made and small differences had a great 
effect on the overall rho). On the basis of 
these figures it was felt that the scoring sys- 
tem was a reliable one. 
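
The interscorer check described above can be illustrated with a short computation. The following is a minimal sketch, assuming that each rho is a Spearman rank correlation between the two scorers' counts of one index over the 15 records scored in common, with tied counts given the mean of their ranks. The counts shown are invented for illustration and are not data from this study; the function names are likewise hypothetical.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman rho computed as the Pearson correlation of the ranks."""
    rx, ry = average_ranks(x), average_ranks(y)
    mx, my = mean(rx), mean(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Hypothetical counts of one index in 15 records, as scored by each author.
scorer_a = [3, 5, 2, 0, 4, 1, 6, 2, 3, 7, 1, 4, 5, 0, 2]
scorer_b = [3, 4, 2, 0, 4, 1, 6, 3, 3, 7, 1, 4, 5, 1, 2]
rho = spearman_rho(scorer_a, scorer_b)
print(round(rho, 2))
```

When an index is scored only rarely, as with FC on the free association, most records tie at zero and a disagreement on one or two records shifts many ranks at once, which is consistent with the low rho reported for that index.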

Since the experiment was concerned with 
the effect of inquiry structure on the elicita- 
tion of determinants, each S was used as his 
own control and the scorings of his free asso- 
ciation, free association plus inquiry, and free 
association plus both inquiries were com- 
pared. To accomplish this, the total number 
of each index given by S in response to the 
entire test procedure was computed. The per- 
centage of this total which was scorable at 
each of the three levels of structure was then 
calculated, and statistical comparisons were
made between these figures. In this case the
null hypothesis was that the levels of inquiry 
structure (the free association period was 
considered to represent the complete absence 
of such structure) had no effect on the tend- 
ency to produce the various determinants. 
By chance it would be expected that Ss would 
either give about as many indices of any type 
on one phase of the test as on others or that 
those who give the most at a single level of 
structure are counterbalanced by others who 
are most responsive at a different level. The 
statistical test used was the Friedman non- 
parametric analysis of variance for repeated 
measurements of the same S as described by 
Siegel (1956). 
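
The analysis just described can be sketched in a few lines. This is an illustrative sketch only, assuming the usual form of the Friedman statistic given by Siegel, χr² = 12 / [Nk(k + 1)] ΣRj² − 3N(k + 1), where N is the number of Ss, k the number of conditions (here the three levels of structure), and Rj the sum of ranks for condition j. No correction for ties is applied. The percentages in the example are invented, and the function names are hypothetical.

```python
import math


def rank_row(row):
    """Average ranks (1 = smallest) within one subject's scores."""
    ranks = []
    for v in row:
        smaller = sum(1 for w in row if w < v)
        equal = sum(1 for w in row if w == v)
        ranks.append(smaller + (equal + 1) / 2)
    return ranks


def friedman_chi_r2(table):
    """Friedman statistic for a table of N subjects by k conditions."""
    n, k = len(table), len(table[0])
    rank_sums = [0.0] * k
    for row in table:
        for j, r in enumerate(rank_row(row)):
            rank_sums[j] += r
    chi = 12.0 / (n * k * (k + 1)) * sum(r * r for r in rank_sums)
    chi -= 3.0 * n * (k + 1)
    return chi, rank_sums


def p_value_df2(chi):
    """Upper-tail chi square probability for exactly 2 degrees of freedom."""
    return math.exp(-chi / 2.0)


# Hypothetical percentages of one index scorable at each level for four Ss:
# (free association, inquiry I, inquiry II).
percentages = [
    [80, 20, 0],
    [70, 25, 5],
    [90, 10, 0],
    [60, 30, 10],
]
chi_r2, rank_sums = friedman_chi_r2(percentages)
print(chi_r2, rank_sums, round(p_value_df2(chi_r2), 4))
```

With two degrees of freedom the chi square tail probability reduces to exp(−χ²/2), so a value of 9.21 corresponds to a probability of about .01. With as few Ss as in this toy example the chi square approximation is rough and exact tables would ordinarily be consulted.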

A series of eight such analyses was done 
for the indices M, FM, Fm, FY, YF +Y, 
FV, FC, and CF + C. No analysis was done 
for FT+TF +T since so few texture re- 
sponses were given. In the case of each deter- 
minant, of course, analyses were restricted to 
those Ss giving them so that the N for each 
index varies somewhat. Table 1 presents the 
mean percentages of the indices given at each 
level of structure, the mean number of re- 
sponses involving these indices at each level, 
and the χr² for each analysis which is refer-
able to the standard chi square table (in this
case at two degrees of freedom). 

The analyses in Table 1 indicate that for 
all of the indices save FV there are highly 
significant differences between the percent-
ages elicited at the various levels of structure.
The findings with regard to the movement


TABLE 1


MEAN PERCENTAGES AND NUMBER OF RESPONSES AT EACH LEVEL OF STRUCTURE
AND RESULTS OF NONPARAMETRIC ANALYSES OF VARIANCE


              Free Association        Inquiry I            Inquiry II
                      Mean No. of           Mean No. of           Mean No. of
Index    N    Mean %   Responses    Mean %   Responses    Mean %   Responses      χr²*

M       72      83       3.35         16       0.65          1       0.05         83.46
FM      73      63       3.65         32       1.88          5       0.28         62.82
Fm      41      50       1.03         48       0.99          2       0.05         24.74
FY      75       4       0.31         58       3.81         38       2.46         63.55
YF+Y    41      13       0.27         60       1.30         27       0.57         15.01
FV      33      29       0.62         48       1.05         23       0.51          2.80
FC      67      12       0.38         71       2.28         17    [illegible]  [illegible]
CF+C    60      28       0.91         62       2.05         10    [illegible]  [illegible]


* For two degrees of freedom χr² of 9.21 is significant at .01.




Structured Inquiry and Rorschach Scores 


TABLE 2 


TOTAL NUMBER OF Ss RESPONDING WITH THE VARIOUS
DETERMINANTS AT EACH LEVEL OF STRUCTURE


                  Free
Index    N    Association    Inquiry I    Inquiry II

M       72        68            27             3
FM      73        67            48            10
Fm      41        28            28             2
FY      75        15            67            57
YF+Y    41         9            31            17
FV      33        13            20            11
FC      67        21            60            23
CF+C    60        33            51            11


categories and the shading determinants, par- 
ticularly FY, are most striking. In the case 
of M only three Ss of the 72 who gave that 
determinant did so in response to the most 
structured level of inquiry, and in each case 
only a single M was added. On the other 
hand 60 of the 75 Ss ultimately giving FY 
failed to give any responses which could be 
scored as such on the free association, while 
57 of the 75 Ss added this determinant in 
response to the instructions indicating which 
indices could be given. A complete breakdown 
of the number of Ss giving each index at each 
level of structure is found in Table 2. 


DISCUSSION 


The results of this study indicate that the 
various determinants (with the exception of 
FV) are characteristically elicited at certain 
times in the Rorschach examination and that 
these times vary with the determinant. Fur-
thermore, they indicate that certain indices
are more sensitive than others to the increase
in inquiry structure. Ss give an average of
2.46 additional FY responses in the most
structured inquiry while giving only 0.05 addi-
tional M responses in the same phase of the
test. Rank ordering the indices in terms of
this tendency to be elicited in the more struc- 
tured inquiry, FY is first followed by YF 
+Y, FV, FC, CF+C, FM, Fm, and M. 
This is in substantial agreement with the 
findings of Gibby and Stotsky (1953) and 
Baughman (1959), who concerned himself 
only with color and shading. This is also in 
close accord with the results of Klingensmith 
(1956) with the exception that FV was not
found to show the greatest increase with the
increase in structure. On the contrary, of all 
the shade-determined indices FV showed the 
least increase on the second inquiry. 

Such findings have clear implications for 
the clinical use of the test. They would sug- 
gest that in the absence of a standardized 
inquiry the FY index is highly unreliable. 
Interpretations based on the number of such 
determinants present in a record must be 
made with considerable caution therefore. On 
the other hand the various movement deter- 
minants seem highly reliable. In spite of the 
fact that what appear to be high percentages 
of YF +Y, FV, FC, and CF +C are given 
on the most highly structured inquiry (see 
Table 1), the increment in terms of average 
number of such responses per S is never more 
than 0.57 which is quite small. Such differ- 
ences are perhaps less than one would have 
been led to expect in the absence of empirical 
data. 

While the findings in this study suggest 
that most scoring indices are reasonably reli- 
able for clinical use, the implication that a 
standardized inquiry procedure is unnecessary 
is not intended. The instability of FY alone 
under varying conditions would argue for a 
more universally accepted way of conducting 
the inquiry. Perhaps one of the obstacles to 
the development of such a procedure is the 
fact that textbook writers have failed to op- 
erationalize the procedure enough. The fear 
that gross distortions might result from a 
frank confrontation of S with the possibilities 
among the various determinants may well 
have been a deterrent to the development of 
a standard set of instructions. As a result a 
variety of subtleties are recommended which 
allow for considerable personal variation in 
approach. 

Baughman (1958) has recognized the need 
for greater standardization of the inquiry and 
has proposed a technique for eliciting color 
and shading responses. Unfortunately, his 
approach requires the use of six alternate sets 
of inkblots and a procedure which appears 
quite time consuming. The use of the kind 
of structuring employed in the second inquiry 
in this study might well accomplish the same 
purpose as his paired-comparison technique, 
and a comparison of the effects of the two
techniques would be one fruitful direction
for further research. 

Another worthwhile investigation would in- 
volve a comparison of the effects of examiner 
differences under conditions where the inquiry 
is an unstructured one and when, by virtue 
of the instructions given, it is highly struc- 
tured. Conceivably, the S who is given few 
cues regarding the object of the inquiry might 
seize on whatever ones are inadvertently of- 
fered by the examiner, while the S who is 
given all of the cues he needs through the 
instructions may remain less sensitive to ex-
aminer idiosyncrasies. This possibility can be
investigated using the data obtained in the 
reported study, and the authors are currently 
doing so. If this hypothesis is substantiated, 
it would lend further support for the estab- 
lishment of a standard inquiry based on a 
frank description of the various possible 
determinants to be found in the blots. 


SUMMARY 


The effects of inquiry structure on the 
determinants elicited were studied by admin- 
istering the standard Rorschach to 75 Ss, 
conducting a relatively unstructured inquiry 
and then a second one prefaced by a descrip-
tion of the determinants. The free association,
free association plus the first inquiry, and free
association plus both inquiries were scored
separately and these scorings compared. It
was found that the various determinants 
characteristically tended to be elicited in dif- 
ferent phases of the test procedure (i.e., most 
movement responses were scorable on the free 
association, while most shading and color re-
sponses were only scorable after an inquiry).
It was also found that of the determinants 
studied only the change in number of FY on 
the highly structured inquiry appeared to 
take on clinical significance. The desirability 
of a standardized inquiry was discussed, and 
the possibility of basing it on a frank descrip- 
tion of the determinants which are the object 
of inquiry was considered.


PREFERENCE TEST


RAYMOND B. CATTELL AND ROBERT E. McMICHAEL


University of Illinois 


The present investigation was undertaken 
with the limited purpose of determining any 
existing differences in the types of musical 
preference expressed by a normal control 
group and various pathologic, psychotic, and
neurotic groups. It is an exploratory piece of 
research in that the measuring instrument 
used is of quite recent vintage. Though it is 
convenient, consisting of a single long-playing 
record which can be responded to on answer 
sheets by a whole collection of patients, and 
although the independence of the factor scales 
has been factor analytically confirmed, very 
little information has yet come forth regard- 
ing its validity or reliability as a clinical 
instrument. 

A search of the literature has shown that 
even though several studies have employed 
measures of musical preference not one has 
directly concerned itself with investigating the 
determination of diagnostic classification of 
psychiatric syndrome groups by musical pref- 
erence. Rather, they have been principally 
concerned with determining possible variables 
influencing or associated with musical prefer- 
ence in normal subjects (Ss). Among others, 
such variables as age (Keston & Pinto, 1955; 
Valentine, 1926), sex (Fay & Middleton, 
1941; Kerr, 1943; Keston & Pinto, 1955; 
Schultz, 1933; Sopchak, 1955), mood (Coroso, 
1958; Podolsky, 1954; Sopchak, 1955), “in-
tellectual introversion” (Keston & Pinto,
1955), and music recognition (education) 
(Keston & Pinto, 1955; Sopchak, 1955), have 
been investigated. The results of such studies 
at best suggest some variables that may 
profitably be controlled when working with 
measures of musical preference. 

In a more dynamic mode, both Cattell 
(1951, 1957) and Sopchak (1957) have pro- 
posed the use of music as a “projective” 


device. The former starts with the premise 
that, “The powerful and immediate connec- 
tion of musical stimulation with emotional 
experience, and the many indications that 
unconscious needs gain satisfaction through 
this medium, have long pointed to measures 
of musical preference as effective avenues to 
deeper aspects of personality” (Cattell & 
Anderson, 1953b, p. 446). Believing that here,
as in his humor studies, psychiatric judgment 
is not sufficient to justify a priori decisions as 
to what musical preferences should form the 
scale content for a particular syndrome, he 
factored the likes and dislikes for 120 brief 
musical excerpts and arrived at 11 independ- 
ent (simple structure) scale dimensions (Cat- 
tell & Saunders, 1954). These cannot be ex- 
plained in terms of historical musical periods 
or schools but suggest dynamic factors of
temperament and emotional need. On this 
research basis a 100-item, 1-hr. test was re-
corded and standardized as described else-
where (Cattell & Anderson, 1953a; Cattell & 
Anderson, 1953b). Subsequently, a correlation 
study was carried out between the music 
factors of the musical preference test and the 
factors on the 16 Personality Factor Ques- 
tionnaire (Cattell & Anderson, 1953b). The 


are substantial, i.e., the test gets into the same 
“the set of 16 PF 
test factors associated with any one music 
factor has a psychologically consistent and
compatible character . . . in every case”
1953b, p. 449),
egories. Such comparisons were suggested by
reasoning that if the musical preferences are 
functions of personality factors then various 
psychotic and neurotic syndrome groups 
should show different music factor scores. 
This reasoning is based on the commonly 
accepted—however, often untested—proposi- 
tion that various syndromes are interpretable 
in terms of extreme patterns of common per- 
sonality factors (as shown, for example, in 
Cattell and Scheier’s recent studies [1960] 
of neurotic syndromes with the 16 PF test). 
Statistically significant results were in fact 
found, and on the basis of this suggestive, 
promising, and partially confirmatory evidence 
of the test’s capacity to discriminate among 
pathologic groups, it was concluded that fur- 
ther investigation, using other samples, was 
desirable. Consequently, in overall design the 
present research is similar to that of Cattell 
and Anderson (1953b). However, because of 
modifications in the diagnostic categories and 
sizes of the pathologic groups, in testing pro- 
cedure and statistical analysis, and in the sex, 
age, and other characteristics of all the sam- 
ples, it is by no means a straightforward 
replication. 


METHOD 
Design 


The IPAT Musical Preference Test, permitting 
scores on 11 distinct dimensions of choice, was ad- 
ministered to several diagnostically different abnor- 
mal groups, in addition to a control or “normal” 
group. Intergroup comparisons, on their mean scores 
on each factor, were made between (a) the various 
pathological groups and the control group, (b) the 
control group and the total abnormal group, and 
(c) among the various pathological groups. The 
design included the additional features that only 
same sex comparisons were made and that any pos- 
sible effects attributable to age were first separated 
out. These comparisons provided the basis for con- 
clusions concerning the discriminating power of the 
music preference test. 


Subjects 


The Ss of this investigation consisted of seven 
groups—organics, sociopathic personalities, schiz- 
ophrenics, paranoids, affectives, psychoneurotics, and 
normals. Table 1 presents general, descriptive, iden- 
tifying data on all the samples used in this investiga-
tion. All of the abnormal Ss were patients in
Manteno State Hospital, Manteno, Illinois. The crite- 
rion used for diagnostic classification was the hospital 
diagnosis as determined in case conferences. A re-
striction imposed on the abnormal samples was that
no severely regressed patients participated in the
study. Every patient was able, at least initially, to 
attend to the music. 

Since it seemed undesirable to take students as a 
control group, as was done in an earlier study, we 
obtained the cooperation of members of a church 
social group of roughly similar social status and age 
range to the mental hospital group. Descriptive data 
on this sample are, however, limited because these 
Ss were not required to list any identifying data 
other than their age and sex. However, educationally, 
they seemed somewhat more homogeneous, though 
not necessarily higher, than the abnormals. 


Materials 


The IPAT Music Preference Test of Personality 
(Cattell & Anderson, 1953a) was the test instrument 
used in the present investigation. The test consists of 
100 musical excerpts which have been put onto a 
12-in., 33⅓ rpm record by RCA Victor. The test was
tape recorded in order to facilitate group testing. 
Good fidelity of reproduction was obtained. A Knight 
tape recorder and an external speaker system were 
used for the playing of the music. 


Testing Procedure 


The normal Ss were tested in one group in a large 
room at the church. The abnormal Ss were tested in 
groups as formed by wards. Therefore, various diag- 
nostic groups were represented at a given testing. 
The taking of the test was strictly a matter of volun- 
tary participation on the part of the patients. There 
were always from 8 to 10 monitors present who 
encouraged and supervised the Ss, showing them 
where to record their choices if they appeared to 
need help. Talking during the test was generally not 


permitted. The total testing time varied but usually 
was about one hour. 


Scoring 


Scoring of the answer sheets was done by using the
answer keys provided in the test materials. These
keys yield total “weighted item” raw scores on each
of the 11 musical factors which together comprise the
music preference test. Once these weighted raw scores
were obtained from every S’s answer sheet, the actual
scoring was completed. These score values are the
data used in the statistical treatments.

Although a total of approximately 300 patients
were present on the wards, only 230 actually partic-
ipated and submitted complete answer sheets. Of
these 230 complete records, 190 were usable. Forty
records had to be discarded for one reason or an-
other, e.g., filling all the blanks out before the test
actually began, copying the responses of one’s neigh-
bor, attending to the music intermittently, and so
forth. Since the diagnostic labels were obtained from
the hospital only for those patients who met the
testing criterion and had usable answer sheets, the
diagnoses of these 40 patients are not known.


Statistical Treatment of Results


The statistical treatment of the results involved 
three steps, each step designed to determine whether 
or not a certain variable or category had an influence 
on scores of music preference. First, an analysis of 
covariance was carried out on each of the 11 music 
factors involved in the test. This analysis was em- 
ployed in order to demonstrate that a classification 
based on diagnosis and sex would show significant 
effects when effects attributable to age were removed. 
Second, on those factors for which the classification 
showed significant effects, a two-way analysis of 
variance (sex by categories) was carried out to 
ascertain whether significant score differences should 
be attributed to sex, diagnosis, and/or interaction. 
Finally, contingent upon the results of the analyses
of variance, t tests were carried out to determine the
location of statistically significant differences among
the diagnostic groups. It was also planned that the
analyses of variance and t tests should be carried out
on scores adjusted for regression on age.
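
The adjustment of scores for regression on age can be illustrated as follows. This is a sketch under stated assumptions, not a reconstruction of the authors' computation: it assumes that a single pooled within-group regression coefficient b of score on age is estimated, and that each score is adjusted to the grand mean age by subtracting b times the S's deviation from that mean. The ages and scores are invented and the function names are hypothetical.

```python
from statistics import mean


def pooled_within_slope(groups):
    """Pooled within-group slope of score on age.

    groups: list of groups, each a list of (age, score) pairs.
    """
    sxy = sxx = 0.0
    for g in groups:
        mean_age = mean(a for a, _ in g)
        mean_score = mean(s for _, s in g)
        sxy += sum((a - mean_age) * (s - mean_score) for a, s in g)
        sxx += sum((a - mean_age) ** 2 for a, _ in g)
    return sxy / sxx


def adjust_for_age(groups):
    """Return scores adjusted to the grand mean age, group by group."""
    b = pooled_within_slope(groups)
    grand_age = mean(a for g in groups for a, _ in g)
    return [[s - b * (a - grand_age) for a, s in g] for g in groups]


# Hypothetical (age, factor score) pairs for two groups.
groups = [
    [(20, 10.0), (30, 12.0), (40, 14.0)],
    [(30, 20.0), (40, 22.0), (50, 24.0)],
]
adjusted = adjust_for_age(groups)
adjusted_means = [mean(g) for g in adjusted]
print(adjusted_means)
```

In this toy example the older group's observed advantage of 10 points shrinks to 8 points once the common age trend is removed, which is the kind of correction the adjusted comparisons are meant to supply.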


RESULTS 


An analysis of covariance was run on each 
of the 11 music factors. Consequently, any 
effect of age on the music preference scores 
was statistically controlled for. Out of the 11 
analyses of covariance, 7 were significant at 
the 10% level of significance agreed to justify 
further analysis.1 Although this was the value 
of significance agreed to warrant further anal- 
ysis, the probabilities associated with all but 
one of the F ratios were at the 5% level or 
less. For these 7 factors on which such a result 
was obtained it may be said that the variance 
among the means of the 14 groups, i.e., 7 
categories by two sexes, was large enough to 
warrant further analysis. On the remaining 4 
factors for which the null hypothesis was not 
rejected, no further analysis was undertaken. 

An analysis of variance was run on each of 
the seven music factors which showed a sig- 
nificant analysis of covariance result. For pur- 


1 Due to limitations of space, the summary tables 
of raw score means and variances (Tables A and B), 
the summary tables of the analyses of covariance and 
variance (Tables C and D, respectively), and the 
summary table of regression coefficients (Table E) 
are not presented in this report, but they have been 
deposited with the American Documentation Insti- 
tute. Order Document No. 6331 from ADI Auxiliary 
Publications Project, Photoduplication Service, Li-
brary of Congress, Washington 25, D. C., remitting
in advance $1.25 for 35 mm. microfilm or $1.25 for
6 × 8 in. photocopies. Make checks payable to:
Chief, Photoduplication Service, Library of Congress.


poses of exploratory analysis, if the F yielded 
by the analysis of variance had a chance prob- 
ability of .10 or less, it was to be considered a 
statistically significant value. Though this was 
the level of statistical significance adopted by 
the authors, the actual F ratios for the main 
effect of diagnosis had associated probabilities 
of .05 or less. 

The seven analyses of variance had in com- 
mon the result of a significant main effect due 
to diagnostic category. The findings showed 
that for a given factor this effect might be the 
only statistically significant one or that it 
occurred in addition to a main sex effect or 
an interaction effect. A main sex effect was 
found on Factors 1, 8, and 11, indicating that 
a consistently higher or lower score is ob- 
tained by a given sex regardless of diagnostic 
group membership. Using observed score 
values, an interaction effect was found on 
Factors 2 and 6, indicating a variable higher 
or lower score by a given sex depending on 
group type. (These interactions drop out, 
however, if adjusted scores are used.) Because 
sex effects were present on these factors, only 
same-sex comparisons were permitted in t
tests of mean comparisons, as stated earlier.
Although sex effects did not appear in the
analysis of variance of Factors 5 and 10, the
next part of the analysis proceeds, for the
sake of clarity and consistency, by cautiously 
assuming that they might. 

The t tests of mean comparisons were then
run in order to determine where sta-
tistically significant differences due to sample
type occur. Except for Factors 1 and 11, the
means used in the comparisons were the ob-
served means of the samples [illegible]
adjustment for age [illegible]
regression coefficients [illegible] indicated that adjusting
[illegible] save on Factors 1
and 11. In addition, no multiple comparison
technique seemed justified since [illegible]
involves a within [illegible]
expression, and [illegible] was not compatible with
the heterogeneity of variance which exists in
some of the comparisons. Therefore, inde-
pendent estimates of the population variance
were used in the calculation of every t ratio
and the probability associated with
each directly calculated (see Walker & Lev,
1953, pp. 157-158). The main consequence


TABLE 2


THE t VALUES OF DIFFERENCES OF CLINICAL GROUPS FROM CONTROLS,
ONE ANOTHER, AND TOTAL ABNORMAL GROUP FOR MALES


Factors 
Significance of Differences 1 2 5 6 8 10 11 
Controls vs. 
Total Abnormals 11 grme —11 —3.3*** 5,108" 2.17* —3_0*** 
Organics 0.7 48"** —18* —18 3.6*** 2.0** —1.6 
Sociopathic Personalities 0.3 27" =ó -1.3 1.9 2.0 —1.0 
Schizophrenics 14 24** 0.6 —2.8*** HO daa 0.5 —2.2** 
Paranoids -14 2.5** —1.0 —4.3** 2.6** 24** —0.3 
Affectives 0.2 0.2 —0.8 1.0 1.3 0.2 —1.0 
Psychoneurotics 4.2 2.4** 0.8 —2.4"" Sgt 2.1% 3.36 
Total Abnormals vs. 
Organics —0.2 2.0** —14 0.8 —0.4 0.1 0.8 
Sociopathic Personalities —0.7 -13 —1.1 2.0 —2.1** 0.6 12 
Schizophrenics 0.9 0.5 1.9* —0.7 1.7 -13 —0.7 
Paranoids —28 0.5 —0.6 3.0** —0.3 L3 0.9 
Affectives —0.5 —24** —0.2 0.9 =0.7 —1.0 0.0 
Psychoneurotics 434" 04 1.5  —0.9 3.6*** 1.0 As 
Organics vs. 
Sociopathic Personalities —0.3 -2g 0.1 0.7 =1.2 0.4 0.4 
Schizophrenics 0.8 —0.3 2.1** —0.9 1.5 -10 ~1.0 
Paranoids =2.1** -—03 0.1 —2.7** 0.0 1.1 0.5 
Affectives —0.3 —2.8** 0.4 0.3 —0.5 —1.0 —0.2 
Psychoneurotics 3.79 —14 1.8 —1.1 3.271 0.9 —2,4** 
Sociopathic Personalities vs. 
Schizophrenics 1.0 0.9 1.9* —1.6 24" -12 -12 
Paranoids —1.5 1.0 0.0 —3.4e** 0.9 0.7 0.2 
Affectives 0.0 =1.6 04  —0.1 0.2 —1.1 —0.5 
Psychoneurotics 3.0"* 04 1.7 -16 3.9% 2.3%" 2.6% 
Schizophrenics vs. 
Paranoids =2:6"* 0.0 -1.4 —1.8 —1,2 1.7 11 
Affectives —0.8 —19 —1.2 1.0 —1.3 —0.2 0.4 
Psychoneurotics 2.6** —0.6 0.3  —0.4 2.0 1.5  —1.7 
Paranoids vs. 
Affectives 1.2 —2.0 0.3 2.6 —04 AS —0.6 
Psychoneurotics 5.1* —0.6 14 11 2.8** —01  —2.4** 
Affectives vs. 
Psychoneurotics 3.08" 17 13 -1.2 2.6** 1400-17 


Note.—A minus value indicates that the group preceding “vs.” has the higher mean. 


* Significant at the 10% level. 
** Significant at the 5% level. 
*** Significant at the 1% level.


of this procedure is in keeping with conserva- 
tive presentation of results since the error in-
volved is one of obtaining fewer significant t
ratios than if a multiple comparison technique 
were employed. 
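
A t ratio based on independent variance estimates can be sketched as follows. The procedure actually followed is the one cited from Walker and Lev (1953) and is not reproduced here; as a stand-in, this sketch uses the Welch form of the ratio with the Satterthwaite approximation to the degrees of freedom, which is an assumption on our part. The sign follows the convention of the tables, in which a minus value means that the group preceding “vs.” has the higher mean. The scores are invented and the function name is hypothetical.

```python
import math
from statistics import mean, variance


def separate_variance_t(first, second):
    """t ratio using each sample's own variance estimate (no pooling).

    Returns (t, df). t is negative when `first`, the group named before
    "vs.", has the higher mean. df is the Satterthwaite approximation.
    """
    v1 = variance(first) / len(first)    # squared standard error, group 1
    v2 = variance(second) / len(second)  # squared standard error, group 2
    t = (mean(second) - mean(first)) / math.sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (
        v1 ** 2 / (len(first) - 1) + v2 ** 2 / (len(second) - 1)
    )
    return t, df


# Hypothetical factor scores for two groups with unequal variances.
controls = [2.0, 4.0, 6.0, 8.0]
patients = [1.0, 2.0, 3.0, 4.0]
t_ratio, df = separate_variance_t(controls, patients)
print(round(t_ratio, 2), round(df, 2))
```

Because the error term is not pooled across all groups, each comparison rests on fewer degrees of freedom than a within-groups estimate would supply, so fewer ratios reach significance. This is the conservative consequence noted in the text.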

In the following presentation of the t test
results it seemed advisable merely to set for-
ward the findings at a descriptive level without
any attempt at deeper psychological interpre-
tation. This approach seems most appropriate
considering the exploratory level and intent of
this investigation. It also seems recommended
by the fact that, although the factors are well
established empirically, their psychological 
interpretation is necessarily tentative. 

Table 2 presents the significance of differ- 
ences of the clinical groups from the controls, 
from one another, and from the total abnor-
mal group, for the males only. Table 3 pre-
sents the same results for the females. No t
value having a probability greater than 5% 
was considered noteworthy except for such an 
occurrence on Music Factors 1 and 5. On 
these factors a probability of 10% was con- 


TABLE 3 


THE t VALUES OF DIFFERENCES OF CLINICAL GROUPS FROM CONTROLS,
ONE ANOTHER, AND TOTAL ABNORMAL GROUP FOR FEMALES 


Factors 


Significance of Differences 1 2 


Controls vs. 


Total Abnormals 0.4 garme 
Organics 0.2 1.6 
Sociopathic Personalities —2.3** 2.6** 
Schizophrenics 0.1 Sat 
Paranoids 1.2 3.3°** 
Affectives 0.1 3.8" 
Psychoneurotics 0.3 15 
Total Abnormals vs. 
Organics —0.1 —1.2 
Sociopathic Personalities —3.5*** 0.0 
Schizophrenics 2.0** —0.6 
Paranoids ia | 0.7 
Affectives —0.4 1.3 
Psychoneurotics 0.0 0.0 
Organics vs. 
Sociopathic Personalities —1.8* 0.7 
Schizophrenics 0.6 0.7 
Paranoids 0.8 0.2 
Affectives —0.1 1.6 
Psychoneurotics 0.1 0.5 
Sociopathic Personalities vs. 
Schizophrenics 3.0** —0.2 
Paranoids ¥ 2.9%* 0.4 
Affectives 2,1° 0.9 
Psychoneurotics 21 0.0 
Schizophrenics vs. 
Paranoids 0.4 0.8 
Affectives —0.9 1.3 
Psychoneurotics =05 0.2 
Paranoids vs. 
Affectives —1.0 0.4 
Psychoneurotics —0.8 —0.3 
Affectives vs. 
Psychoneurotics 0.2 —0.6 


; eee 
=20 ~24 2.9% 58 fs 
-10 204 11 13 03 
p sons i me —3.0%** 
t miry j gt 
0.6 «4.24 gore 03 = 
-03 -24 Qaee TA 08 
21" OA 0.0 0.7 14 
-03 3.5 -15 =l 12° 
02% aif 13 05 234 
27% 6: 19 11 1.9 
10 —13 06 —24% 43 
—0.2 ; 
; 03 01 0.2 17 
1.0 1S wi {Ow 0.2 
2ome 99 05 -03 24+ 
47 07 12 0.4 0.2 
2.6% 1 03 -20 -i7 
10 ~01 =<04 oi 0.6 
07 3.9%" 12 0% = 
25% 41 02 07 i 
0.0 
03 32 15: ada wis 
03° 18 1.0 0.2 04 
23 23H 320 0.7 278 
08 -02 -02 -21 1 
0.0 09 =~06 0.1 2.28 
=0h 19 17 2.3% 21% 
-16 ~08 10  =03 0.4 
—0.6 09 pä 1.0 2.0 


Note.—A minus value indicates that the group preceding “vs.” has the higher mean.


* Significant at the 10% level. 
** Significant at the 5% level.
*** Significant at the 1% level.


sidered statistically significant. This seemed
in keeping with the 10% level of significance 
that was accepted in the analysis of variance 
on Factors 1 and 5. Three levels of signifi- 
cance are reported merely in order that a 
given reader may consider those values which 
suit his particular research requirements, not 
because the authors have three levels of con- 
fidence in the present results. 

Tables 2 and 3 reveal that the control group 
differs significantly from each of the patho- 
logic groups on at least one factor and often 
on five or six. Tables 2 and 3 also present the 
results which indicate the ability of the seven 
music factors to discriminate among the var- 
ious syndrome groups themselves. Examina- 
tion of Table 2 reveals that the male organics 
show a significant difference, when compared 
with the total abnormal group, on Factor 2. 
They are also distinguished from each of the 
various other diagnostic groups by a differ- 
ence on some one factor. The female organics 
are more easily distinguished from the other
groups since they differ most on only one 
factor. Their lower score on Factor 5 dis- 
tinguishes them from every group except the 
neurotics and sociopathic personalities. 

The male sociopathic personality group 
shows no difference from the affectives. However,
they are distinguished from all the remaining
groups by one or more significant differences.
The female sociopathic personalities differ 
from the total abnormals, schizophrenics, and 
affectives by being higher on Factor 6, and 
from the paranoids by being lower on 5. In 
addition, they differ significantly from the 
total abnormal and all the other pathologic 
groups by scoring lower on Factor 1.

The male schizophrenic sample is distin- 
guished from the organics by a higher score 
on 5, and also from the sociopathic personal- 
ities by a higher score on Factors 5 and 8. 
They are also characterized by scoring higher 
on Factor 1 than paranoids, and lower than 
psychoneurotics. The female schizophrenics 
show a greater number of distinguishing com- 
parisons. Possibly Factor 11 will prove to be 
the most discriminatory of this reaction. 

The lower score of the male paranoids on 
Factor 6 distinguishes them from all other 
groups except for the neurotics and schizophrenics.
The psychoneurotics score higher,
while the schizophrenics score lower, on
Factor 1 than the paranoids. They show no
other difference from the schizophrenics, but
have a lower score on Factor 8 than the
neurotics. The female paranoid sample is
principally characterized by being signifi- 
cantly higher on Factor 5 than most of the 
other groups. 

A higher score on either or both Factors 1 
and 8 distinguished the male psychoneurotics 
from the other samples. Moreover, a signifi- 
cantly lower score on Factor 11 is obtained by 
the neurotics except in comparison to the 
schizophrenics and affectives. The female 
psychoneurotics have no consistent significant 
differences from the other clinical groups on 
any given factor. 

The male affective group differs significantly 
from organics, paranoids, and psychoneurotics 
on some factors, as has already been indicated.
The female affective group differs from
the organics by being higher on Factor 5;
from the sociopathic personalities by being 
lower on Factor 6, higher on 1, and from both 
schizophrenics and paranoids by being lower 
on 10. They also differ from paranoids by 
being lower on 11. 

To summarize these differences in terms of 
those which hold for both sexes—and these 
are of most value for the practicing clinician 
—we find: 

1. Abnormals are higher on Factors 2, 8, 
and 10, and lower on 6 (and 11, if one goes 
to the 5% level). Among these, organics are 
higher only on Factors 2 and 8; neurotics on 
8; and schizophrenics on 2, 6, 8, and 11.

2. Sociopaths differ from all other abnor- 
mals in being high on Factor 6. 

3. Schizophrenics differ, notably from or- 
ganics, in being high on Factor 5 and rela- 
tively low on 11. 

4. Neurotics are distinguished from most groups
by being higher on Factor 1 and, less definitely,
on Factor 8.

5. Paranoids are higher on Factor 11, and 
affectives are lower on 11, than most. 

Since the component factor scores on these 
patterns are virtually independent, it will be 
seen that multiple correlations between the 
specific diagnostic category and the scores on 
the component factors in the different patterns 
could well be appreciable, but such finer calculation
must be left until confirmation is
available on other samples. Meanwhile it 
would seem that 7 of the 11 factors in this 
test are important for clinical diagnosis. 


DISCUSSION 


Although, as stated, the Music Preference 
Test factors are at a descriptive rather than 
an interpretive stage, some discussion of their 
associations with abnormality may be profitable.
The descriptions are at present based on
16 PF test correlations (Cattell & Anderson, 
1953b) and for the seven factors involved here 
are as follows: 


F1. Adjustment (vs. Frustrated [illegible]),
which correlates negatively with ergic tension.

F2. Parasympathetic Immunity vs. Sensitivity to
Threat.

F5. Introspectiveness vs. Social Contact, having a
substantial correlation with I factor (see [illegible],
1957).

F6. Anxiety vs. Paranoid Imperviousness, correlating
with O factor (guilt proneness).

F8. Resilience vs. Withdrawn Schizothymia (H 
factor). 

F10. Lack of Drive. 

F11. Tenacity vs. Cyclothyme Relaxedness. 


Some “face validity” is given to these de- 
scriptions from the normal range, when we 
move to the present abnormal sample, in sev- 
eral respects. First, paranoids are higher on 
“paranoid imperviousness,” and schizophrenics 
differ from other abnormals in being high on 
Factor 5, introspectiveness. However, abnor- 
mals generally are marked by toughness, i.e., 
parasympathetic predominance and resilience, 
plus lack of drive and lack of guilt proneness. 
The neurotics score more “adjusted,” Factor
1, than the psychotics, and more resilient.
Affectives show more “cyclothyme relaxedness,”
Factor 11(-).

The last relation was also found in the 
Cattell & Anderson study (1953b), as well as 
the higher scores of schizophrenics on intro- 
spectiveness; but, except for schizophrenics 
and affectives the studies are not comparable 
in their groups. Although this much consistency
exists, there are several other instances
(such as the high score of sociopaths on anxiety,
Factor 6, relative to other abnormal
groups) which are relatively inapt and suggest
that we are only just beginning to grope
our way to the true meaning of these music 
preference factors. Some similar oddities of 
label have been noted by Coroso (1958). This 
is not surprising considering how little has 
yet been done to relate the factors to other 
landmarks in personality measurement. De- 
spite our inability to interpret, however, the 
fact remains that the test has on both occa- 
sions of experiment shown highly significant 
differences of abnormals from controls and 
among abnormal, psychiatric syndrome groups. 

Age as a possible influencing variable in the 
present study with abnormals was shown to 
have no appreciable relation to music prefer- 
ence. This seems to be in agreement with stud- 
ies which reported the same finding, although 
they used a different type of measurement and 
normal Ss (Keston & Pinto, 1955; Valentine, 
1926). Theoretically, one might argue that 
such results are confirmatory evidence for 
music preferences being considered more a 
function of the temperament of the perceiver 
than a function of experience, transient needs, 
familiarity, and musical training, which vary 
with age. Perhaps temperament largely deter- 
mines the music one likes and becomes fa- 
miliar with. 

Questions concerning the influence of such
variables as race and marital status on the
music preference factor scores are difficult to
answer from the present samples. Insofar as

test. Previous researches in the area had
shown age and/or sex might influence music 
preference. Consequently their possible influ- 
ence was examined in the present study. 

The results showed that age is not signifi- 
cantly related to music preference but that 
there are sex differences associated with music
preference. Subsequently all music preference
comparisons were made within the same sex.
Seven out of the original 11 test factors 
yielded statistically significant t tests of comparisons.
Several of the music factor scores
obtained by various groups were considered in
terms of their psychological meaning, as deduced
from personality test correlations in
normals, with the result that some designa- 
tions seem appropriate and others in need 
of revision. 

It was concluded that the Music Preference 
Test of IPAT holds promise as a diagnostic 
instrument, as indicated by the marked score 
differences between various groups. The con- 
clusion was also made that the test is best 
employed by treating results for males and 
females separately. Because of limitations of 
the present investigation it is believed that 
further studies should be carried out in order 
to determine whether or not the relations identified
in this research are replicable.

Wirt and Broen (1956) have questioned
whether any relationship exists between anx- 
iety as measured by the Children’s Manifest 
Anxiety Scale (CMAS) (Castaneda, McCand- 
less, & Palermo, 1956; McCandless & Casta- 
neda, 1956) and the concept of anxiety as 
used in the clinic. Such a lack of relationship 
may be confounded by sex differences in de- 
fending against anxiety. Although boys and 
girls may obtain similar scores on tests like 
the CMAS, they may use different defenses 
in coping with anxiety. Such a viewpoint 
assumes that anxiety is a variable, with devia- 
tions in behavior viewed as defenses against 
it. If the assumption is correct, as reviewers 
of emotional sex differences in children lead 
one to believe (Kuhlen, 1957, p. 274; Ter- 
man & Tyler, 1954, p. 1075), one would pre- 
dict that different relationships would hold 
between the CMAS and various measures of 
adjustment in boys and girls. 

Psychoanalytic writers (Bonaparte, 1953;
Freud, 1946; Greenacre, 1948; Silverberg,
1952) have broadly suggested that defenses
against anxiety may be divided into two
major categories that are pertinent to the
assumption underlying the present study,
namely: girls handle anxiety through auto- 
plastic defenses, while boys handle it through 
alloplastic defenses. One would predict that 
although boys and girls may report the same 


1 An earlier version of this paper was read at APA
Convention, Cincinnati, Sept., 1959. The author acknowledges
the cooperation of Pitt County Health
Dept., and of the Board of Education, Greenville,
N. C. The data of this investigation were collected
by Laura Ciporin through the cooperation of Edna
Baker, who also collected the teachers’ ratings. This
report was written during the author’s tenure as
USPHS postdoctoral fellow at the Institute for Psychosomatic
and Psychiatric Research and Training of
Michael Reese Hospital, Chicago, Illinois.

amount of anxiety on a paper and pencil test
like the CMAS, the relationships between 
their scores on such a test and various meas- 
ures of adjustment may vary considerably. 
Measures that could be divided rationally into 
autoplastic and alloplastic defenses should 
also differentiate between boys and girls. Al- 
though maladjustment as a total measure may 
represent the combination of autoplastic and 
alloplastic defenses, girls should show a 
greater degree of relationship between man- 
ifest anxiety scores and autoplastic defenses 
such as daydreaming and feelings of inferior- 
ity, while boys should show a greater degree 
of relationship between manifest anxiety
scores and alloplastic defenses such as rebelliousness
and [...]

[...] investigation, therefore,
[...] to study whether
sex differences exist in the [...]
as to specify the nature of [...]
relating the CMAS [...]
Personality Adjustment [...]
Rogers’ Rating Scale of Adjustment
([...] 1931b). [...]

[...] girls in the fourth to [...]
old) were administered [...]

Mean Otis Self-Scoring IQ for boys [...]
= 8.22, while the mean Otis Self-Scoring
IQ for girls was 112.21, SD = 9.33. The
background of the sample was strictly rural, lower 
to middle class economically. Their school was lo- 
cated in a town of 3,000 in eastern North Carolina. 

To determine whether our North Carolina sample 
was similar to the original Iowa City sample in terms
of manifest anxiety level and the number of lies 
(the score for lies is derived from 11 questions that 
buffer the CMAS), the means of our fourth, fifth, 
and sixth grade boys and girls were compared to 
Levitt’s (1957) data. He compared the original rural 
sample to an urban (Chicago) sample, finding a 
much lower mean level of manifest anxiety in urban 
than in rural children (Table 1). Except for a significantly
greater Lie score in boys, our North Carolina
sample did not differ significantly from the
original Iowa City sample in terms of anxiety level. 


TABLE 1

ECOLOGICAL DIFFERENCES ON THE CHILDREN’S MANIFEST ANXIETY SCALE
BETWEEN NORTH CAROLINA, IOWA CITY, AND CHICAGO SAMPLES FOR FOURTH,
FIFTH, AND SIXTH GRADE BOYS AND GIRLS

                          Boys                    Girls
                    Mean    SD     N        Mean    SD     N
Anxiety
  North Carolina    [illegible]             [illegible]
  Iowa City         [illegible]             [illegible]
  Chicago           [illegible]             12.87   8.23   161
Lies
  North Carolina    3.23    2.08   29       3.29    2.41   27
  Iowa City         2.02    1.85   187      2.52    2.11   174
  Chicago           3.92    2.42   161      4.29    2.30   161

Significance of Differences between Means

                                      Boys            Girls
                                    t      p        t      p
Anxiety
  North Carolina vs. Iowa City      1.10   ns       .02    ns
  North Carolina vs. Chicago        2.69   .01      [illegible]  .01
Lies
  North Carolina vs. Iowa City      3.54   .001     1.71   ns
  North Carolina vs. Chicago        [illegible]


Furthermore, our sample differed from the Chicago
sample to the same extent as the Iowa City sample
did (Table 1).

In view of the above, conclusions and generaliza- 
tions based on our sample should apply only to 
comparable samples of rural children. Generalizations 
about the significance of our findings to urban chil- 
dren should be made with caution, because of the 
significant differences in anxiety level existing be- 
tween urban and rural children. Although the ecolog- 
ical comparison was made with children between the 
fourth and sixth grades, it should be noted that our 
sample also included children from seventh and 
eighth grades, grades not included in the original 
standardization of the CMAS. The two higher grades 
were included because of the applicability of Rogers’ 
Test to a wider age range than the CMAS. 


Description of Criterion Measures 


The Rogers’ Test of Personality Adjustment is a 
paper and pencil questionnaire filled out by the child 
himself. It yields separate scores for personal infe- 
riority, social maladjustment, familial maladjustment, 
and daydreaming. The four scores and their sum are 
so directed that the higher the numerical score, the 
greater the degree of maladjustment. Besides Rogers’ 
original validation (1931a, 1931b), a recent study by 
Smith (1957) showed that the test differentiated 
satisfactorily among three groups of well adjusted, 
adjusted, and maladjusted boys in sixth grade. The 
means of our 49 boys on the Rogers’ Test compare 
well by inspection with Smith’s sample (Smith, 1957, 
Table 22). The sole exception may be the daydream- 
ing score, which appears to be slightly higher in 
Smith’s all urban sample. 

The scores achieved on the Rogers’ Test by our 
public school sample were compared to those of 38 
children referred to a mental health clinic for behavior
problems. The clinic children were of the same
age span, intelligence, and socioeconomic and geographic
background as the normal school children.
There were more clinic boys (34) than clinic girls
(4), the latter being an admittedly small sample. The
mean scores of clinic children on Rogers’ Test were
significantly greater on all four subscales than the
public school children, except for family maladjustment
scores for boys and social maladjustment scores
for girls (Table 2). There were no statistically sig- 
nificant differences between the means of clinic boys 
and clinic girls. Either in differentiating a patholog- 
ical sample of children from normal children, or in 
comparing our sample with a normal sample of sixth 
grade boys, Rogers’ Test of Personality Adjustment 
appears to possess sufficient validity to be used as an 
internal criterion measure for the CMAS. 

As an external criterion measure, each child was 
rated by two different teachers on Rogers’ Rating 
Scale of Adjustment (1931b). All ratings are so di- 
rected that a high score indicates a greater degree of 
maladjustment. For instance, a score of 1 on the 
scale of independence means “very independent, anx- 
ious to grow up,” while a score of 5, which is the 
higher limit, means “completely dependent, wishes to


TABLE 2

SIGNIFICANCE OF DIFFERENCES BETWEEN MEANS OF CLINIC AND SCHOOL CHILDREN
ON ROGERS’ TEST OF PERSONALITY ADJUSTMENT

                               Clinic Boys    School Boys                   Clinic Girls   School Girls
                                (N = 34)       (N = 49)                       (N = 4)        (N = 47)

                               Mean    SD     Mean    SD      t      p      Mean    SD     Mean    SD      U      p
Personal Inferiority          14.35  4.46     9.79  3.54   5.19   .001     17.25  5.54    10.72  4.63   2.05    .02
Social Maladjustment          17.44  5.01    14.59  4.63   2.64    .05     17.25  4.15    15.81  4.15    .50     ns
Family Maladjustment           9.12  3.50     7.77  3.40   1.75     ns     11.75  6.42     6.13  3.66   1.56    .06
Daydreaming                    2.94  2.22     1.73  1.93   2.62    .05      4.75  2.85     2.79  2.18   1.54    .06
Total Personality Adjustment  43.85  8.09    34.00  6.52   6.08   .001     51.00  8.51    35.94  7.35   2.68   .005


remain infantile.” A score of 1 on daydreaming means 
no daydreaming at all, while a score of 4 means 
“extreme daydreamer, of fantastic types.” Unfortu- 
nately the same two teachers did not evaluate all the 
children so that commonly accepted measures of re- 
liability like Pearson’s r could not be applied. 
Originally the Rating Scale had been constructed 
to be used by child guidance clinic workers. A few 
changes were made to adapt it for school teachers so 
that the teacher could rate a child according to what 
the child did in school in relation to teachers and 
peers rather than what he did at home in relation 
to parents and siblings. The total adjustment rating 
comprises seven subscales: daydreaming, rebellious- 
ness, inferiority, affection, independence, conflict over 
sex matters, and the child’s general emotional adjust- 
ment. The last two subscales were not considered 
individually because many teachers, all women, failed 
to rate children on conflict over sex matters and 
because the individual subrating of emotional adjust- 
ment would have added little information by itself. 
Each subscale consisted of one question with a choice 
of four or five statements of increasing degree of 
maladjustment. The inferiority ratings, however, are 
a composite of two questions, hence the greater nu- 
merical value for this subscale (Table 4). All of the 


TABLE 3


RELIABILITY FOR TEACHERS’ RATINGS OF SCHOOL
CHILDREN ON ROGERS’ ADJUSTMENT RATING SCALE
(Means and SDs in Table 4)


                         Boys             Girls
                       (N = 49)         (N = 47)
                       F      p         F      p
Total Adjustment     2.88   .005      4.13   .001
Daydreaming          1.76   .05        .84   ns
Rebelliousness       3.38   .001      3.44   .001
Inferiority           .83   ns        2.64   .005
Affection            2.66   .005      1.23   ns
Independence         2.72   .005      2.79   .005


mean ratings reported are the sum of the ratings of 
both teachers for each subscale. 

With two different teachers rating each child, the 
reliability of their ratings could only be estimated 
(Ebel’s method, 1951). The teachers’ ratings were 
on the whole surprisingly reliable (Table 3). With 
exceptions of ratings of inferiority in boys and day- 
dreaming in girls, the high level of statistical signifi- 
cance reached by the ratings insures their usefulness 
as an external criterion for the CMAS. 
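
As a rough illustration of how an F ratio of the kind listed in Table 3 relates to a reliability coefficient, the sketch below applies the intraclass formulas commonly associated with Ebel (1951) for a one-way analysis of variance in which children are the grouping factor. It assumes k = 2 raters per child and that between-rater variance is left in the error term; the function names are illustrative, and this is not a reconstruction of the author's actual computation.

```python
def reliability_of_single_rating(f_ratio: float, k: int) -> float:
    """Intraclass reliability of one rater's rating: (F - 1) / (F + k - 1)."""
    return (f_ratio - 1.0) / (f_ratio + k - 1.0)


def reliability_of_summed_ratings(f_ratio: float) -> float:
    """Intraclass reliability of the sum (or mean) of the k ratings: (F - 1) / F."""
    return (f_ratio - 1.0) / f_ratio


# F for boys' Total Adjustment ratings as listed in Table 3; k = 2 teachers per child.
f_boys_total = 2.88
single = reliability_of_single_rating(f_boys_total, k=2)
summed = reliability_of_summed_ratings(f_boys_total)
print(round(single, 2), round(summed, 2))
```

Under these assumptions an F of 2.88 corresponds to a reliability of roughly .48 for a single teacher's rating and roughly .65 for the summed ratings of both teachers.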


RESULTS AND DISCUSSION


Although the mean IQ difference between
boys and girls was statistically significant
(t = 3.32, p < .01), no significant correlation
between manifest anxiety level and intelligence
for either sex was found (r = .00 in
both sexes). There were no differences between
the means of boys and girls on anxiety
level and on the lie score of the CMAS or on
personal inferiority, social maladjustment, and
total personality maladjustment as measured
by Rogers’ Test. The mean of family
maladjustment was slightly higher in boys
than in girls, while girls were slightly higher
than boys on daydreaming. According to their
ratings, teachers viewed boys as being consistently
more maladjusted than girls in all
aspects of Rogers’ Scale (Table 4).

Highly significant correlations between subtests
and the total score of Rogers’ Test as
well as between teachers’ subratings and the
total ratings of maladjustment in both sexes
were found. The correlation between the total
score for personality maladjustment on Rogers’
Test and the total sum of ratings on
Rogers’ Scale was statistically significant for
boys and nearly so for girls.


TABLE 4

SIGNIFICANCE OF DIFFERENCES BETWEEN MEANS FOR BOYS AND GIRLS ON THE CHILDREN’S
MANIFEST ANXIETY SCALE, ROGERS’ TEST, AND ROGERS’ SCALE

                               Boys (N = 49)    Girls (N = 47)

                               Mean     SD      Mean     SD       t      p
Anxiety                       17.45   5.86     18.38   6.77     .72     ns
Lies                           3.12   1.92      2.74   2.42     .85     ns
Personal Inferiority           9.79   3.54     10.72   4.63    1.10     ns
Social Maladjustment          14.59   4.63     15.81   4.15    1.36     ns
Family Maladjustment           7.77   3.40      6.13   3.66    2.27    .05
Daydreaming                    1.73   1.93      2.79   2.18    2.52    .05
Total Personality Adjustment  34.00   6.52     35.94   7.35    1.37     ns

Total Adjustment Rating       30.92   6.42     25.42   6.09    4.29   .001
Daydreaming Rating             3.31   1.04      2.70    .83    3.18    .01
Rebelliousness Rating          4.43    .91      3.79    .58    4.13    .01
Inferiority Rating             7.73   2.82      6.25   2.12    2.88    .01
Affection Rating               3.47   1.19      2.83    .87    3.00   .001
Independence Rating            3.94   1.19      3.21   1.20    2.98    .01
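
The t values in Table 4 can be checked approximately from the tabled means, SDs, and Ns. The following is a minimal sketch that assumes an independent-samples t test with pooled variance; the source does not state which form of the test was used, and the function name is illustrative.

```python
import math


def t_from_summary(mean1, sd1, n1, mean2, sd2, n2):
    """Independent-samples t with pooled variance, from means, SDs, and Ns."""
    pooled_var = ((n1 - 1) * sd1 ** 2 + (n2 - 1) * sd2 ** 2) / (n1 + n2 - 2)
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean1 - mean2) / standard_error


# Lie scores from Table 4: boys 3.12 (SD 1.92, N 49), girls 2.74 (SD 2.42, N 47).
t_lies = t_from_summary(3.12, 1.92, 49, 2.74, 2.42, 47)
print(round(t_lies, 2))
```

Under the pooled-variance assumption this yields a value of about .85 for the Lie scores, in line with the tabled entry.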


Correlations between anxiety and all other 
scores and ratings for boys were low, statis- 
tically nonsignificant, and consistently nega- 
tive. Boys who reported more feelings of 
inferiority on Rogers’ Test were also rated as 
being more maladjusted, more dependent, and 
as possessing more feelings of inferiority than 
boys not reporting the same feelings. Those 
boys who reported greater family maladjust- 
ment were also rated as being more malad- 
justed, as daydreaming more frequently, as 
possessing more inferiority feelings, and as 
lacking in affection. Boys who reported more 
overall maladjustment were also rated as feel- 
ing inferior and as lacking in affection. The 
two latter ratings correlated also with ratings 
of rebelliousness (Table 5). 

In girls two positive and significant correla- 
tions were found between anxiety and the 
total personality maladjustment score as 
measured by Rogers’ Test as well as between 
anxiety and teachers’ ratings of independence. 
The more anxious and maladjusted the girl, 
the greater was the probability of her being 
rated as being more dependent. Girls who re- 
ported greater family maladjustment were 
also rated as being more rebellious. Girls who 
reported greater personality maladjustments 
were rated as being more dependent. Self- 
reports of daydreaming correlated with rat- 
ings of greater rebelliousness, more intense 
feelings of inferiority, and more dependence.
The latter ratings correlated with ratings of
lack of affection and of greater feelings of 
inferiority (Table 6). 

Although not all of the sex differences found 
were in the predicted direction, especially for 
boys, three additional analyses of the results 
were made to specify further the nature of 
such differences. In the first place, if manifest 
anxiety is related more to autoplastic rather 
than to alloplastic behavior, then girls should 
show a greater number of significant correla- 
tions between the measures of adjustment 
used. Girls do differ from boys on several 
correlations. A highly significant difference 
was found between correlations of anxiety and 
the total score on Rogers’ Test (z = 2.94,
p < .01). Girls showed significantly higher
correlations than boys between anxiety and
daydreaming scores (z = 2.16, p < .05); between
ratings of maladjustment and of independence
(z = 2.27, p < .05); and between
ratings of inferiority and of independence
(z = 2.05, p < .05). The only significant difference
between correlations in which boys
exhibited a higher correlation than girls was
between ratings of rebelliousness and of lack
of affection (z = 2.25, p < .05). The differences
between correlations are in the expected
direction, but could also be attributed to the
teachers rating boys as being more maladjusted
than girls.
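
Differences between two independent correlations are conventionally tested with Fisher's r-to-z transformation, which is presumably the source of the z values above, although the text does not name the procedure. The sketch below shows that standard test; the two correlations are hypothetical and are not taken from the study's tables, and only the sample sizes (47 girls, 49 boys) come from the text.

```python
import math


def z_for_correlation_difference(r1, n1, r2, n2):
    """Fisher r-to-z test for the difference between two independent correlations."""
    z1 = math.atanh(r1)  # Fisher transformation of the first correlation
    z2 = math.atanh(r2)  # Fisher transformation of the second correlation
    standard_error = math.sqrt(1 / (n1 - 3) + 1 / (n2 - 3))
    return (z1 - z2) / standard_error


# Hypothetical correlations (not from the study): girls r = .40, boys r = -.10.
z = z_for_correlation_difference(0.40, 47, -0.10, 49)
print(round(z, 2))
```

A z beyond 1.96 in absolute value corresponds to a two-tailed p below .05, and a z beyond 2.58 to a p below .01.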

In the second place, if girls do tend to use


TABLE 7

ITEMS OF THE CHILDREN’S MANIFEST ANXIETY SCALE CHECKED POSITIVELY
MORE OFTEN BY EITHER SEX

Description of Item                                            χ²      p

Girls More than Boys

11. I am secretly afraid of a lot of things.                   4.13    .05
[illegible]. My feelings get hurt easily.                      9.60    .001
32. It is hard for me to go to sleep at night.                 4.38    .05
39. I am afraid of the dark.                                  11.98    .001

Boys More than Girls

 8. I wish I could be very far away from here.                 5.01    .025
10. (Lie Scale) I would rather win than lose in a game.       10.09    .001
41. (Lie Scale) I never get angry.                             4.00    .05

autoplastic behavior more than boys, they 
should show it in the way they report about 
themselves. Indeed, an item analysis of the 
CMAS (Table 7) indicates that girls checked 
more often those items that show a greater 
tendency to admit feelings that boys would 
be leery of reporting, as Sarason and others
have recently suggested (Sarnoff, Lighthall, 
Waite, Davidson, & Sarason, 1958). 

In the third place, additional developmental 
trends may account for some of the sex differences.
Boys showed variations in personal
inferiority scores (F = 2.59, p < .05), with an
increase up to sixth grade and a subsequent
decline. Also among boys, daydreaming scores
decreased significantly with age (F = 3.16,
p < .025), reaching a mean of 1.00 in eighth
grade. The decrease with age in daydreaming
scores among girls was less pronounced (F
= 2.44, p < .10). Variations in rebelliousness
and daydreaming ratings with no clear declines
or increases with age were also present
among boys (F = 3.04, p < .05 and F
= 4.25, p < .01, respectively). Girls showed
variations in total adjustment ratings (F 
= 2.64, p< .05), daydreaming ratings (F 
= 7.21, p< .001), and ratings of affection 
(F = 3.32, p < .01), with no clear declines or 
increases with age. No other developmental 
trends were statistically significant in either 
sex. 

In conclusion, although mean anxiety scores
may be the same for boys and girls, each
sex seems to cope with anxiety with different
defenses. Perhaps the construction of
separate anxiety scales for boys and girls 
should be considered. 


SUMMARY 


The present study evaluates some aspects 
of sex differences between boys and girls in 
their handling of anxiety. Although children 
may obtain equal scores on a paper and pencil 
measure of anxiety like the Children’s Man- 
ifest Anxiety Scale, the relationships between 
manifest anxiety scores and various measures 
of adjustment may be different in boys and 
girls. It was expected that girls would show 
positive correlations between manifest anxiety 
level and measures of autoplastic adjustment,
like daydreaming and feelings of inferiority. 
Boys were expected to show positive correla- 
tions between manifest anxiety level and 
alloplastic defenses, like rebelliousness and 
greater “nuisance value” than girls. To inves- 
tigate whether any sex differences existed ac- 
cording to these expectations, the Children’s 
Manifest Anxiety Scale was correlated with 
Rogers’ Test of Personality Adjustment and 
Rogers’ Rating Scale of Adjustment. 

Boys were consistently rated by their teach- 
ers as being more maladjusted than girls, but 
no positive correlation between manifest anxiety
scores and various measures of adjustment
used was found for boys. Positive and
statistically significant correlations between
manifest anxiety level and Rogers’ total score
of personality adjustment and ratings of independence
were found for girls. Additionally,
sex differences between correlations, in
an item analysis of the Children’s Manifest 
Anxiety Scale, and in developmental trends 
within each measure of adjustment were presented
and discussed briefly.

It has long been recognized that personality
test scores are influenced by non-test-relevant 
response determinants. Wiggins and Rumrill 
(1959) distinguish three approaches to this 
problem. Briefly, interest in the problem of 
response distortion has been concerned with 
attempts at statistical correction for “faking 
good” or “faking bad” (Meehl & Hathaway, 
1946), the analysis of response sets (Cron- 
bach, 1946, 1950), and ratings of the social de- 
sirability of personality test items (Edwards, 
1957). A further distinction can be made, how- 
ever, which results in a somewhat different 
division of approaches to the question of re- 
sponse distortion. Common to both the Meehl 
and Hathaway corrections for faking good and 
faking bad and Cronbach’s notion of response 
sets is an interest in the test behavior of the 
subject (S). By social desirability, on the other 
hand, Edwards primarily means the “scale 
value for any personality statement such that 
the scale value indicates the position of the 
statement on the social desirability con- 
tinuum . . .” (1957, p. 3). Social desirability, 
thus, has been used to refer to a characteristic 
of test items, i.e., their scale position on a 
social desirability scale. 

Whether the test behavior of Ss or the 
social desirability properties of items are the 
focus of interest, however, it now seems clear 
that underlying both these approaches is the 
concept of statistical deviance. In the con- 
struction of the MMPI K scale, for example, 
items were selected which differentiated be- 
tween clinically normal persons producing 
abnormal test profiles and clinically abnormal 
individuals with abnormal test profiles, and 
between clinically abnormal persons with normal
test profiles and abnormal Ss whose
test records were abnormal. Keyed responses 
to the K scale items tend to be statistically 
deviant in the parent populations. Similarly, 
the development of the Edwards Social De- 
sirability Scale (SDS) illustrates this proce- 
dure. Items were drawn from various MMPI 
scales (F, L, K, and the Manifest Anxiety 
Scale [Taylor, 1953]) and submitted to 
judges who categorized them as either socially 
desirable or socially undesirable. Only items 
on which there was unanimous agreement 
among the 10 judges were included in the 
SDS. It seems clear that the items in Edwards 
SDS would, of necessity, have extreme social 
desirability scale positions or, in other words, 
be statistically deviant. 
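
The unanimous-agreement rule described above can be sketched as a simple filter. This is an illustrative example only: the ratings are hypothetical, the function name is an assumption, and nothing here reproduces Edwards' actual item pool or judges' data.

```python
from typing import Dict, List


def unanimous_items(ratings: Dict[int, List[bool]]) -> Dict[int, bool]:
    """Keep items on which every judge agreed; map item number to the keyed direction."""
    kept = {}
    for item, votes in ratings.items():
        # An item is retained only if all judges gave the same rating.
        if len(set(votes)) == 1:
            kept[item] = votes[0]
    return kept


# Hypothetical ratings from 10 judges (True = judged socially desirable).
ratings = {
    1: [True] * 10,           # unanimous: desirable
    2: [True] * 9 + [False],  # one dissenting judge, so the item is dropped
    3: [False] * 10,          # unanimous: undesirable
}
selected = unanimous_items(ratings)
print(selected)
```

Because any single dissent removes an item, a rule of this kind retains only items at the extremes of the desirability continuum, which is the point made in the paragraph above.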

Some unfortunate consequences follow from 
the strict use of the statistical deviance model 
in the development of social desirability scales.
With items drawn from the MMPI, it is ap- 
parent that in addition to their scalability for 
social desirability the items may also be char- 
acterized by their content which, in a general 
sense, has pathological implications. When a 
social desirability scale constructed according 
to this procedure is then applied to a college 
student population, the meaning of high social 
desirability scores is not at all clear. When 
Ss given the Edwards SDS deny, for ex- 
ample, that their sleep is fitful and disturbed 
(Item 6) or that they worry quite a bit over 
possible misfortunes (Item 35), it cannot be 
determined whether these responses are at- 
tributable to social desirability or to a genuine 
absence of such symptoms. The probability of 
occurrence of the symptoms represented in 
MMPI items (and incorporated in the SDS)
in a college undergraduate population is undoubtedly
low. Thus, the achievement of high
SD scores may simply reflect the low fre- 
quency of pathological symptoms in this 
population and not the needs of Ss to present
themselves in a favorable light. Of course, if 
one is only concerned with the properties of 
test items (their social desirability scalabil- 
ity), this is not a relevant issue. If, however, 
major importance is attached to the needs of 
Ss in psychometric situations and the influ- 
ence of these needs on test responses, it is 
essential to be able to discriminate between 
the effects of item content and the needs of 
Ss to present themselves in a socially desirable 
(or undesirable) light. 

In the present research, a social desirability 
scale was developed according to a different 
psychometric model, avoiding the ambiguities 
of the statistical deviance approach. Basic to 
this model is the sampling procedure employed 
in the selection of items from a defined uni- 
verse. The population from which items were 
drawn is defined by behaviors which are cul- 
turally sanctioned and approved but which 
are improbable of occurrence. This will read- 
ily be recognized as the rationale underlying 
the Lie scale of the MMPI (Meehl & Hath- 
away, 1946). Items in the present scale, how- 
ever, are probably less extreme than the 
Lie items. 


METHOD 
Development of Scale 


A number of current personality inventories were 
consulted by the authors in order to devise a set of 
items for a new social desirability scale (M-C SDS). 
For inclusion in the scale, an item had to meet the 
criterion of cultural approval described above and 
was required to have minimal pathological or ab- 
normal implications if responded to in either the 
socially desirable or undesirable directions. A set of 
50 items meeting these criteria was submitted to 10 
judges, both faculty members and graduate students 
in the Department of Psychology of Ohio State Uni- 
versity, for social desirability ratings. The judges 
were instructed to score each item in the socially 
desirable direction from the point of view of college 
students, using true and false response categories. 
Unanimous agreement was obtained on 36 items and 
90% agreement on 11 additional items. These 47 
items constituted the preliminary form of the scale. 

A major objective in the development of the M-C 
SDS was the elimination of pathology-relevant item 
content. To test this and for comparative purposes, 
both the M-C SDS and the Edwards 39-item SDS
(Edwards, 1957) were submitted to 10 additional 
judges, again including both faculty members and 
graduate students in the psychology department, for 
ratings of the degree of maladjustment implied by 
socially undesirable responses to the items. A 5-point 
scale, ranging from extremely well-adjusted (1) to 
extremely maladjusted (5), was employed for this 
purpose. The mean rating for all the items in the 
M-C SDS was 2.8, slightly below the midpoint of the 
scale (implies neither good nor poor adjustment). 
The mean rating for the Edwards SDS items was 3.9, 
indicating that the judges considered socially unde- 
sirable responses on this scale to be definitely indica- 
tive of maladjustment. The t test of the significance 
of the difference between these means is 15.27, which 
is significant well beyond the .0001 level. 

The preliminary scale was then administered to 76 
students in two introductory Psychology courses, and 
an item analysis completed. There were 33 items that 
discriminated at the .05 level or better between high 
and low total scores. Of the 33 items, 18 are keyed 
true and 15 false, making a response set interpreta- 
tion of scores highly improbable. These 33 items con- 
stitute the final form of the M-C SDS and are listed 
in Table 1 with the socially desirable response scor- 
ing indicated. 
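
The scoring rule described above (one point for each response given in the keyed, socially desirable direction) can be sketched as follows. This is only an illustration: the function name `score_mc_sds` and the dictionary input format are hypothetical, and the key is transcribed from the (T)/(F) markings in Table 1.

```python
# Items keyed "true" in Table 1; the remaining items (of 33) are keyed "false".
# Transcribed from the table; 18 true-keyed and 15 false-keyed, as stated in the text.
TRUE_KEYED = {1, 2, 4, 7, 8, 13, 16, 17, 18, 20, 21, 24, 25, 26, 27, 29, 31, 33}
N_ITEMS = 33


def score_mc_sds(responses):
    """Return the number of items answered in the socially desirable direction.

    `responses` maps item number (1..33) to True (answered "true")
    or False (answered "false"). This input format is an assumption
    made for the example.
    """
    missing = set(range(1, N_ITEMS + 1)) - set(responses)
    if missing:
        raise ValueError(f"missing responses for items: {sorted(missing)}")
    # A response earns a point when it matches the keyed direction.
    return sum(
        1
        for item in range(1, N_ITEMS + 1)
        if responses[item] == (item in TRUE_KEYED)
    )
```

A respondent who answers "true" to every item would score 18 under this key and one who answers "false" throughout would score 15, which is why a simple acquiescence set cannot produce an extreme score.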


Reliability 


The internal consistency coefficient for the final
form of the scale, [Kuder-Richardson formula]
20, is .88. This was computed on 39 Ss, 10 males and
29 females, who were enrolled in an undergraduate
[...] State University. [...]
19 to 46 years. Thirty-one [...]
.89 was obtained.

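
As an illustration of how an internal consistency coefficient of this kind is computed for true-false items, here is a minimal sketch. It assumes the coefficient reported above is Kuder-Richardson formula 20 (the formula name is not legible in the text, so this is an assumption), uses the population variance of total scores (one common convention), and the function name `kr20` and the toy data are hypothetical.

```python
def kr20(item_matrix):
    """Kuder-Richardson formula 20 for dichotomous (0/1) item scores.

    `item_matrix` is a list of rows, one per subject; each row holds
    0/1 scores for the k items (1 = keyed response given).
    """
    n = len(item_matrix)
    k = len(item_matrix[0])
    totals = [sum(row) for row in item_matrix]
    mean_total = sum(totals) / n
    # Population variance of the total scores.
    var_total = sum((t - mean_total) ** 2 for t in totals) / n
    # Sum over items of p * q, where p is the proportion scoring 1.
    sum_pq = 0.0
    for j in range(k):
        p = sum(row[j] for row in item_matrix) / n
        sum_pq += p * (1 - p)
    return (k / (k - 1)) * (1 - sum_pq / var_total)


# Toy data: 4 subjects by 3 items (not from the study).
toy = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(round(kr20(toy), 2))
```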
Relationship to Edwards SD Scale 


The correlation between the M-C SDS and the
Edwards SDS is .35, which is significant at the .01
level. The sample on which this correlation is based
included, in addition to the 39 abnor[...]
students, 81 students in [...]dren. The correlation
sk[...] SDS scores rather closely approximates a normal
distribution, while negative skewness, consistent with
previous findings (Edwards, 1957), is found for the
Edwards SD distribution. It is interesting to compare
the Edwards SD mean found in the present research
with that originally reported. The means of 28.6 and
27.1 for males and females reported by Edwards are
considerably lower than the value found in this study.


Correlations with Other Scales 


A considerable portion of the research on social 
desirability has involved the correlation of SDSs with 
MMPI variables. To compare the present scale with 
TABLE 1


THE MARLOWE-CROWNE SOCIAL DESIRABILITY SCALE 


Personal Reaction Inventory 


Listed below are a number of statements concerning personal attitudes and traits. Read each item and decide
whether the statement is true or false as it pertains to you personally. 


1. Before voting I thoroughly investigate the qualifications of all the 
candidates. (T) 


2. I never hesitate to go out of my way to help someone in trouble. (T)


3. It is sometimes hard for me to go on with my work if I am not en- 
couraged. (F) 


4. I have never intensely disliked anyone. (T) 

5. On occasion I have had doubts about my ability to succeed in life. (F) 
6. I sometimes feel resentful when I don’t get my way. (F) 

7. I am always careful about my manner of dress. (T)


8. My table manners at home are as good as when I eat out in a restau- 
rant. (T) 


9. If I could get into a movie without paying and be sure I was not seen 
I would probably do it. (F) 


10. On a few occasions, I have given up doing something because I thought
too little of my ability. (F)


11. I like to gossip at times. (F)

12. There have been times when I felt like rebelling against people in 
authority even though I knew they were right. (F) 

13. No matter who I’m talking to, I’m always a good listener. (T) 

14. I can remember “playing sick” to get out of something. (F) 

15. There have been occasions when I took advantage of someone. (F) 

16. I’m always willing to admit it when I make a mistake. (T)

17. I always try to practice what I preach. (T) 

18. I don’t find it particularly difficult to get along with loud mouthed,
obnoxious people. (T) 

19. I sometimes try to get even rather than forgive and forget. (F) 

20. When I don’t know something I don’t at all mind admitting it. (T) 

21. I am always courteous, even to people who are disagreeable. (T) 

22. At times I have really insisted on having things my own way. (F) 

23. There have been occasions when I felt like smashing things. (F)


24. I would never think of letting someone else be punished for my wrong- 
doings. (T) 


25. I never resent being asked to return a favor. (T) 


26. I have never been irked when people expressed ideas very different 
from my own. (T) 


27. I never make a long trip without checking the safety of my car. (T) 


28. There have been times when I was quite jealous of the good fortune 
of others. (F) 


29. I have almost never felt the urge to tell someone off. (T) 
30. I am sometimes irritated by people who ask favors of me. (F) 
31. I have never felt that I was punished without cause. (T) 


32. I sometimes think when people have a misfortune they only got what
they deserved. (F) 


33. I have never deliberately said something that hurt someone’s feelings.
(T)


TABLE 2 


MEANS AND STANDARD DEVIATIONS OF THE SOCIAL 
DESIRABILITY SCALES 


Scale                    N      Mean     SD

M-C SDS                  120    13.72    5
Edwards SDS              120    31.83    5
(From Edwards, 1957)
  Males                  84     28.6     6.5
  Females                108    27.1     6.5


the Edwards SDS, Pearson product-moment correlations
were computed between the two SDSs and the
following MMPI and derived scales: K—Test-taking
attitude; L—Lie; F—Validity and test-taking attitude;
Hs—Hypochondriasis; D—Depression; Hy—
Hysteria; Pd—Psychopathic Deviate; Pa—Paranoia; 
Pt—Psychasthenia; Sc—Schizophrenia; Ma—Manic; 
Pr—Prejudice (Gough, 1951); St—Status (Gough, 
1948); Es—Ego Strength (Barron, 1953); MAS— 
Manifest Anxiety (Taylor, 1953); A—Anxiety 
(Welsh, 1956) ; R—Repression (Welsh, 1956). 

The 39 Ss referred to above who served in the 
study were administered the M-C SDS, the 39-item 
Edwards SDS, and the MMPI in that order. The 
first two tests were given on the same day and the 
MMPI about a month later. Thirty-four Ss com- 
pleted all of the tests and 37 of them completed all 
but the derived MMPI scales. 

Table 3 presents the correlations between the M-C 


TABLE 3 


CORRELATIONS BETWEEN THE SOCIAL DESIRABILITY
SCALES AND VARIOUS MMPI SCALES FOR 37 MALES
AND FEMALES


MMPI Scales    M-C SDS        Edwards SDS
K               .40*           .65**
L               .54**          .22
F              -.36*          [illegible]
Hs             -.30           [illegible]
D              -.27           -.72**
Hy             [illegible]     .09
Pd             -.41**         -.73**
Pa              .21           -.02
Pt             -.30           -.80**
Sc             -.40*          -.77**
Ma             -.24           -.42*
Pr(a)          -.27           -.58**
St(a)           .16            .14
Es             [.?7]          [illegible]
MAS(a)         -.25           -.75**
A(b)           -.23           -.61**
R(b)            .28            .07


* Significant at the .05 level.
** Significant at the .01 level.

(a) N = 36.

(b) N = 34.
SDS and the Edwards SDS and the 17 MMPI
validity, clinical, and derived scales. It is at once 
apparent that uniformly higher correlations obtain 
between the Edwards SDS and the various MMPI 
scales than between the M-C SDS and these MMPI 
variables. A general trend, which is consistent with 
previous research, is found in the positive correlations 
between the SDSs and the validity scales of the 
MMPI, and negative correlations with most of the 
clinical scales. Four clinical scales correlate highest 
with both SDSs, with the single exception of D 
which correlated —.27 with the M-C SDS: Sc, Pd, 
Pt, and Hs. Two of these four, Sc and Pt, are 
considered to be among the most “pathological” of 
the clinical scales. 


DISCUSSION 


The most important feature of the findings 
of this study is found in the marked differ- 
ences overall in the magnitude of the correla- 
tions between the two SDSs and the MMPI. 
Consistently higher correlations were found 
between the Edwards SDS and the MMPI 
scales than were obtained between the M-C 
SDS and the MMPI scales. The high Edwards 
SDS-MMPI correlations, in general, confirm 
findings previously reported by Edwards 
(1957) and Fordyce (1956). Correlations be- 
tween the Edwards SDS and the Pt, Sc, and 
MAS scales, in fact, approach the asymptotic 
value of the reliabilities of the separate tests. 
With correlations this high, it is necessary to 
raise the question of whether the Edwards 
SDS and these MMPI scales are not, in 
effect, functionally unitary. It would appear 
to be difficult to hold the view that SD scores 
and MAS, Pt, and Sc scores can be interpreted 
differently. More in accord with the evidence 
would be to attribute the covariance of the 
Edwards SDS and these MMPI scales to item 
similarity and to the “pathological” content of 
both sets of items. This would lead to an 
interpretation of the Edwards scale as a meas- 
ure of the willingness to admit to certain 
symptoms of a “neurotic” nature or as a meas- 
ure of general “neuroticism.” But this does 
not enable one to discriminate between high 
SD scorers who genuinely do not have the 
symptoms represented in the SDS items from 
those Ss who conceal (consciously or uncon- 
sciously) their symptoms and whose responses 
are motivated by social desirability. To the 
extent, then, that the Edwards SDS measures 
social desirability, it does so in the very restricted
sense that high SD scores imply that
it is bad or undesirable to have or admit to 
symptoms. Possibly, such attitudes have little 
generality and would not be related to other 
test behavior or social behavior. Sarason 
(1959) has also raised the question of the 
interpretation of Edwards SDS as an un- 
confounded measure of social desirability. 

In the development of the M-C SDS, social 
desirability was defined more broadly to refer 
to the need of Ss to obtain approval by re- 
sponding in a culturally appropriate and ac- 
ceptable manner. This conception does not 
involve the acquiescence or denial by S of 
pathology. The significantly different malad- 
justment ratings obtained on the two SDSs 
support the hypothesis that the Edwards SDS 
involves the admission or denial of maladjus- 
tive symptoms and indicate that socially unde- 
sirable responses on the M-C SDS do not 
imply maladjustment. 

The smaller correlations between the M-C 
SDS and the various MMPI scales would 
be predicted if one views social desirability as 
accounting for a fraction of the MMPI vari- 
ance but not all or most of it. The problem 
of overlapping meanings is thereby avoided. 
Thus, it is submitted that the M-C SDS- 
MMPI correlations more accurately indicate 
the amount of MMPI scale variance which 
may be attributed to differences in the need 
to give socially desirable responses.
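
One conventional way to read "amount of variance attributed" is the squared correlation coefficient. The text does not state a formula, so this reading is an assumption; as a worked example using the Pt correlations reported in Table 3:

```latex
\[
r_{\text{M-C},\,Pt}^{2} = (-.30)^{2} = .09,
\qquad
r_{\text{Edwards},\,Pt}^{2} = (-.80)^{2} = .64
\]
```

On this reading, roughly 9% of the Pt variance would be shared with the M-C SDS, against roughly 64% with the Edwards SDS.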

It may additionally be pointed out that the 
M-C SDS and the Edwards SDS differ con- 
siderably in the amount of content or item 
overlap with the various MMPI scales. The 
present scale contains one exact and four 
approximate replications of L items and one
repetition of a K scale item. By contrast, the 
Edwards SDS, it will be recalled, was con- 
structed from a heterogeneous pool of MMPI 
items and not inconsiderably overlaps with 
many MMPI validity and clinical scales. The 
two SDSs have no items in common. 

Certain additional aspects of the present 
findings are worthy of note. Positive correla- 
tions are found for both SDSs with the K and 
L scales, on which high scores are generally 
interpreted to indicate “defensiveness” and 
the attempt by S to cast himself in a favorable 
light. The M-C SDS correlates much more 
highly with L, however, than does the Edwards
SDS. The negative correlations with the
F scale are accounted for by the interpretation 
of a high F score as an indication of “plus 
getting.” Regarding the clinical and derived 
scales, in general those MMPI scales on which 
a high score indicates maladjustment are neg- 
atively correlated with the SDSs. In part, the 
exceptions to this may be explained in terms 
of the distinction between subtle and obvious 
scoring on some of the clinical scales. Item 
subtlety, meaning the relative absence of so- 
cial desirability implications, would account 
for the negligible correlations between the 
Edwards SDS and the Hy and Pa scales, for 
example. The fairly substantial correlations 
between the Edwards SDS and the Pr and Es 
scales may again be a function of similarity 
in general item content. In the judgment of 
the present authors, about half of the items in 
the Es scale would be classed as “pathological,”
while roughly a third of the Pr items
would be so considered. 

The positive correlation between the M-C 
SDS and the Pa scale, however, is an interesting
possible exception. While this r falls short
of significance, it might suggest that high SDS 
scores (implying in the present definition of 
the construct a high need for the approval of 
others) tend to be associated with concern or 
suspicion about the motives of others. Correlations
between the Edwards SDS and Welsh’s
A and R scales have not, to the writers’ 
knowledge, been previously reported. The Ed- 
wards SDS correlated highly, as one would 
expect, with the A scale but not at all with R
which has a rather heterogeneous item content 
in terms of pathology. The M-C SDS does 
better in this case with an r of .28. This is in
the predicted direction since all of the items
on the R scale are keyed false. The M-C SDS
correlation with the A scale is of the same 
magnitude as the correlation with the MAS 
and is consistent with expectation. It would 
appear from the correlations of the SDSs with 
A and MAS that the latter are approximately 
equivalent measures. 


SUMMARY 


In this research, an alternative model to
Edwards’ conception of social desirability was
proposed. Basic to the present construct of
social desirability is the definition of a population
of culturally acceptable and approved behaviors
which are, at the same time, relatively
unlikely to occur. Test items were drawn from 
this population in the development of a new 
social desirability scale, the Marlowe-Crowne 
Social Desirability Scale. This scale was corre- 
lated with 17 MMPI validity, clinical, and 
derived scales and the results compared with 
the correlations of the Edwards SDS with 
these MMPI variables. The very high correla- 
tions obtained with the Edwards scale cast 
doubt on the interpretation of this test as a 
measure of the influence of social desirability 
on test responses. The magnitude of the corre- 
lations of the new scale with the MMPI was 
considered to be more in accord with a 
definition of social desirability in terms of 
the need of subjects to respond in culturally 
sanctioned ways. 
Psychologists interested in the relationship
of motivation to cognition have obtained some 
understanding of the ways in which cognitive 
impairment can result from emotional disturb- 
ance. Meanwhile, clinical inquiries have led to 
an expanding knowledge of the effects of par- 
ent-child relationships upon the emotional 
processes of the individual. Little has yet been 
done, however, to trace the sequence from 
family relationships, through the child’s emo- 
tional dynamics, to his cognitive processes. 
The present paper is designed to help bridge 
these areas by tracing a causal linkage be- 
tween disturbance in parent-child interactions 
and the child’s cognitive impairment. 

The boy in question, Robert Strong, is now 
15. He has had weekly hours of therapy for 
four years and has seen a therapeutically- 
oriented tutor for one to three hours a week 
over two academic years. His mother has seen 
a social worker for weekly hours during this 
whole period, and his father has seen the same 
person for the past two years. Eleven when he 
first came to the clinic, Bobby had repeated 
the second grade and had had near-failing 
marks in each grade through the fifth. Yet at 
the time of referral the Wechsler-Bellevue 
Intelligence Scale for Children confirmed that 
his intelligence was average. After three and
one-half years of treatment he has made 
marked gains in areas other than school per- 
formance, but he has yet to obtain an average 
report card. Regularly administered school 
achievement tests have left no doubt of his 
trouble in acquiring the basic skills and ab- 
sorbing knowledge. Although these tests indi- 
cate that he is now above grade level in 
reading skill, this gain has not been reflected 
so far in his school performance. 


THE PARENTAL RELATIONSHIP 


The immediate family consists of Mr. and 
Mrs. Strong, a couple in their early 40’s, their 
daughter, Flora, who attends college and 
comes home only on weekends, and Robert.
Mr. Strong works as a semiskilled laborer in a
factory, and his wife teaches nursery school.

On the surface, Mrs. Strong would wish us 
to see her as calm. She seeks to control the 
expression of emotions by reason and by rigid 
rules for behavior taken mainly from her 
religion. Education and the development of 
reasoning powers have been of great impor- 
tance to her, since they help protect her 
against emotions and also constitute a means 
of expressing her rivalry with men. It is not 
a coincidence that her husband, in turn, is a 
person who has tiptoed through life. At 43 he 
is boyish in appearance and mannerism. In 
the beginning of treatment he described him- 
self as “meek and shy, a self-conscious man 
with rusty brains.” He bows out completely 
from the intellectual realm, deferring all to his 
wife and stating that his only contribution is
that of brawn. Like his wife, his major de- 
fense is that of denial. Were we to take his 
word, anger, resentment, and rivalry are non- 
existent in his makeup. Yet, in truth, they 
accumulate to such an extent that he has 
explosive outbursts which frighten him as well 
as the rest of the family. In contrast to his 
wife’s position, Mr. Strong has looked upon 
education and the ability to reason as powers 
which he could not afford to possess lest they 
make him destructive. 

In the parents’ marriage, a balance was 
formed that met both their needs. By virtue 
of her husband’s need for her, Mrs. Strong has 
been able to deny her rivalry with men, to 
maintain at the same time a superior position 
to men, and also to continue to see herself as 
a helping person. If he feels unable to carve 
the meat, answer the phone, do the family 
finances, then she must take over these duties 
to help him. For Mr. Strong, the choice of
marital partner was part of a lifelong renun- 
ciation. 

Consider now the children of this marriage. 

Flora, their first-born, has managed to grow
without disturbing this balance. She achieved 
a good academic standing and thereby formed 
a team with her mother of the educated 
women in the family. The role that was as- 
signed to Bobby, on the other hand, is best 
understood by considering what the birth of 
a male child meant to these parents. For quite 
different reasons, a son was a threat to both 
of them. To Mrs. Strong, a male child was a 
potential competitor, a potential danger if he 
grew to mature manhood. To Mr. Strong, 
Bobby was perceived as a sibling rival. It was 
as though the father feared that Mrs. Strong 
would not have enough to give to two. At the 
same time, he saw that Bobby could possibly 
excel over him in manliness. We can see that 
the role Bobby adopted—that of passivity, 
dependence, and underachievement—met the 
needs of both his parents in quieting their 
fears. 

THE BOY’S EMOTIONAL DIFFICULTIES

The parents’ everyday behavior continually 
suggested that people contain dangerous impulses
and the world is a dangerous place in
which to live. Bobby had listened to his 
father’s detailed accounts of crippling accidents
at the factory. He had heard his
mother’s equally thorough descriptions of the 
motorcycle accidents that had killed his ma- 
ternal grandfather and maimed his maternal 
grandmother long before Bobby was born. 
He had overheard, while lying frightened and 
alone in his crib, the sounds that accompanied 
his maternal grandmother’s death from a 
heart attack and fall down the stairs. Events 
such as these could well reinforce fear and 
guilt about his own aggressive impulses, espe- 
cially when his mother did not let him forget 
that the bigger he got, the more easily he 
could hurt someone. Bobby found his at- 
tempts at mastery were often received by his 
parents as a hostile effort to put someone else 
out of first place. School was actually the most 
important sphere of achievement in Bobby’s 
life, and both his parents felt strong ambiva- 
lence about his success in this area. Accord- 
ingly, accomplishment in school became en- 
dowed for him with fear and guilt. 

On one occasion when the therapist dis- 
cussed Bobby’s fears about achievement in 
school, Bobby made no verbal response but 
worked his hand across the desk in a walking 
movement. When asked about this, he said it 
reminded him of a man with a broken leg on 
crutches. The weekend after receiving his first 
really encouraging school report, Bobby broke 
his skis in a fall. When asked how his sister 
would like it if he got a good report card, he 
replied that she would have a fit. When asked
how his tutor would like it, he replied that his 
tutor would hit the ceiling. Bobby did not 
realize until it was pointed out to him that 
these phrases implied any hostility. On an- 
other occasion, he brought in a test paper 
containing an A and absent-mindedly erased 
the A from the paper while talking about it. 

Even when his daily work had improved, 
he continued getting poor report cards. The 
low grades could be attributed to various self-defeating
maneuvers that will be discussed
below. No matter how genuine his distress 
about failure, it was evident that Bobby pos- 
sessed a clearly defined need to fail. 

Bobby seemed to perceive his relationship 
to his mother and father as laden with con- 
flict. It was as if Bobby and the rest of the 
family were continually engaged in playing 
“king of the mountain,” and one could get
enough space only by shoving someone else 
out of it. When asked if his father went ice 
skating with him, he replied with a grin, “No, 
I was using his ice skates.” When Bobby 
proudly appeared in a pair of size 9 shoes, 
the therapist asked him what his father had 
said about this new possession. Bobby 
promptly answered, “He said now he could 
wear my shoes.” When his wishes and fears 
about displacing father from the house were 
interpreted to Bobby, he replied with a 
dreamy smile, “You take the living room and 
I’ll take the kitchen . . . and you’ll starve
to death.” 

Denial seemed to be Bobby’s main means 
of reacting to family conflicts. He could never 
discuss clearly any central aspect of his rela- 
tionship to his parents. When asked what was 
going on at home, he would say only that 
father had been doing some carpentry, or they 
had been working in the garden. During a 
period when the parents were known to be 
going through an explosive upheaval in their 
relation to each other, even the most per- 
sistent prodding brought no comment from 
Bobby. After extensive work with him, it 
began to appear that a close parallel existed 
between the way he thought about his family 
and the way he thought about schoolwork. 
Rather than grasp the crucial variables of any 
problem in their important relationships to 
one another, for instance, Bobby would fasten 
upon a fragmentary assortment of often pe- 
ripheral details. 


THE BOY’S COGNITIVE PROCESSES
Schoolwork’s Danger Potential 


Bobby was highly sensitized to the danger 
potential of everything in his environment. No 
matter with what he was engaged, those at- 
tributes of the object or activity that might 
injure him were the ones of which he was most 
aware, and hence the ones to which he re- 
sponded. During one session, for example, the 
tutor suggested a word game in which one per- 
son provides a key word, and then the other 
must write out, as quickly as possible, all the 
words that the key word suggests to him. 
When the tutor gave Bobby “danger” as the 
key word, he replied, “I can answer that one
in a single word,” and wrote “everything.”
He then proceeded to describe a car as some- 
thing that runs you over, a plane as something 
that crashes, machines in factories as things 
into which the unwary worker can fall and get 
chopped up, a door as something that can 
catch and squash your finger, and on without 
end. In the case of each such object, attributes 
which are not very important to most children 
were the critical ones for Bobby and served to 
control his thought about the object. 

If Bobby viewed everyday objects like cars 
and doors as potential sources of harm, this 
was even more true of his feelings toward 
school and tutorial sessions. Once, for ex- 
ample, when tutor and tutee were alternately 
making up radio “commercials” and recording 
them on tape, Bobby gave a commercial for 
the “Boston Singing Club,” telling how you 
can get singing lessons, and as a result you 
develop a pitch that will break a glass. Sing- 
ing lessons, his metaphor here for the tutorial 
sessions, apparently were seen as making him 
capable of doing harm. 

Consider also the time when he and his 
tutor were drawing a graph to represent the 
relation of temperature to pressure of a cer- 
tain gas. After working on the graph for a 
while, Bobby announced that he was sure 
there was going to be an explosion of the gas 
due to the pressure, so the pressure would go 
down; and he promptly extrapolated a dotted 
line on the graph which veered sharply down- 
ward. The tutor assured him that the scientist 
had designed a very strong steel chamber for 
the gas so it could not possibly explode even
if the pressure were increased; but Bobby 
claimed rather that the chamber was bound to 
explode, the scientist get hurt, and the gas 
pressure drop. He then sketched in further 
drastic rises and falls of the dotted graph line, 
put snowcaps on all the peaks of the graphs— 
saying they are snow-covered mountains— 
and drew a figure skiing down the mountain 
side. Immediately afterward he drew a tree 
squarely in front of the skier, so the latter had 
to crash into it. When the tutor noted that a 
crash was imminent, Bobby then drew in two 
ski tracks going around opposite sides of the 
tree so that the skier would have to split in 
two in order to follow them. Thus, if Bobby
could not claim his schoolwork would lead to 
death through exploding gases, he would make
it lead to death by some other means. Bobby’s 
mind seemed far indeed from the ostensible 
goal of working on the graph. He was totally 
occupied with assaying the injury potential of 
his paper-and-pencil activity. 

Much of Bobby’s cognitive energy was ab- 
sorbed by the task of detecting and guarding 
himself against such imagined dangers as 
these. He showed an avoidance of academic 
material in general and also an avoidance of 
the concept of equivalence in particular. We 
shall consider each in turn. 


Physical Avoidance of Schoolwork 


There were two levels at which Bobby 
sought to avoid academic material in general: 
the level of physical activity and the level of 
thinking. First of all, Bobby had developed to 
a high degree strategies for physically getting 
himself out of, or not getting into, schoolwork 
situations. Several times, for instance, he man- 
aged to “forget” to bring his books to tutorial, 
too often for it to be mere chance and despite 
specific injunctions not to forget. If the books 
were brought, another kind of procrastination 
often ensued: hiding so the tutor could not 
find him. When finally engaged in some sort 
of schoolwork, he would be as inactive at it as 
possible—yawning and sprawling over the ta- 
ble as if he had not slept for days, acting as if 
the pencil were too heavy a weight to lift. And 
at every possible juncture, he would try to 
lure the tutor away from the topic of school- 
work by starting to talk about other things or 
looking around for any object that might pro- 
vide a distraction. At school he sometimes 
would do the wrong homework or fail to 
hand it in. 

All children, to be sure, would rather do 
other things than schoolwork. But most of 
them learn to engage in school activities 
nevertheless, and even to find pleasure in 
them. Bobby, on the other hand, showed a 
degree of insistence in avoiding academic ac- 
tivities which went far beyond playful truancy. 


Avoidance of Abstraction and Anticipation in 
Thinking 
When physical means failed, Bobby’s avoid- 
ance of academic material went on at the level 
of thinking itself. We have already suggested 
that Bobby strongly denied his knowledge of
crucial family relationships. Confronted in 
school with the task of determining which 
were the important facts and how they were 
related, Bobby was often at a loss. It seemed 
as if he were trying to avoid the kind of think- 
ing that might lead back to intolerable dis- 
coveries. One part of thinking has to do with 
abstraction activities: summarizing a pot- 
pourri of data in some terse and effective 
manner that is easier to retain and use than 
were the original facts. Such is the function of
a formula or of an outline for a book. Another 
process in thinking concerns what we may call 
anticipation activities: predicting the conse- 
quences of present events by a kind of trial- 
and-error in imagination. The effect of avoid- 
ing such abstraction and anticipation activities 
will appear as a kind of “perceptual seduc- 
tion” by the specific information made avail- 
able to the senses. 

Bobby, in his tutorial work, gave evidence 
of avoiding both aspects of thought. Consider 
first the avoidance of abstraction. With regard
to verbal material, for instance, he often com- 
plained that he never knew what he should 
remember, in history or science, from the jum- 
ble of words before him. In his note-taking he 
would copy out nonessential details verbatim 
from the text, and often miss the whole point 
everything once more. Later he came to another
identical place and went through this
same procedure, and still later this happened 
again. Even though the three assemblies were 
exactly the same, he did not profit from his 
past experience and anticipate that several 
parts had to be held by the same screw when 
he came to each subsequent screw assembly. 
Knowledge was never transferred, as if the 
planning that would be involved was a burden 
he refused to assume. 


Avoidance of Equivalence Relationships 


Bobby’s avoidance of the threatening topic 
of relationships among family members spread 
to the matter of relationships among numbers 
too. Bobby had much difficulty with the 
notion that various arithmetic expressions are 
equivalent to one another—something his 
arithmetic teachers had been demonstrating to 
him for some time now. Even after studying 
these equivalences for weeks, he still could not 
tell the tutor the correct decimal equivalent of 
the fraction 1% or the percentage equivalent 
of that fraction. When forced to try an an- 
swer, he would guess at random or would 
respond in terms of the specifics of the 144 by 
claiming, for example, that 144% was the re- 
quired percentage equivalent. In similar fash- 
ion, Bobby could not translate from one frac- 
tion to another. For example, he could not 
guess what simpler fraction would be the ap- 
proximate equivalent of 250/525. Again, after 
tutor and tutee had indicated on a line the 
locations of 0, 14, 14, and 1, Bobby could not 
tell where % would be located on this line, 
even though 14 appeared there already; and 
he could not find the position of 74, even 
though 14 already appeared. This boy was 13 
years old at the time.

During one session, Bobby was marking off 
sixths at equal intervals by means of vertical 
lines. He finally got to 6/6 and then went over
one further equal interval and labeled it "1."
He thus put down 6/6 and 1 at different
positions and was quite satisfied with this until
the tutor questioned him about it. It was as
if he feared having the 6/6 and the 1 displace
each other—and the only way he could avoid 
this was to imply that they were not equiv- 
alent but different. This latent fear could well 
derive from his conceiving of family
relationships as involving no more than one possible
actor for each role. When he was forced to
deal actively with number equivalences, in 
turn, we found that once again he did not 
believe that two terms could fulfill a common 
function. His modes of thinking about the 
family hence may have generalized to an 
abstract cognitive domain. 

What happened when Bobby was forced 
explicitly to consider number relationships is 
illustrated by such episodes as the following. 
The tutor, for example, set up the equation,
3/5 = .6, and asked Bobby: “Does this mean
that the 3/5 gets lost somehow or destroyed
when we put down the .6, or rather does it
mean the .6 and 3/5 are just different ways of
talking about the same thing?” Bobby
decidedly indicated that an arithmetic term got
lost, destroyed, when you made something else 
equal to it, and held firmly to this view of the 
matter despite lengthy explanations by the 
tutor on many different occasions. The tutor, 
for instance, presented such examples as 3/5
= 6/10, and had Bobby cancel through the
6/10 and thereby reduce it back to 3/5. Here
too he thought the 3/5 got destroyed when
made equal to the 6/10, and he did not feel
his logic undermined when he found the 3/5
recoverable again from the 6/10. Even when
he was finally able to grasp the idea in the 
case of one example, he was still quite in- 
credulous when the tutor told him that this 
is also what went on in the other cases that 
were discussed. 
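
The equivalences at issue in these sessions can be checked mechanically. The following is a minimal sketch using Python's standard `fractions` module; the helper name `forms` is hypothetical, and 6/10 is used only as an illustrative second way of writing 3/5. The fraction 250/525 is the one the tutor asked Bobby to approximate.

```python
from fractions import Fraction


def forms(fr: Fraction) -> dict:
    """Return a fraction in lowest terms with its decimal and percentage forms."""
    return {
        "lowest_terms": f"{fr.numerator}/{fr.denominator}",
        "decimal": float(fr),
        "percent": float(fr * 100),
    }


three_fifths = Fraction(3, 5)
six_tenths = Fraction(6, 10)  # reduced to lowest terms on construction

# Two ways of writing the same number: neither term is "destroyed".
print(three_fifths == six_tenths)
print(forms(three_fifths))

# 250/525 in lowest terms, and the nearest fraction with denominator <= 2.
messy = Fraction(250, 525)
print(messy, messy.limit_denominator(2))
```

Because `Fraction` stores every value in lowest terms, 3/5 and 6/10 compare equal, and the decimal and percentage forms are derived from the same stored value rather than replacing it.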

Later the tutor demonstrated in yet another 
way that these two fractions really referred 
to the same thing, by illustrating each of them 
on a pie-cut circle graph of the same diameter: 
the segments representing 3/5 and 6/10 of each
circle, hence looking exactly the same. Finding
out in this way that the proportion was the
same on both circle graphs appeared to cause 
Bobby the greatest surprise, and further dem- 
onstration of other arithmetic equivalences by 
the same method continually brought forth 
exclamations of amazement. All this at an age,
13, when work with arithmetic equivalences
is second nature to most children.

When the tutor drove Bobby home after a 
session in which the meaning of arithmetic 
equivalence had been explored, Bobby was 
inordinately prone to see the environment as 
injurious: expostulating at great length on
how cars might smash up against the curb 
and hurt it, on how dangerous cars were in 
general, and then himself almost running into 
the path of an oncoming car while crossing 
the street. During another session on number 
equivalences, he squirmed, was uneasy, and 
eventually broke off from his work to describe 
newspaper accounts of a boy who got twisted 
up in a clothesline and hung himself by ac- 
cident, and a girl who was climbing a tree 
when she fell into a Y formed by the branches 
and choked to death. During the work itself, 
Bobby had been punching holes viciously with
a pencil into the side of a cardboard box. 
With regard to numerical equivalence rela- 
tionships, then, Bobby suffered an impairment 
in his capacity for what Piaget (1954) calls 
“conservation of identity.” In these number 
realms, any different manifestation or appear- 
ance of something meant, for Bobby, a differ- 
ent thing, and could never mean the same 
thing. This trouble with the conservation of 
identity perhaps had its roots in his fear that 
equivalence in the family was impossible, that 
one of the aspirants had to suffer destruction. 
This section has described the impact of 
Bobby’s emotional difficulties on his cognitive 
activities. We have noted his sensitization to 
the possibility of danger in school activities, 
his physical and cognitive avoidance of school- 
related matters in general, and his avoidance 
of the concept of equivalence in particular. 
Our discussion has traced these cognitive
impairments to two main kinds of causes: fear of
school achievement and avoidance of knowl- 
edge of crucial family relationships. 
CONCLUSION


We have sought to demonstrate how a long- 
standing disturbance in family relationships 
can result in cognitive difficulties for a child. 
The effort to avoid awareness of the family 
conflict has led to chronic interference with 
formal thought processes, while the meanings 
attached to school achievement have had a 
further inhibitory effect. Treatment over a 
long period has brought about a variety of 
marked changes in the parents and in the boy, 
and he seems to be meeting the other chal- 
lenges of adolescence with a degree of success 
that could not have been predicted as recently 
as a year ago. In particular, he has overcome 
an initial strong shyness with respect to girls 
and now boasts a steady girl friend, he does 
very well in school sports and shop, and he 
has been making more friends among peers. 
However, his general thinking capacity still 
shows partial impairment, and his capacity 
for reproducing knowledge under the stress of 
a school setting is still markedly impaired. In 
retrospect, it appears that successful ther- 
apeutic intervention in a case of this kind 
requires a much closer collaboration of ther- 
apeutic and educational methods than has yet 
generally been achieved. Most instructive for 
psychology, perhaps, is the degree of both 
specific and general relationship found be- 
tween family setting and the child’s cognitive 
processes. 
CASE HISTORY CORRELATES OF THREE MMPI
PROFILE TYPES* 


HAROLD GILBERSTADT AND JAN DUKER


University of Minnesota 


In spite of the accumulation of research 
studies attesting to the superiority of the 
actuarial over the clinical approach in de- 
scribing personality (Meehl, 1954), surpris- 
ingly little attention has been given to the 
task of compiling descriptions to be applied 
actuarially. As a result of his survey of this 
research, Meehl (1956) called for the de- 
velopment of “cookbook” descriptions. The 
purpose of the present paper is to report on 
a project designed to develop a source book 
of personality descriptions for narrowly speci- 
fied MMPI profiles. It is hoped that the use 
of such descriptions will improve communi- 
cation about patients, will make profile inter- 
pretation less dependent on the personal ex- 
perience and skill of the individual clinician, 
and will help to standardize classification of 
personality for further research. The present 
paper reports data from a trait and symptom 
check list which was applied to the case rec- 
ords of three MMPI types. 


METHOD 
Population 


Records were selected from the files for all
patients admitted to the Psychiatric Service of the
Minneapolis Veterans Administration Hospital and
examined by the psychology section during the years
1952-1957. Of approximately 3,150 such cases, about
2,300 were first admissions. All patients were male
veterans. First admission cases which met the
following criteria were studied:


1. MMPI administered not more than 21 days
before or 21 days after admission.

2. Age range 20 to 60 years.

3. Primary diagnosis not brain damage.

4. MMPI scale T scores: L = 60 or less, F = 85
or less, K = 70 or less.

5. Shipley Institute of Living Scale IQ estimate
105 or more.


1 The authors wish to express appreciation to
Starke R. Hathaway and to Robert D. Wirt of the
University of Minnesota for their helpful suggestions
concerning the preparation of this paper.
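
The five selection criteria can be read as a single screening predicate. The sketch below is illustrative only: the record type `CaseRecord`, its field names, and the function `is_eligible` are hypothetical and are not part of the original study materials.

```python
from dataclasses import dataclass


@dataclass
class CaseRecord:
    # All field names are hypothetical labels for the quantities in criteria 1-5.
    first_admission: bool
    days_between_mmpi_and_admission: int  # negative if tested before admission
    age: int
    primary_diagnosis_brain_damage: bool
    t_l: int          # MMPI L scale T score
    t_f: int          # MMPI F scale T score
    t_k: int          # MMPI K scale T score
    shipley_iq: int   # Shipley Institute of Living Scale IQ estimate


def is_eligible(case: CaseRecord) -> bool:
    """Apply the first-admission requirement and criteria 1 through 5."""
    return (
        case.first_admission
        and abs(case.days_between_mmpi_and_admission) <= 21   # criterion 1
        and 20 <= case.age <= 60                               # criterion 2
        and not case.primary_diagnosis_brain_damage            # criterion 3
        and case.t_l <= 60 and case.t_f <= 85 and case.t_k <= 70  # criterion 4
        and case.shipley_iq >= 105                             # criterion 5
    )


example = CaseRecord(True, -3, 34, False, 48, 62, 50, 110)
print(is_eligible(example))
```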


Selection of Profile Types 


Of several methods of profile analysis (Cronbach 
& Gleser, 1953; Lykken, 1956; Mosel & Roberts, 
1954), it was decided that the contingency method 
employed by Meehl and Dahlstrom (Meehl, 1959) 
was best suited for the project. This method con- 
sisted of the empirical derivation of a set of rules 
which would specify the elevations and the inter- 
relationships of the MMPI scales of a given type of 
profile and which would exclude profiles assumed to 
be not of the given type. The method permitted uti- 
lization of experience which the investigators had ac- 
cumulated with respect to the MMPI profiles of the 
population being studied. On the basis of this ex- 
perience, a set of specimen MMPI profiles was col- 
lected for study which was expected to be repre- 
sentative of frequently recurring personality types in 
the setting. Characteristics of three profile types will 
be reported in this paper; namely, MMPI codes 2- 
7-8 (D-Pt-Sc), 2-7 (D-Pt), and 2-7-4 (D-Pt-Pd). 
Starting from the specimen profiles, a set of con- 


TABLE 1
MEAN MMPI T SCORES FOR THE THREE CODE TYPES


Code type   N   L   F   K  Hs   D  Hy  Pd  Mf  Pa  Pt  Sc  Ma  Si
2-7-8      22  48  72  49  73  99  74  77  70  70  98  99  53  77
2-7        13  50  55  55  68  94  74  62  58  60  84  62  47
2-7-4      27  48  62  50  66  90  71  83  58  64  83  68  60
TABLE 2


PERCENTAGE FREQUENCIES OF CHECKLIST ITEMS FOR GENERAL 
ABNORMAL (GA) SAMPLE AND EACH PROFILE TYPE 


Greater than GA Equal to or less than GA 
Check List Item GA 2-7-8 2-7 2-7-4 2-7-8 2-7 2-7-4
Agitated 14 23 5 y 
Anxiety 33 46 85% 56* 
Anxiety reaction 
diagnosis 25 70** H 18 
Auditory hallucinations 4 14 0 4 
Back pain 5 8 11 5 
Blindness, eye complaint 3 5 15 0 
Blunted or in- 
appropriate affect 18 41* gr ox 
Cardiac complaint 3 5 23* + 
Chest pain 5 15 4 0 
Compulsive 6 14 23 7 
Conflict with parents 14 5 0* o** 
Conflict with siblings 11 x ai o* ore 
Conflict with wife 30 56* 14 15 
Confusion (Nonorganic) 11 9 OF Ore 
Dependent 19 23 30 18 
Depression 43 87** 85** 52 
Depressive reaction ` k 
diagnosis 8 18 23 30% 
Difficult concentration 2 256 8 0 
Divorced 12 E ae 5 o* 
Dizziness 6 15 5 o* 
Epigastric complaint 17 9 15 11 
Evasive-defensive 22 o% 8 ee 
Father alcoholic 9 5 O* 7 
Father domineering 4 23* . 0 0 
Father strict 3 18* 8 4 
Fearful 10 32* 8 7 
Feelings of hostility 17 9 8 re 
Feelings of sex Ş lä 
inadequac 0 
Reauplal dificulty 6 8 26** 5 4 
Guilt 13 18 15 26 
Headache 12 18 30 7 
Heavy drinking 32 r 96** 5+ oe 
Hostile a - 23 15 9 
Ideas of reference Wa * 
Immature 17 28 30 : OF 
Impulsive 6 . 11 0* 0 
Inadequacy feelings 8 ar 23 11 
Indecision 3 23°" 7 0 
Inferiority feelings 5 ry laa 23 
Insomnia i 11 14 46** 30* 4 
Irritable 11 30 26 9 
Loss of appetite 7 18 38* 15 
Loss of interest 2 23m 4 
Married 58 100** tate 0 
Mother physically ill 4 14 41 
Neck pains 3 5 15 7 0 0 
Nervousness 30 37 69* 48 
Numbness 6 14 8 
Obsessions 1 18** 30** ot 
Paranoid delusions 10 0 
Paranoid trends 10 14 o% o* or 
TABLE 2 (Continued)


Greater than GA Equal to or less than GA
Check List Item GA 2-7-8 2-7 2-7-4 2-7-8 2-7 2-7-4
Passive 8 18 8 4 
Poor work adjustment 43 45 23 23 
Restless 13 23 5 11 
Rigid 3 15 0 0 
Rumination 1 28** 15* + 
Schizoid 8 46** 8 4 
Schizophrenic 

reaction diagnosis 16 50** o** ore 

Suicide attempt 6 9 15 0 
Suicidal preoccupation 12 23 o* 11 
Suspiciousness 9 9 o* on 
Tearfulness 7 14 8 4 
Tension 21 41 69** 527s 
Tremor 14 26 14 ge 
Unmarried 30 82** o** 7 
Weak, tired, fatigued 9 32* 38* 7 
Weight loss 6 14 0 4 
Wife pregnant or post 5 23 15 0 
Withdrawn, introversive 16 gis 15 4 
Worrying 9 46** 19 9 

* Difference significant at .05 level. 

** Difference significant at .01 level. 

tingency rules was formulated for each type which
would narrowly specify the characteristics of the
profile type.

The case records for each of the three types were
studied in the following manner. First, all cases for
the year 1956 whose MMPI profile fit the rules and
the criteria outlined above were selected. These
records were read. They consisted of psychiatric
discharge summaries and social case history data. This
first reading gave a subjective clinical impression of
the patients in the group. These records were then
re-read and placed on a five-point scale from poor
to excellent for goodness of fit to the preliminary
subjective clinical formulation. Rules were then
revised to eliminate cases which did not seem to fit
well. Next, all records from the additional years of
1952 and 1953 were read and rules were again
refined to eliminate cases which seemed inappropriate.
Finally, the records for the remaining three years
(1954, 1955, and 1957) were read and cases which
did not receive high ratings for the developing types
were considered to be misses. The cases which were
eliminated in the process of refining the rules are
not reported here in detail, although some
characteristics of them will be discussed below.

The checklist was based on one developed
empirically by Cantor (1952) who tallied descriptive terms
appearing in discharge summaries of psychiatric
patients in the Minneapolis Veterans Administration
Hospital. For the present study, three judges
independently checked the list for each case. When two
of the three judges agreed, the item was included in
the frequency counts.


RESULTS


Table 1 gives the mean T scores of the 
MMPI scales for each of the profile types. 

Table 2 presents the incidence of the check- 
list items for the three code types and for pa- 
tients in general (Rosen, 1952). Items which 
were of less than 10% frequency for at least 
one of the groups have been omitted from the 
table. Significances of the differences in item 
frequencies between the groups were com- 
puted from a nomograph for calculating the 
significance of the difference between two
percentages (Lawshe and Baker, 1950).
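
For readers without access to the nomograph, a comparable check can be sketched with a pooled two-proportion z test. This is a substitute technique and not the nomograph procedure itself; the function name `two_proportion_z` is hypothetical, and the size of the general abnormal sample is not stated in this section, so the value 200 below is a placeholder assumption.

```python
from math import erfc, sqrt


def two_proportion_z(p1: float, n1: int, p2: float, n2: int) -> tuple[float, float]:
    """Pooled two-proportion z test; returns (z, two-tailed p value)."""
    pooled = (p1 * n1 + p2 * n2) / (n1 + n2)
    se = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    z = (p1 - p2) / se
    p_value = erfc(abs(z) / sqrt(2))  # two-tailed normal probability
    return z, p_value


# Depression: 87% of the 22 cases of the 2-7-8 type vs. 43% of patients-in-general.
# N_GA = 200 is a placeholder; the GA sample size is an assumption.
N_GA = 200
z, p = two_proportion_z(0.87, 22, 0.43, N_GA)
print(f"z = {z:.2f}, two-tailed p = {p:.5f}")
```

With small groups such as these (N = 13 to 27), the normal approximation is rough, so results near a cutoff should be read with caution.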


2-7-8 Type (N = 22) 


The following set of rules was developed to 
specify the 2-7-8 MMPI code type: 


1. D, Pt, and Sc over T score 70.

2. D minus Sc less than 15 T scores.

3. Pt and Sc less than 20
or Pt or Sc greater than 100 in which case
discrepancy can be greater),

4. If Pt is peak, include only if all other scales
are below 90 and Pt is not more than 5 T
scores greater than D.

5. Si greater than Ma.

6. Ma below T score 70.

7. D minus Hs more than 10 T scores.
TABLE 3


COMPARISONS BETWEEN PROFILE TYPES ON CHECKLIST ITEMS


Check List Item 278 > 27 |278 > 274 | 27 > 278 | 27 > 274 |274 > 278 | 274 > 27 


Anxiety < l 


Auditory hallucinations a 


Blindness, eye complaint * 


Blunted or inappropriate affect ne cii < 


i Chest Pain - 


Conflict with wife ** +e 


Depression m * 


Difficult concentration a 


Dizziness z 


. . ** 
Father domineering “ 


Fearful j 


Feeling of sex inadequacy 


Financial difficulty * 


Heavy drinking + ei 
ee ie i 


Impulsive ‘ e ! 


Ideas of reference 


Inadequacy feelings 


se ** 
Indecision 


Inferiority feelings =f TA 


Insomnia * 


$ ** Se 
Loss of interest mee 


Mother physically ill # = bias ee 


Numbness of = 


Obsessions a 


++ 


Poor Work Adjustment 


Quiet * + 


Rigid 


Rumination š 


* Difference significant at .05 level. 
** Difference significant at .01 level.
TABLE 3 (Continued)
Check List Item 278 > 27 | 278 > 274 | 27 > 278 | 27 > 274 |274 > 278 | 274 > 27 
Schizoid i ka l 
| 

Sexual difficulty = 
Suicide attempt Y * 
Suicidal preoccupation we | * 
Suspiciousness A ¥ 
‘Tremor = * 
Weak-tired-fatigued * * 
Weight loss * 
Wife pregnant or post * + 
Withdrawn-introversive à sk 
Worrying * 


8. Pt minus Pd more than 10 T scores. 
9. Pt minus Pa more than 10 T scores. 


Patients obtaining profiles of the 2-7-8 
type were checked more frequently than pa- 
tients-in-general on the following items which 
were significant at the .01 level: depression, 
difficult concentration, inadequacy feelings, 
indecision, inferiority feelings, loss of inter- 
est, obsessions, rumination, schizoid, schizo- 
phrenic reaction diagnosis, unmarried, weak- 
tired-fatigued, and withdrawn-introversive. 
Significantly different in frequency from pa- 
tients-in-general at the .05 level were: blunted 
or inappropriate affect, father strict, fearful, 
and ideas of reference. Diagnostically, these 
patients appeared to fit the characterization 
of the group of pseudoneurotic schizophrenics 
described by Hoch and Polatin (1949). 


2-7 Type (N = 13) 
The following set of rules was developed to 
specify the 2-7 MMPI code type: 


1. D and Pt over T score 70.

2. D greater than Pt.

3. Pt minus Sc at least 15 T scores.

4. Si below T score 70.

5. Pd and Pa below T score 80 unless D and/or
Pt not over 85 in which instance Pd and Pa
must be below 70.

Patients obtaining profiles of the 2-7 type 
were checked more frequently than patients- 
in-general on the following items which were 
significant at the .01 level: anxiety, anxiety 
reaction diagnosis, depression, insomnia, mar- 
ried, obsessions, tension, and worrying. Sig- 
nificantly different in frequency from pa- 
tients-in-general at the .05 level were: car- 
diac complaint, loss of appetite, nervousness, 
rumination, and weak-tired-fatigued. Diag- 
nostically these patients seemed to fit the 
description of the anxiety reactions or obses- 
sive-compulsive personalities. 


2-7-4 Type (N = 27) 


The following rules were developed to 
specify the 2-7-4 type: 


1. D, Pd, and Pt above T score 70.

2. Pt minus Sc at least 5 T scores.

3. Pt minus D 10 T scores or less.

4. D minus Pd 20 T scores or less.

5. Among scales D, Pd, and Pt, two scores
below T score 100.

6. If Pd is peak, include only if Pd does not
exceed D or Pt by more than 10 T scores.

7. Si below T score 70.

8. Ma less than Sc.

9. Ma above T score 40.

10. If Ma is below T score 50 include only if Si
does not exceed Ma by more than 15 T scores.
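
The contingency rules for the 2-7-4 type lend themselves to a mechanical check. The following is a minimal sketch, not the authors' procedure: the function name `fits_2_7_4` and the dictionary layout are hypothetical, "peak" in Rule 6 is read here as highest among D, Pd, and Pt, Rule 5 is read as "at least two," and the example profiles are invented for illustration.

```python
from typing import Mapping


def fits_2_7_4(t: Mapping[str, float]) -> bool:
    """Check a profile of T scores against the ten rules for the 2-7-4 type."""
    d, pd, pt, sc, ma, si = (t[k] for k in ("D", "Pd", "Pt", "Sc", "Ma", "Si"))
    pd_is_peak = pd > d and pd > pt  # assumption: peak among D, Pd, Pt only
    rules = [
        min(d, pd, pt) > 70,                          # Rule 1
        pt - sc >= 5,                                 # Rule 2
        pt - d <= 10,                                 # Rule 3
        d - pd <= 20,                                 # Rule 4
        sum(x < 100 for x in (d, pd, pt)) >= 2,       # Rule 5
        (not pd_is_peak) or pd - min(d, pt) <= 10,    # Rule 6
        si < 70,                                      # Rule 7
        ma < sc,                                      # Rule 8
        ma > 40,                                      # Rule 9
        ma >= 50 or si - ma <= 15,                    # Rule 10
    ]
    return all(rules)


# Hypothetical profiles, for illustration only.
typical = {"D": 90, "Pd": 83, "Pt": 83, "Sc": 68, "Ma": 60, "Si": 55}
high_si = dict(typical, Si=75)          # fails Rule 7
low_ma = dict(typical, Ma=45, Si=65)    # fails Rule 10
print(fits_2_7_4(typical), fits_2_7_4(high_si), fits_2_7_4(low_ma))
```

Keeping each rule as a separate entry in the list makes it easy to report which rule excluded a given case, which matters when the cutting points are acknowledged to be somewhat arbitrary.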
Patients obtaining profiles of the 2-7-4
type were checked more frequently than pa- 
tients-in-general on the following items which 
were significant at the .01 level: depressive 
reaction diagnosis, financial difficulty, heavy 
drinking, and tension. Significantly different 
from patients-in-general at the .05 level were: 
anxiety, conflict with wife, and insomnia. 
Diagnostically these patients appeared to fit 
the stereotype of the oral-dependent alco- 
holics. 

Items which were less frequent for any 
profile type than for patients-in-general can 
also be found in Table 2. It will be noted 
that patients-in-general tend to be described 
more frequently as evasive and defensive than 
either the 2-7-8 or the 2-7-4 profile types.

Table 3 shows the check list items which 
differed significantly between the various 
combinations of profile types. These differ- 
ences occur despite the fact that all three 
profiles are of the same two-point code type. 


DISCUSSION 


The rules for specifying profiles have been 
presented in detail. With experience,
deviations from the specified configurations are
interpretable. For example, 2-7-8s with higher
4 (Pd) than permitted by the rules tended 
to have the general characteristics of the type 
but also tended to have histories of acting- 
out which frequently included police contacts 
and penitentiary sentences. With 6 (Pa) 
higher than permitted by the rules, an in- 
crease in hostility, in suspiciousness and, par- 
ticularly, evidence of overt psychosis was 
found. In short, inferences which could be 
made from single scale or pair-wise scale 
deviations from the basic configuration could 
be used to modify the interpretation of the 
profile. Similarly, although age and intelli- 
gence were restricted in deriving the descrip- 
tions, cases which deviated on these variables 
tended to fit less well but usually were not 
complete misses. The rules have been devised 
to select cases which are highly purified for 
research purposes but in many instances the 
cutting point necessarily was somewhat arbi- 
trary and resulted in the exclusion of cases 
which would have fit fairly well. For
example, for the 2-7-8 profile type, the rule of
D minus Sc of 15 T scores or less (Rule 2)
was set to exclude depressive reactions even
though some pseudoneurotic schizophrenics
may be missed by rigid application of this
rule.

It should be noted on the one hand that 
the three MMPI profile types reported would 
be expected to show many common traits re- 
lating to anxiety and depression since they 
were all of the 2-7 code type; on the other 
hand, there are many clear-cut differences de- 
spite the fact that they are all of the same 
two-point code type. For the sake of brevity 
more extensive clinical narrative descriptions
which elaborate on the constellations of traits 
and contain inferences derived from them 
have been omitted. However, such descrip- 
tions have been developed for clinical use. 

Previous research (Gilberstadt, 1952) uti- 
lizing the CC index of MMPI profile simi- 
larity devised by Hathaway and Meehl 
(Sutton, 1952; Williams, 1952) served as a 
pilot study for the present research. The 
study of the 2-7-8 profile type is essentially 
a cross-validation of earlier findings. In both 
studies the cases obtaining profiles of this
type were highly similar to the syndrome
of pseudoneurotic schizophrenia described by
Hoch and Polatin (1949).

Further validation of 
the Q sort technique is a ee by 
proach, when applied with descripti ede 
rived i from MMPI Atlas eana 
Meehl, 1951) case histori 
ax ae F i ies, has already been 

Current research includ. Se 
quent MMPI code types i one fre- 

oom oa : ) s in scoj 
to construct validation with other tests, 5 


Summary 


Contingencies for specifying three
frequently occurring types of MMPI profiles of
the D-Pt variety have been developed from a
pool of Veteran’s
of reference for profile interpretation or as a
means of classification for research purposes. 
THE EFFECTIVENESS OF THE BENDER-GESTALT
IN DIFFERENTIATING A FLIGHT GROUP FROM
AN AGGRESSIVE GROUP OF ADOLESCENTS


LOREN V. COROTTO AND ROBERT H. CURNUTT
Napa State Hospital, Imola, California


There is some evidence to suggest that the
Bender-Gestalt measures some aspect of ego
strength. Pascal and Suttell (1951) developed
an objective scoring system for the B-G
that consists of tabulating certain deviations
in the reproductions of the designs. These
authors believe that “if . . . we limit 7 m 
nition of ego functioning to the al oie) be 
reproduce faithfully the B-G eae 
presented, then we may say that 2 ge 
lies on a continuum from very = = 
Be ee ates cea on this quantiates 

he score obtained on this qt f 
iih the greater is me ndvi pa = ea 
strength. One of the authors in pno Yer wer’ 
was unable to ones F ae oo = sat 
with a Z score on the B- 

1954). In this study, however, an tag 
will be made to follow up ee ved “i 
at least by inference, to see : i n ae 
determined whether the B-G does mesne 
some aspect of ego strength when u 
i criterion. oe p 

It has been noted
that adolescents who tend to react aggressively
to their environment tend to produce
better B-G records than do adolescents who
tend to utilize escape or flight behavior. It
is generally assumed in the literature that
people who utilize aggressive behavior tend
to have more ego strength than individuals
who tend to rely upon flight behavior. If the 
B-G does measure an aspect of ego strength, 
it should tend to differentiate between aggres- 
sive individuals and individuals who utilize 
flight. It is precisely this assumption which 
will be dealt with in this study. 

The hypothesis to be tested may be stated 
as follows: The Pascal and Suttell scoring 
system of the B-G test will not significantly 


differentiate adolescents who react primarily 
with flight behavior in contrast to those who 
react primarily with aggressive behavior. 


PROCEDURE 


The records of 92 adolescent patients who had 
been hospitalized in a children’s psychiatric unit were 
withdrawn from the files. The patients in the ag- 
gressive group and in the flight group were matched 
individually on the basis of age, sex, and education.
This was done since it is already known that both 
age and education are related to B-G scores (Bender, 
1938; Pascal & Suttell, 1951). All individuals who 
had either a diagnosis or a known history of psy- 
chosis, mental deficiency, or organic brain damage 
were excluded. 

The flight group was composed of the adolescents
who had a record of two or more runaways from
home since the age of 10 but with no resultant
aggressive behavior. The aggressive group was
defined as those subjects who had a record of two or
more physically aggressive acts: e.g., destruction of
property, fighting, or physical assault, but without
any runaway behavior.


The selected sample was independently matched
by one of the authors prior to obtaining the B-G


tained results, 


a significant difference, thus sustaining the
null hypothesis. However, when the paired
individuals were examined on a sex basis,
significant differences were obtained: i.e.,
the male scores had tended to cancel out the
female scores. Hence the results can be
explained purely on the basis of sex
differences. In view of this finding, an
analysis restricted to the 27 individually
matched pairs of males in the original sample 
was undertaken with the result that the B-G 
did significantly differentiate between the 
aggressive group and the flight group. The 
difference was significant at the .001 level 
when applying a two-tailed test of signifi- 
cance. In other words, the aggressive males 
had lower B-G scores, i.e., more ego strength 
as operationally defined by Pascal and Sut- 
tell, while the flight males had the higher 
B-G scores. 

When the 19 individually matched pairs 
of females in the original sample were exam- 
ined, an inverse relationship to that of the 
males was obtained. The difference was sig- 
nificant at the .001 level. In contrast to the 
male sample, the female flight group was 
found to have the lower B-G scores (more 
ego strength) while the aggressive female 
group obtained the higher B-G scores. The 
marked reversal of the behavioral correlates 
associated with the lower B-G scores by the 
female sample in comparison to the male 
sample would seem to emphasize caution in 
attempts to relate behavior to B-G scores 
without thoroughly allowing for sex differ- 
ences. Since Pascal and Suttell (1951) were 
unable to demonstrate a significant sex dif- 
ference in the standardization of their scoring 
system, the possibility is raised that social 
and cultural influences may affect the behav- 
ioral expression of that aspect of ego strength 
tapped by the B-G. 
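
The way opposite effects in the two sexes can cancel in a pooled comparison may be illustrated with a small sketch. The paired differences below are hypothetical values invented for illustration, not the study's data, and the exact sign test is used only as a simple stand-in, since the text does not name the test that was applied.

```python
from math import comb
from statistics import mean


def sign_test_p(diffs: list[float]) -> float:
    """Two-tailed exact sign test on the nonzero paired differences."""
    nonzero = [d for d in diffs if d != 0]
    n = len(nonzero)
    k = sum(d > 0 for d in nonzero)
    tail = min(k, n - k)
    p = 2 * sum(comb(n, i) for i in range(tail + 1)) / 2 ** n
    return min(1.0, p)


# Hypothetical differences: aggressive member's B-G score minus flight member's.
male_diffs = [-12, -9, -15, -8, -11, -14, -7, -10]    # aggressive boys score lower
female_diffs = [10, 13, 9, 12, 11, 8, 14, 9]          # aggressive girls score higher
pooled = male_diffs + female_diffs

for label, diffs in (("males", male_diffs), ("females", female_diffs), ("pooled", pooled)):
    print(label, mean(diffs), sign_test_p(diffs))
```

In this illustration each sex shows a consistent difference, yet the pooled differences average to zero and the pooled test finds nothing, which mirrors the pattern the authors report.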

These findings may have implications for 
the treatment of adolescents whose behavioral 
symptoms primarily revolve around aggres- 
sion and flight. Since the results indicate that 
boys who utilize flight reactions have rela- 
tively less ego strength than do aggressive 
boys, then possibly the therapist might wish 
to direct his efforts more towards supporting 
and developing the ego of the individual. In
the aggressive group of boys, however, the 
aims of the therapist might be different. In 
this case, the therapist’s efforts might be 
directed more towards channeling unaccept- 
able behavior into more socially acceptable 
modes of response. Similar considerations 
would seem applicable to a female population. 


SUMMARY 


In order to explore the effectiveness of the 
B-G in differentiating a group of adolescents 
who react primarily in an aggressive manner 
from a group which reacts primarily in a 
flight manner, 46 pairs of adolescents were 
matched by age, sex, and education. When 
the groups were segregated on a sex basis, 
significant differences (.001) were obtained. 
Adolescent girls who utilize primarily flight 
behavior tend to have lower B-G scores (more 
ego strength as operationally defined by Pas- 
cal & Suttell, 1951) than do adolescent 
girls who react primarily with aggressive be- 
havior. Adolescent boys who tend primarily 
to react with aggressive behavior, however, 
tend to have lower B-G scores (more ego 
strength) than do adolescent boys whose pri- 
mary behavior reactions are characterized by 
running away. Some implications of these 
findings are discussed.


THE EPPS PROFILE STABILITY COEFFICIENT


DONALD W. FISKE, KENNETH HOWARD, AND WILLIAM RECHENBERG
University of Chicago 


An unusual and almost uninvestigated feature 
of the Edwards Personal Preference Schedule 
(Edwards, 1957) is the Profile Stability Coeffi- 
cient, the correlation (over needs) of the two 
sets of part scores for each S. This correlation 
was obtained for each of 135 college students. 
While Edwards uses r as the stability coefficient, 
we used rho, since the hierarchy of the needs 
seems of primary concern. Edwards’ distribution 
and ours were highly congruent, even to a second 
mode at .85-.89. Our mean, obtained via a 
rho-squared transformation, was .73, while his 
was .74. 
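
The coefficient described above can be sketched computationally. What follows is a minimal illustration under stated assumptions, not the authors' procedure: it computes Spearman's rho between the two sets of part scores for a single S by ranking each set (tied scores receive average ranks) and correlating the ranks. The function names and the part scores are invented for illustration.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values receive the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        tied_rank = (i + j) / 2 + 1  # mean of 1-based positions i..j
        for k in range(i, j + 1):
            ranks[order[k]] = tied_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Product-moment correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def profile_stability_rho(part_a, part_b):
    """Spearman rho: the product-moment correlation computed on ranks."""
    return pearson_r(average_ranks(part_a), average_ranks(part_b))


# Hypothetical part scores for one S, one entry per need (invented values).
part_a = [7, 5, 9, 3, 6, 8, 2, 4, 10, 1, 6, 5, 7, 3, 8]
part_b = [6, 5, 8, 4, 7, 9, 1, 3, 9, 2, 5, 6, 8, 2, 7]
rho = profile_stability_rho(part_a, part_b)
print(round(rho, 2), round(rho ** 2, 2))
```

The second printed value is rho squared, the transformation mentioned in the text; how the transformed values were averaged across Ss is not specified in the report.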

Inspection indicates that the stability coeffi- 
cient is in part a reflection of the relative scat- 
ter or differentiation of the S’s profile of needs. 
It is also associated with separate scale scores. 
(In these analyses we used the square of rho to 
reduce the skewness of the distribution of sta- 
bility indices). For EPPS variables, the stability 
score correlated .37 with n Autonomy, .30 with 
n Exhibition, .23 with the Consistency score, and 
.17 with both n Achievement and n Dominance; 
it also correlated —.28 with n Abasement, —.27 
with n Order, and —.21 with n Deference. (.22 
is significant at the .01 level, .17 at the .05). 
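
The significance thresholds quoted in the parenthesis can be checked approximately. The sketch below assumes that the correlations are taken over the 135 Ss (df = 133), that the test is two-tailed, and that the normal deviate may stand in for Student's t at this sample size. The helper name `critical_r` is hypothetical. It uses the relation r = t / sqrt(t² + df).

```python
from statistics import NormalDist


def critical_r(n, alpha):
    """Smallest |r| reaching two-tailed significance at alpha.

    Uses the normal deviate in place of Student's t, which is a close
    approximation when n is well above 100.
    """
    z = NormalDist().inv_cdf(1 - alpha / 2)
    df = n - 2
    return z / (z * z + df) ** 0.5


for alpha in (0.05, 0.01):
    print(alpha, round(critical_r(135, alpha), 2))
```

Under these assumptions the two rounded values agree with the .17 and .22 given in the text.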

Among 19 other test scores, it correlated .22 
with Dominant and .18 with Active on the Thur- 
stone Temperament Schedule. It also correlated 
.19 with Expressed Control and —.22 with 
Wanted Control scores from the FIRO-B Scale 
(Schutz, 1958, pp. 57-65). (No association was 
found with two tests of intelligence). These correlations
are consistent with those for the EPPS
scores. Consistency of a need profile over
groups of items appears to be associated with
strivings for superior and controlling status.

For the same Ss, the relationship between positive
self-ratings and intra-individual variability
was negative for each of six traits, although
dependable relationships were found between
such variability and the same set of test scores
(van der Veen & Fiske, 1960). However, the
temporal variability of mean peer ratings of
assertiveness and number of useful ideas contributed
to a group problem solving session was
negatively related to self-descriptions of
assertiveness and sociability (Fiske, 1960). These
several findings suggest that individuals who
describe themselves in positive and active terms
may be less variable in both their test responses
and their behavior.


MANIFEST NEEDS AND MANIFEST ANXIETY ¹


CARL D. WILLIAMS, ROBERT B. TALLARICO, AND JAMES T. TEDESCHI
University of Miami 


The present study was designed to contribute 
additional empirical properties of “anxious” and 
“nonanxious” groups on the Taylor Manifest 
Anxiety Scale (MAS) by comparing their mani- 
fest needs as measured by the Edwards Personal 
Preference Schedule (EPPS). The MAS was 
given to 543 college students, and extreme groups 
of 65 persons each were identified. The EPPS 
was given to the 38 MAS high scorers and the 
39 MAS low scorers who were willing to partici- 
pate upon invitation. There were 23 male and 15 
female high scorers and 19 male and 20 female 
low scorers. The MAS scores of the anxious 
group ranged from 21 to 34. The means for the 
participants and the nonparticipants from the 
65 high MAS scorers were 25.92 and 26.78, 
respectively. This difference was not statistically 
significant. The MAS scores of the nonanxious 
group ranged from 0 to 4. The means for the
participants and the nonparticipants from the 65 
low MAS scorers were 2.46 and 2.31, respectively.
This difference was not statistically significant. 

Significant differences were found between 
high and low MAS scorers for three of the mani- 
fest needs in men and for five of the manifest 
needs in women. For men, the mean EPPS scores 
for the low and high groups, respectively, were


1 This investigation was supported in part by a 
grant from the Penrose Fund of the American Philo- 
sophical Society. An extended report of this study 
may be obtained without charge from Carl D. Wil- 
liams, Department of Psychology, University of 
Miami, Coral Gables, Florida, or for a fee from the 
American Documentation Institute. Order Document 
No. 6290 from ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress; 
Washington 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 


8.74 and 11.78 for succorance (p < .05), 18.58
and 14.35 for dominance (p < .01), and 8.47
and 12.26 for abasement (p < .01). For women,
similar comparisons were 13.55 and 10.87 for
deference (p < .05), 19.20 and 16.13 for intraception
(p < .05), 9.90 and 15.47 for abasement
(p < .01), 12.85 and 8.20 for endurance
(p < .05), and 10.25 and 13.27 for aggression
(p < .05).

Abasement is the only manifest need for which 
significant differences exist for both men and 
women in the present study. The finding that 
both men and women nonanxious subjects scored 
significantly lower than anxious subjects on 
abasement suggests the possibility that the non- 
anxious groups may be characterized by a more 
favorable self-concept than the anxious groups. 

The data for five of the six remaining EPPS 
variables on which significant differences were 
found were compared with Edwards’ normative 
data. Edwards has reported significant differences 
between men and women in his normative data 
for all five of the variables compared. These 
comparisons showed that high MAS scorers dif- 
fered from low MAS scorers on the variables of 
succorance and dominance for men and defer- 
ence, intraception, and aggression for women in 
that the means of the anxious groups were more 
similar to the scores made by the opposite sex 
than to those made by the same sex in Edwards’ 
normative group. 

This result and the finding of lower scores on 
abasement for the nonanxious subjects appear 
to support the view that, as compared to low 
scorers, high scorers on the MAS are persons 
who are not as interested in making a good im- 
pression on others and may thus tend to say 
deviant things about themselves.


THE INFLUENCE OF ORDER OF ADMINISTRATION
ON SELF-CONCEPT MEASURES ¹


DAVID S. PALERMO 

Institute of Child Development and Welfare, University of Minnesota 
AND JOHN G. MARTIRE
Southern Illinois University 


Many studies have reported relationships be- 
tween measures of actual- and ideal-self-concept 
and have frequently related measures of self-con- 
cept to other indices. This study is designed to 
investigate the effects of the order of adminis- 
tration of actual- and ideal-self-concept measures 
on their relationship to each other and their 
relationship to a third measure. 

Self-concept measures were administered to 
120 male and 120 female introductory psychol- 
ogy students at Southern Illinois University. 
Each S was given three trait ranking forms and 
a set of anchoring definitions for 26 traits used 
in previous studies by the second author. The Ss 
were read the anchoring definitions and then 
instructed to read the directions at the top of 
the first sheet and make their rankings accord- 
ingly. When they finished the first set they were 
to go on to the second and then the third. Dif- 
ferential instructions at the top of the three 
ranking sheets directed Ss to rank the traits so 
as to provide an assessment of self-ideal (sI), 
actual-self (As) and social-self (Ss). Half of 
the female and half of the male Ss responded to 
the rankings in the order As, sI, and Ss (Order 
I), and the other half of the Ss responded in the 
order sI, As, Ss (Order II). These orders were 
selected since most current interest is in ideal- 
and actual-self concepts. 

Pearson r’s obtained between the two orders 
by sex group for the mean rankings of the 26 
traits of the three scales ranged between .82
and .91 (all significant < .01 level). These corre- 


1 An extended report of this study may be ob- 
tained without charge from David S. Palermo, In- 
stitute of Child Development and Welfare, Univer- 
sity of Minnesota, Minneapolis 14, Minnesota, or 
for a fee from the American Documentation Insti- 
tute. Order Document No. 6292 from ADI Auxiliary 
Publications Project, Photoduplication Service, Li- 
brary of Congress; Washington 25, D. C., remitting 
in advance $1.25 for microfilm or $1.25 for photo- 
copies. Make checks payable to: Chief, Photodupli- 
cation Service, Library of Congress. 


lations indicate that the relative position of each 
trait tended to remain the same on all three 
scales for both sexes regardless of order of 
administration. 

Interscale correlations for each sex under each
order ranged [...].
These correlations reached the .05 level of con- 
fidence with but one exception. The male corre- 
lation under Order I between As and Ss (.26) 
was not significant. Of particular interest are the 
correlations between Ss and the other two meas- 
ures under the two orders. Ss correlated .70 
and .88 for males and females, respectively, with
the sI scale when the sI scale was given second 
but only .58 and .74 when the sI scale was 
given first. The correlations between Ss and As 
were .52 and .68 for males and females, respectively,
when As was given second and .26 and .62
when As was given first. In each of the comparisons
the correlation between the Ss scale [...]
third in all cases [...]


WECHSLER-BELLEVUE PATTERNS OF PSYCHOPATHS ¹


LEONARD KINGSLEY 
Walter Reed Army Hospital, Washington, D. C.²


Wechsler (1944) reported that psychopaths 
generally obtain higher IQs on the Performance 
(P) Scale of his test (W-B) than on the Verbal 
(V) Scale and also described a pattern of sub- 
test characteristics for psychopaths. There have 
been several W-B studies of psychopaths and 
delinquents with differing findings. Wechsler 
(1958) again offers the same IQ and subtest 
characteristics, qualifying his observations to 
adolescent sociopaths, but then adding “further 
experience has shown that it is also applicable 
to the adult male psychopath.” 

Twenty psychopathic prisoners in a military 
disciplinary barracks were matched individually 
for age and education with a group of prisoners 
considered to possess “no psychiatric disease.” 
Both groups were later combined into a general 
prisoner category because of the similarity of 
the results and matched individually for educa- 
tion and intelligence with “normal” controls 
functioning in a military setting.³

P IQ was greater than V IQ, at 1% level of 
confidence, in all 3 groups. This difference was 
not significantly greater in the psychopaths as 
compared to the other prisoners nor in the gen- 
eral prisoner group as compared with the con- 
trols. 

The pattern of subtests postulated by Wechs- 
ler was then considered to represent a theoretical 


1 An extended report of this study may be ob- 
tained without charge from the Secretary, Clinical 
Psychology Service, VA Hospital, Brooklyn 9, N. Y. 
or for a fee from the American Documentation In- 
stitute. Order Document No. 6294 from ADI Aux- 
iliary Publications Project, Photoduplication Service, 
Library of Congress; Washington 25, D. C., remitting
in advance $1.25 for microfilm or $1.25 for
photocopies. Make checks payable to: Chief, Photo- 
duplication Service, Library of Congress. 

2 Now at VA Hospital, Brooklyn, N. Y.

3 The author is indebted to Ernest K. Montague 
for permission to use these W-B protocols. 


rank order of subtests. The average Spearman
rank order correlation between the individual or 
combined Wechsler scores and the theoretical 
rank order was then computed for all 3 groups; 
the correlation was significant for all 3. The 
difference scores between each matched subject 
of each group for the subtests were then com- 
puted to see if this correlation was significantly 
greater in one group vs. the other; these differ- 
ences were not significant. 

Finally, the attempt was made to see if any 
of the individual W-B subtests, disregarding rank 
order, differentiated one group from the other; 
none of the subtests was significantly different 
in the psychopath vs. prisoner groups or in the 
general prisoner vs. control groups. 

The results of this study raise doubts about 
the use of difference scores and subtest patterns 
on the W-B for psychopaths. Wechsler’s ob- 
servations as to the difference between the P 
and V IQs and the order of subtest difficulty 
were found to be present for the psychopaths, 
but these differences and patterns were also pres- 
ent for the nonpsychopathic prisoners and the 
“normal” controls. The same general trend ex- 
isted in all three groups, with no significant 
differences. It is concluded that the IQ differ- 
ences and subtest patterns offered by Wechsler 
and based upon the results of adolescent psycho- 
paths cannot be extended to adult psychopaths 
in a military setting.


THE FACTORIAL VALIDITY OF ITEMS ON THE IPAT ANXIETY SCALE ¹


A. W. BENDIG 
University of Pittsburgh


The IPAT Anxiety Scale (Cattell, 1957) is 
composed of 40 items divided into five subscales 
with each subscale assumedly measuring one of 
Cattell’s five oblique first-order factors (Q3, C, 
L, O, and Q4) whose intercorrelations define 
Cattell’s second-order factor of “anxiety” (UI 
24). These items were assembled into this new 
scale on the basis of their factor loadings in other 
inventories, and an analysis of the factor struc- 
ture of the items in this form has not been 
reported to date. 

The scale was anonymously administered to 
200 undergraduate and graduate students in edu- 
cational psychology classes (100 men and 100 
women, mean age 27.8 years), and the items were 
intercorrelated by the usual product-moment 
method. Five factors were extracted by the complete
centroid method, and the factors were rotated
to oblique simple structure using the analytic
criterion of rotation developed by Pinzka
and Saunders. The correlations among the oblique
first-order factors were factored and the
second-order factors also rotated to oblique simple
structure.²


1 An extended report of this study may be ob- 
tained without charge from A. W. Bendig, Depart- 
ment of Psychology, University of Pittsburgh, Pitts- 
burgh 13, Pa, or for a fee from the American 
Documentation Institute. Order Document No. 6295 
from ADI Auxiliary Publications Project, Photo- 
duplication Service, Library of Congress; Washing- 
ton 25, D. C., remitting in advance $1.25 for micro- 
film or $1.25 for photocopies. Make checks payable 
to: Chief, Photoduplication Service, Library of 
Congress. 

2 The author’s appreciation is extended to Gary
Lotto and William B. Kehl of the University’s 
Computation and Data Processing Center for pro- 
viding facilities for the statistical analysis. 


The obtained item factor loadings showed 
little relationship to the assumed factor content, 
and none of Cattell’s five factors could be clearly 
identified in this analysis. Of the 40 items 16 
had no loadings above .29 on any of the factors. 
The correlations among the primary factors 
ranged from .33 to —.12, and two second-order 
factors were found instead of the unitary UI 24 
factor expected with the correlation between 
these two oblique factors being approximately 
.01. Bifactor graphs indicated that further hand
rotation would not materially improve simple 
structure or increase similarity to Cattell’s
assumed factor structure.

Internal consistency reliabilities were obtained 
for four of the scores suggested in the test
manual (Cattell, 1957): Total Anxiety, Covert
Anxiety, Overt Anxiety, and the Difference Score
(Bendig, 1959). Using Kuder-Richardson Formula
20, the reliabilities of these scores for this sample
of 200 Ss were .81, .63, .76, and .24.
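
For readers unfamiliar with the reliability index named above, a minimal sketch of Kuder-Richardson Formula 20 follows. It assumes dichotomously scored (0/1) items and the population variance of total scores; the report does not say how the scale items were scored for this computation. The function name and the response matrix are invented for illustration.

```python
from statistics import pvariance


def kr20(responses):
    """Kuder-Richardson Formula 20.

    responses: one list of 0/1 item scores per person.
    Returns (k / (k - 1)) * (1 - sum(p_i * q_i) / variance of total scores).
    """
    n_items = len(responses[0])
    n_persons = len(responses)
    totals = [sum(person) for person in responses]
    var_total = pvariance(totals)
    sum_pq = 0.0
    for item in range(n_items):
        p = sum(person[item] for person in responses) / n_persons
        sum_pq += p * (1 - p)
    return (n_items / (n_items - 1)) * (1 - sum_pq / var_total)


# Invented 5-person, 4-item response matrix.
toy = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(round(kr20(toy), 3))
```

The index rises toward 1 as the items covary more strongly relative to their individual variances, which is why a difference score built from two correlated subscores tends to show low values.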

The results, unless attributable to a sampling 
or methodological artifact, strongly suggest that 
the IPAT Anxiety Scale does not measure what 
it purports to measure, namely the unitary UI 
24 second-order factor of anxiety, but instead 
confounds two independent second-order factors.


OBJECTIVE CONFIGURAL RULES FOR DISCRIMINATING
PSYCHOTIC FROM NEUROTIC MMPI PROFILES ¹


PAUL E. MEEHL
University of Minnesota

AND


One contribution which psychologists are 
expected to make toward the clinical assessment
of psychiatric patients is helping determine
the presence or degree of psychotic
tendencies. This problem presents itself for
reasons of varying pragmatic import, such as:
What formal (nosological) diagnosis should 
be given to the patient? To what extent is 
there a danger of serious acting out if this 
patient is seen on an outpatient basis? Is
there a psychotic process or structure behind 
the superficially neurotic manifestations, such 
that methods of treatment appropriate for 
neurotic patients are likely to be inefficacious 
or even deleterious? What is the long-term
prognosis, requiring consideration in voca- 
tional and educational guidance, advice to 
relatives, recommendation to a rating board, 
communications to a family physician, social 
agency, or court? Should a conservative staff 
feel entitled to utilize the more radical kinds 
of treatment, e.g., regressive electroshock, 
when their policy is to avoid such procedures 
in the treatment of psychoneuroses? Is this 
the sort of patient who should probably be 
treated by Dr. X, who is especially gifted in 
the treatment of mild schizophrenic condi- 
tions and, unlike our other two available 
therapists, has a preference for working with 
them? We shall not discuss the utility in clini- 
cal decision-making of answering these ques- 
tions. Suffice it to say that the assessment of 
“psychotic tendency,” phrased in one form or 
another, is one of the tasks with which the 
clinical psychologist in most installations will
at times be confronted.

The MMPI is among the tests utilized for
this purpose. If patients could be effectively
sorted into nosological categories simply by
identifying their highest MMPI score, each 
of the rubrics in the psychiatric nomenclature 
having a one-to-one correspondence with the 
MMPI variables as named, the present in- 
vestigation would be pointless. It is, however, 
well known that this simple procedure does 
not work. This is one reason why clinicians 
prefer to characterize profiles by code rather 
than by the original scale names. In the Uni- 
versity of Minnesota Hospitals the mimeo’d 
profile form used in patients’ charts has only 
code designation for scales. (See Hathaway, 
1947; Hathaway & Meehl, 1951a, 1951b; 
Meehl, 1950b; Welsh & Dahlstrom, 1956.) 
It is true that one can do significantly better 
than chance by paying attention only to the 
highest one or two T scores (Hathaway & 
Meehl, 1951a; Meehl, 1959a) but the amount 
of improvement over chance, while it testifies 
to the presence of some validity in the in- 
strument, is not great enough to be very use- 
ful. Clinicians even moderately familiar with 
the MMPI have, therefore, been accustomed 
for over a decade to interpreting the results 
by paying attention to the profile pattern in 
some kind of joint relation to the overall 
elevation and especially the elevation of the 
most deviant scores. 

While it would not be surprising to find 
that the “clinical eye” had trained itself to 
recognize configurations not readily identified 
by conventional linear methods of statistical 
analysis (Horst, 1954; Lubin & Osburn, 
1957), it might be presumed that the clinician’s
subjective judgment, however experienced,
assigns less than optimal weights. In
addition to this systematic bias, the human 
judge inevitably throws in some more or less 
random error variance due to his unreliability. 
In trusting the clinical eye (at that stage
of the total decision-making process which
is concerned to classify the profile, although 
not necessarily the patient!) we treat the 
nonoptimality of the clinician’s unverbalized
function, and the temporal instability in its 
application, as a price paid in order to get 
the advantages of a configural approach to 
the profile pattern. Configurality tends to 
make the clinician’s global judgment effec- 
tive; nonoptimal weights and temporal fluc- 
tuation tend to make it ineffective. The net 
efficiency of the clinician’s “judgmental” clas- 
sification or ordering of profiles is the out- 
come of these conflicting forces (Meehl, 
1959a). For this reason, the present investi- 
gation has, in addition to its technological 
aim of providing an aid to MMPI users, an 
intrinsic methodological interest. To what ex- 
tent can the complex configural generaliza- 
tions exhibited by the behavior of MMPI- 
skilled clinicians be frozen into a set of cleri- 
cal operations? If this could be done, it would 
reduce the amount of clinical experience re- 
quired by the test user; more importantly, it 
is probable on theoretical grounds that if the 
essential features being reacted to by the cli- 
nician can be dealt with in an actuarial way, 
even the asymptote of correct decisions will 
be increased (Estes, 1957, p. 615).
More important in the long run than either 
the immediately pragmatic or methodological 
interests, however, is the possible construct 
validity (APA Test Standards Committee, 
1954; Cronbach & Meehl, 1955) of formal 
criteria for the “psychotic profile.” Psycho- 
metric devices should ultimately reach a point 
of development comparable to the laboratory 
technics of internal medicine (biochemical 
tests, biopsy, roentgenology) such that they 
are thought of technologically as on the same 
level with the clinical interview, or ward ratings,
rather than as clever devices for predicting
these latter “criteria” (Meehl, 1959b).
When a patient responds to the verbal stimuli 
which constitute the MMPI pool in a manner 
characteristic of previously studied patients 
recognized by the familiar clinical criteria as 
schizophrenic, but the clinical staff are not 
inclined to so diagnose him because of the 
absence of the traditionally emphasized—and, 
from Bleuler’s point of view, mainly secondary—“frankly
schizophrenic” symptoms, it is
debatable which of these facts about the pa- 
tient’s behavior should be given the greater 
weight as a probabilistic indicator of his in- 
ternal psychological state, structure, and dis- 
positions. The development of an objective set 
of profile pattern criteria for the identifica- 
tion of recognized, diagnosed manic-depressive 
and schizophrenic psychotics has numerous
possibilities with regard to subsequent lines 
of investigation directed at achieving a “boot- 
straps effect” (Cronbach & Meehl, 1955, p. 
286). We inject this methodological note to 
provide adequate motivation for the contaminated
derivation procedures employed.

In what follows, the terms “hit” and “miss” 
will be employed in the sense of concurrent 
validity, except where specifically mentioned 
otherwise. Hints as to construct validity appear
in certain of the cross-validation samples, but
our main purpose is to present the available
concurrent validity data for the reader to
interpret and utilize however he sees fit within
his own pragmatic and theoretical framework. To
avoid the tedious repetition of quotation marks,
the words “miss” and “hit” will hereafter be
employed without them.

[...] code (commonly found both in psychotic
depressions and in neurotic depressive reactions
or anxiety states) would be very close to the
weights optimal for distinguishing between 
psychotic and neurotic patients exhibiting less
self-concern and subjective discomfort and 
who handle their anxiety via somatizing or 
projecting mechanisms and present profiles
peaked at 3, 4, or 6. So a mixed summative 
and successive-hurdle model was used throughout
in preference to a pure summative model,
continuous variables and difference scores being
distributed separately for analysis within
relatively more homogeneous groups initially 
set apart on the basis of crude but configural 
criteria such as the Hathaway code. We fur- 
ther assume that because of defects in cur- 
rent nosological concepts (which must, of 
course, be distinguished methodologically from 
unreliability in the clinical application of in- 
trinsically powerful diagnostic concepts), or 
failure of the MMPI scores to provide dis- 
criminating information, profile configurations 
occur which are genuinely without differential 
significance. Therefore three classification outcomes
were allowed: A curve would, by the
application of the rules, be classified as either
“Psychotic,” “Neurotic,” or “Indeterminate”
in form. A third working assumption was that 
the formal diagnosis, while it is a readily 
available crude criterion by means of which 
patterns are initially identified, is far from 
infallible, so that we are justified in assum- 
ing that sometimes the test is right and the 
official diagnosis is wrong. Consequently, one 
must avoid the temptation to formulate too 
many ad hoc rules in the effort to capture 
every test miss. With this in mind the nonpsychometric
case data were deliberately allowed
to influence our MMPI rule-making at
all stages prior to final cross-validation. Attempts
to capture a miss by devising (or revising)
a rule were not pursued in those instances
in which critical reading of the case
material justified serious doubt as to whether 
the criterion in this case was performing bet- 
ter than the MMPI. The skilled clinical eye 
was employed as a searcher and idea-originator;
statistical runs were employed both as
searchers and as checks upon the deliverances
of the clinical eye. A fourth working assumption
was that only the most obvious test-
related theoretical or dynamic assumptions 
should play any important part in accepting 
a rule. We take it for granted that present 
knowledge, either of personality dynamics or 
test-taking behavior, is rarely sufficient to 
justify the neglect of a statistical finding; so 
that if a certain pattern “works” it is the task 
of theory, now or in the future, to explain 
why it does. 

On the basis of a preliminary run on the 
MMPI profiles of 41 male neurotics and 39 
male psychotics who had been hospitalized in 
the inpatient service of the University of Min- 
nesota Hospitals prior to 1946, we chose two 
sets of difference scores. Pt and Hs were 
paired, both having originally been derived 
as “neurotic” scales but it being part of 
MMPI lore and general psychiatric experi- 
ence that the obsessional veers more toward 
the psychotic side than does the patient with 
a preference for somatic symptoms. Sc and D 
were also paired as a result of this prelimi- 
nary study. Both of these scales were origi- 
nally derived on psychotic patients but it is 
known that D, a “mood” scale, is often ele- 
vated in neuroses and is markedly so in many 
cases diagnosed anxiety-neurosis. Severe anx- 
iety states run a very high D, yet some of 
these cases are (as shown by subsequent 
course or atypical symptoms) actually schizophrenic.
It seemed plausible that one might
counter-balance these factors by pairing the
D scale with the more psychotic Sc scale. A
preliminary cross-validation on 109 additional
cases (55 neurotic and 54 psychotic) resulted
in approximately 67% hits, so these two dif- 
ference scores were retained. 

We next plotted (Sc − D) against (Pt − Hs)
on an expanded sample of 104 psychotics
and 128 neurotics; [...] were set up on this
plot by drawing lines at 45° so located that
[...] intermediate bands, two and four, show
[...] but with [...]. The equations of these
lines, with the bracketing of the four variables
rearranged for easier computation, involve the
quantity denoted Beta in the final rules, and the
five regions on this graph correspond to the five
bands coordinated to these Beta values.
Beta = (Pt + Sc) − (Hs + D).
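
The rearrangement that yields Beta can be illustrated with a short sketch. It assumes that the scale values are T scores held in a dictionary keyed by scale abbreviation, and that the 45° lines have slope −1 on the plot of (Sc − D) against (Pt − Hs), so that each line is a locus of constant Beta. The profile values are invented, and no band cutoffs are shown because none are given in this passage.

```python
def beta(t):
    """Beta = (Pt + Sc) - (Hs + D), computed from a dict of scale scores."""
    return (t["Pt"] + t["Sc"]) - (t["Hs"] + t["D"])


# Invented scores for a single profile.
profile = {"Hs": 58, "D": 72, "Pt": 75, "Sc": 81}

# The two plotted difference scores.
x = profile["Pt"] - profile["Hs"]
y = profile["Sc"] - profile["D"]

# Beta is simply their sum, with the four terms regrouped.
assert beta(profile) == x + y
print(x, y, beta(profile))
```

Regrouping the terms this way means a clerk needs only two additions and one subtraction per profile rather than two subtractions and an addition of signed numbers.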
After a study of the cases missed and inspection
of several possible distributions of
difference scores selected for nonoverlapping 
with the first two, two other pairs were 
chosen. (Pa — Hy) showed a good separation 
among a subset of cases not discriminable by 
the band method, and had several armchair 
features to recommend it. Neither of the com- 
ponent scales contains a suppressor correc- 
tion (McKinley, Hathaway, & Meehl, 1950; 
Meehl & Hathaway, 1946); both scales have 
numerous “subtle” items involving denial of 
pathology and the self-image of normality, 
rationality, and psychic health (Gough, 1954; 
Meehl, 1950a; Meehl & Hathaway, 1946; 
Seeman, 1952, 1953; Wiener, 1948, 1951); 
both are extrapunitive or impunitive rather 
than intropunitive, tending to manipulate the 
environment or to develop symptoms and 
traits involving less subjective discomfort and 
anxiety. Both are associated with an inability 
to “think psychologically,” marked lack of 
insight, and resentment of psychiatric exploration.
The (Pa − Hy) difference score
might be thought of as located on a psychotic- 
neurotic axis different from that represented 
by (Pt — Hs). 
The second new difference score was (Pd 
— Hs). Here an indicator of the acting out, 
extrapunitive, “tough” component (as contrasted
with the more decompensated, suffering and
self-concerned individual given to elevations
on [...]

[...] considerations were, of course, in each case
checked against empirical distributions.

A criterion sample of 262 male patients,
evenly divided between neurotics and psy- 
chotics, constituted the main basis for rule 
development. One hundred eighty-seven of 
these profiles came from the inpatient files of 
the University of Minnesota Psychopathic 
Unit; 49 cases were drawn from records avail- 
able at the VA Hospital at Fort Snelling, 
Minnesota; and 26 cases were drawn from 
the files of a Canadian mental hospital. Records
with ? > 60, L > 70, or F > 80 were
excluded. An effort was made to exclude cases 
in which the MMPI was for some reason
(such as catatonic untestability) not taken 
until the patient had recovered from an acute 
episode, but it is not always possible to de- 
termine this from the records. 
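
The exclusion criteria stated above amount to a simple screening step, sketched here. The sketch assumes the cutoffs refer to scores on the ?, L, and F validity scales stored under those keys; the record layout, identifiers, and values are invented for illustration.

```python
def is_excluded(scores):
    """True if a record exceeds any of the three validity cutoffs in the text."""
    return scores["?"] > 60 or scores["L"] > 70 or scores["F"] > 80


# Invented records.
records = [
    {"id": "A", "?": 50, "L": 56, "F": 62},  # within all cutoffs
    {"id": "B", "?": 50, "L": 73, "F": 62},  # L exceeds 70
    {"id": "C", "?": 60, "L": 70, "F": 80},  # exactly at the cutoffs
]
kept = [r["id"] for r in records if not is_excluded(r)]
print(kept)
```

Because the cutoffs are written as strict inequalities, a record sitting exactly at a cutoff is retained in this sketch.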

Preliminary to the statistical study of these 
profiles, one of us (PEM) went through the 
(randomized) set of profiles inspectionally
and classified each as neurotic or psychotic. 
Several months later this procedure was repeated.
The results of this sorting will be referred
to as the “impressionistic profile classification,”
and it was part of the evidence
used in deciding whether an attempt should
be made to write a special rule, or modify a
tentative rule, in the effort to eliminate a
test miss.


Cases falling within each band (Beta re- 
gion) were distributed as to Delta values. 
While these distributions usually indicated 
respectable statistical validity, from the standpoint
of clinical practice they showed misses
felt to be avoidable on the basis of inspection
and previous clinical experience. Furthermore,
there appeared to be obvious “holes”
and deviant “clumps,” suggesting the presence
of strong minority profile types which
might be readily identifiable by features not 
reflected in Delta. Several searching aids were 
used, such as reshuffling of the profiles so that 
they would be laid out on the floor in a dif- 
ferent random order, or arranging them i” 
different systematic orders from right to left 
(e.g., first by absolute elevation of the pe 
score, then by the first digit of the code, the? 
in order of the Delta value, then in clusters 
based upon the formal diagnosis). 

Profiles falling wholly within the “normal
range” (i.e., having no T > 70) behaved differently
enough so that a special rule employing
the Welsh internalization ratio (Welsh,
1952) was applied to them. A special rule
was also invented for mild “fake good” (L
> 60) profiles.

These searching procedures resulted in a 
set of 13 rules, arranged in sequence so that 
in applying the system one reads through the 
list of rules in order until he comes to one 
which covers the profile under consideration. 
Application of the rule to this profile then re- 
sults in one of four outcomes: Profile “psy- 
chotic” (P), profile “neurotic” (N), profile 
indeterminate (I), or “proceed to next rele- 
vant rule.” In this manner all profiles are 
finally classifiable under one of the first three 
rubrics P, N, or I. 
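
The sequential reading of the rule list can be sketched in Python as follows. This is only an illustration of the control flow described above: the three toy rules and their cutoffs are hypothetical placeholders, not the published rules, and the fallback to “indeterminate” when no rule decides is an assumption added for safety.

```python
from enum import Enum
from typing import Callable, Dict, List, Optional


class Outcome(Enum):
    P = "psychotic"
    N = "neurotic"
    I = "indeterminate"


Profile = Dict[str, float]
# A rule returns an Outcome, or None for "proceed to next relevant rule".
Rule = Callable[[Profile], Optional[Outcome]]


def classify(profile: Profile, rules: List[Rule]) -> Outcome:
    """Read through the rules in order and stop at the first one that decides."""
    for rule in rules:
        outcome = rule(profile)
        if outcome is not None:
            return outcome
    return Outcome.I  # assumed fallback; the text says some rule always decides


# Hypothetical placeholder rules, for illustration only.
def toy_rule_1(p: Profile) -> Optional[Outcome]:
    return Outcome.P if p["Sc"] >= 80 else None


def toy_rule_2(p: Profile) -> Optional[Outcome]:
    return Outcome.N if p["Hs"] >= 80 else None


def toy_rule_3(p: Profile) -> Optional[Outcome]:
    return Outcome.I  # a final catch-all rule


TOY_RULES: List[Rule] = [toy_rule_1, toy_rule_2, toy_rule_3]
result = classify({"Sc": 62.0, "Hs": 84.0}, TOY_RULES)
```

The ordering matters: a profile that would satisfy two rules is decided by whichever rule comes first in the sequence.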

This provisional set of rules was then tried 
out on a preliminary cross-validation sample. 
Because of the great temptation to capitalize 
upon sampling errors in such complicated and 
variable searching procedures as those em- 
ployed, one expects a marked degree of 
shrinkage upon cross-validation, and there- 
fore this first cross-validation sample was con- 
ceived of only partly as a check on whether 
we were getting anything but chiefly as a 
means of modifying the rules by the elimi- 
nation of ad hoc subrules or adjustment of 
cutting scores. The preliminary cross-valida- 
tion sample consisted of 140 file cases of 
white males aged 18-65 whose MMPI profiles 
were presumed valid by the above-mentioned 
?-L-F criteria and who were not tested in 
remission so far as could be judged from the 
staff notes and other data in the chart. None 
of these patients had received shock therapy 
prior to testing, and most were tested within 
two to three days of admission. Ninety-two 
of the cases originated from the psychiatric 
unit of the University of Minnesota Hospitals 
and 48 from the Minneapolis VA Hospital. 
All were inpatients. The distribution of diagnoses
from the combined criterion and preliminary
cross-validation samples (N = 402)
is shown in Table 1.

TABLE 1

DISTRIBUTION OF DIAGNOSES ON 402 DERIVATION CASES
(Original criterion plus preliminary cross-validation)

Neurotics (N = 201)                  N        %
  Hypochondriasis                   43     21.4
  PN mixed                          37     18.4
  Anxiety neurosis                  38     18.9
  Reactive depression               31     15.4
  Hysteria                          29     14.4
  Obsessive-compulsive              16      8.0
  Other                              5      2.5
  Neurasthenia                       2      1.0
  Total                            201    100.0

Psychotics (N = 201)                 N        %
  Schizophrenia                    107     53.3
    Paranoid                      (48)     23.9
    Simple                        (23)     11.4
    Hebephrenic                    (8)      4.0
    Catatonic                      (9)      4.5
    Mixed                          (4)      2.0
    Other                         (15)      7.5
  Paranoid state                    22     10.9
  Paranoia                           2      1.0
  Manic-depressive manic            19      9.4
  Manic-depressive depressed        28     13.9
  Involutional psychosis            23     11.5
  Total                            201    100.0

Prior to application of the rules, one of us
(PEM) read in random order the case summaries
of the 92 cases from the university
hospital files, deleting material on the psychological
test data, the sections presenting differential
diagnostic considerations, and the
final official diagnosis and prognosis. The
reader made his own diagnosis on the basis
of this reading. The purpose of this procedure
was to provide information on criterion
trustworthiness in cases where one of
the preliminary rules yielded a test miss and
a decision had to be made as to whether this
particular miss was real or apparent. If the
case reader agreed unreservedly with the psychiatric
staff and the latter’s diagnostic summary
did not raise any doubts as to the diagnosis,
a more persistent effort was made to
modify the preliminary rule so as to avoid
the miss than if he disagreed with the staff.
We were particularly concerned to detect
those cases in which the case reader diagnosed
psychosis in opposition to a psychoneurotic
label attached by the staff, but in which
the staff diagnostic summary also included
mention of the likelihood of such subclinical
diagnoses as, e.g., “incipient schizophrenia”
or “strong cyclothymic element.”

In the preliminary cross-validation sample 
the MMPI profiles were again impressionisti- 
cally sorted as “psychotic” or “neurotic” on 
two occasions (separated by several months’ 
time and randomizing their order), and the 
consistency or disagreement of the two sort- 
ings was utilized in study of the individual 
test misses. 

As expected, the shrinkage on cross-valida- 
tion was pronounced. In the criterion sample, 
13% of cases were classified indeterminate, 
and of the remaining 87% of cases for whom 
a decision was provided by the preliminary 
rules, the hit-rate was 89% (for both the 
neurotic and psychotic subpopulations). On 
preliminary cross-validation, the proportion 
of indeterminate profiles does not change 
significantly (10%), but the hit-rate falls 
dramatically. The hit-rate for neurotics is 
only 47%, for psychotics 63%. The overall 
hit-rate is 55%, yielding a hit-rate of only 
61% among the 126 cases for which a de- 
terminate classification was made. We there- 
fore see a decline of almost 30% in the hit- 
rate among determinate cases in moving to 
this new sample. The cross-valid hit-rate, 
while statistically significant, is pragmatically 
unimpressive. 
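
The shrinkage figures in this paragraph follow from simple arithmetic, sketched below in Python. The function name is hypothetical; the inputs are the rounded percentages given in the text, so the results are approximate.

```python
def hit_rate_among_determinate(overall_hit_rate: float, determinate_fraction: float) -> float:
    """Hits divided by decided cases, given hits and decided cases as fractions of all cases."""
    return overall_hit_rate / determinate_fraction


n_cases = 140                 # preliminary cross-validation sample
n_determinate = 126           # cases given a determinate classification
overall_hit_rate = 0.55       # hits as a fraction of all 140 cases
criterion_hit_rate = 0.89     # hit-rate among determinate cases in the criterion sample

cross_valid_rate = hit_rate_among_determinate(overall_hit_rate, n_determinate / n_cases)
shrinkage = criterion_hit_rate - cross_valid_rate

print(f"cross-validated hit-rate among determinate cases: {cross_valid_rate:.2f}")
print(f"shrinkage from the criterion sample: {shrinkage:.2f}")
```

The shrinkage is a difference of roughly 28 percentage points, which is the “decline of almost 30%” referred to above.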

Modification of rules, addition of three new 
rules, and slight alterations in rule sequence 
were made on the basis of these results. Again 
the procedures were complex and variable, 
and concentrated attention upon those rules 
in which the hit-rate was poorest. Seven of 
the original 13 rules were in this manner sub- 
jected to some kind of modification, ranging 
from slight adjustment of a cutting score to 
fairly radical revision. Constant back-refer- 
ence was made during this phase to the effect 
of proposed modifications upon the hit-rate 
in the original 262 criterion cases. 

One interesting result of these procedures 
was the progressive displacement of the origi- 
nally emphasized band rules to the end of the 
rule sequence. The patterns of six scales rep- 
resented by Beta and Delta only begin to 
function as powerful discriminators after the
curves have initially been divided into ma- 
jor “types.” One is reminded here of the point 
made by Block (1957) in another context, 
that the kind of R covariation which one is 
likely to discover by traditional individual- 
differences methods may be quite misleading 
unless the population has first been divided 
into subpopulations by Q covariation analy- 
sis, the R covariation patterns being some- 
times very different within the several sub- 
population “types.” 

The modifications made resulted in a drop 
in the criterion group from 87% to 77% de- 
terminate classifications with a maintenance 
of 89% hits among cases classified; in the 
cross-validation group there was a decline 
from 90% to 74% determinate classifications, 
but an associated increase in the hit-rate from 
61% to 83% among those classified. For the 
total augmented criterion group (N = 402), 
we have rules which leave 24% of the cases 
indeterminate but give us an 87% confidence 
when a classification is made. Thus the move- 
ment in modification was in the direction of 
increased recognition of nondifferentiating pat- 
terns, liquidating ad hoc rules which served 
to classify special criterion cases but which 
collapsed on cross-validation. The final set of 
rules is presented in Appendix A.²
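
As a check on the pooled figures for the augmented criterion group, the two subsample results can be combined by weighting each by its size. The Python sketch below assumes the rounded percentages quoted in the paragraph above; the helper name `pool` is hypothetical.

```python
from typing import List, Tuple


def pool(groups: List[Tuple[int, float, float]]) -> Tuple[float, float]:
    """Pool (n_cases, determinate_fraction, hit_rate_among_determinate) triples.

    Returns (indeterminate_fraction, hit_rate_among_determinate) for the pooled group.
    """
    total = sum(n for n, _, _ in groups)
    decided = sum(n * d for n, d, _ in groups)
    hits = sum(n * d * h for n, d, h in groups)
    return 1 - decided / total, hits / decided


indeterminate, confidence = pool([
    (262, 0.77, 0.89),  # original criterion sample after rule modification
    (140, 0.74, 0.83),  # preliminary cross-validation sample after rule modification
])

print(f"indeterminate: {indeterminate:.0%}, confidence when classified: {confidence:.0%}")
```

Under these assumptions the pooled values come out at about 24% indeterminate and 87% hits among classified cases, in agreement with the figures reported for N = 402.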


CROSS-VALIDATION


For final cross-validation of the modified
rules we attempted to secure a sample which
would be both large and sufficiently diverse
(as to clinical populations and staff diagnostic
practices) to provide some information about
validity generalization. Through the kindness
of several clinical psychologists in providing
MMPI data we were able to obtain eight different
samples of cross-validation cases from
various clinical installations over the country.³
The sample sizes varied from 42 to 273, with
a median of 97 cases per sample and a total
N = 988. A more detailed characterization of
these samples is contained in Appendix B.⁴
The samples vary as to nature of the population
(VA and non-VA, outpatient and inpatient).
They are geographically dispersed, and
so far as known to us there is considerable
variation as to the predominant local theoretical
and diagnostic orientation. All, however,
are males. They vary widely in the diagnostic
“purity” of the cases. In three samples
(A, B, E) the criterion is completely uncontaminated,
the MMPI having been unavailable
to the diagnosing clinicians. A fourth
(K) is effectively uncontaminated since, although
MMPI results were available, the patient
population sampled consisted of psychotics
in a state hospital where the daily
census of neurotic diagnoses runs only 1-2%.
The remaining four samples (C, D, F, G)
suffer from unknown but nonnegligible contamination.
Hit-rates do not differ as between
the contaminated and uncontaminated cases
(χ² = 1.13, 1 df, p > .20). Tables 2 and 3
summarize the results of applying the rules
to these eight samples.

2 A five-page statement of the rules has been deposited
with the American Documentation Institute.
Order Document No. 6330 from ADI Auxiliary Publications
Project, Photoduplication Service, Library
of Congress; Washington 25, D. C., remitting in advance
$1.25 for microfilm or $1.25 for photocopies.
Make checks payable to Chief, Photoduplication
Service, Library of Congress. Mimeographed copies
of the rules may be obtained from Paul E. Meehl,
Box 390 Medical School, University of Minnesota,
Minneapolis 14, Minnesota.

The H:M:I distribution does not vary significantly
over the seven neurotic categories
(omitting the single phobic reaction, χ² =
20.84, 14 df, .10 < p < .20). Pooling all
schizophrenics, H:M:I does not vary over
psychotic diagnoses (χ² = 8.08, p > .80). Although
inspection suggests that the affective
disorders as a group are harder to identify
than the schizophrenics, this trend does not
quite achieve statistical significance (χ² =
5.50, .05 < p < .10). Further checking on
larger numbers of involutionals and manic-depressives
is indicated, especially since among
the decidable cases, hit-rates (H/[H + M]) do
differ as between schizophrenics and affectives
(.75 versus .54, χ² = 4.97, 1 df, .02 < p <
.05). That is, when we ignore indeterminate
curves, the affective cases are more often misidentified
as neurotic. Qualitative study of the
missed affective cases suggests that criterion
error or patient change accounts for some of
these misses.

3 We wish to express our indebtedness to the following
clinical psychologists who were of great assistance
to us through their kind efforts in making
their data available to us for reanalysis or for taking
the pains to track down diagnoses, original records,
and other information on the criterion and
cross-validation samples: H. R. Albrecht, VA Hospital,
Chillicothe; George Guthrie, Pennsylvania
State College; Howard F. Hunt, University of Chicago;
Thomas Kiresuk, Minneapolis General Hospital;
Timothy F. Leary, Kaiser Foundation, Oakland,
California; James C. Lingoes, Langley Porter
Clinic; Donald R. Peterson, University of Illinois;
Albert Rosen, University of Maryland; Harold
Rubin, VA Mental Hygiene Clinic, Philadelphia;
Ranald M. Wolfe, Chillicothe VA Hospital.

4 A four-page summary of the sources and characteristics
of the eight cross-validation samples has
been deposited with the American Documentation
Institute (See Fn. 2).

The variation of H:M:I over schizophrenic 
subcategories fails to reach statistical significance
(χ² = 14.67, 10 df, .10 < p < .20). It
is interesting to note that the best results, in 
terms of high total hits, low indeterminate 
rate, and high confidence among classifiables, 
occur in the conversion and somatization 
groups. As we move into neurotic categories 
where dysphoria, failure of the defense, intel- 
lectualizing and other obsessional mechanisms, 
conscious guilt, and “bad” self-concepts are 
more in evidence, the profiles increasingly re- 
semble those of the psychotic group. 

The H:M:I distribution does not differ significantly
as between all psychotics and all
neurotics (χ² = 2.39, 2 df, p > .30). Considering
only the 691 determinate-curve cases,
neurotics and psychotics do not differ in hit-rate
(χ² = 2.35, 1 df, .10 < p < .20).

The variation in H:M:I over the eight
samples is significant (χ² = 43.14, 14 df,
p < .001). One sample, D (Rubin’s Chillicothe
data), fails to exceed chance differentiation
among classified cases. All samples do show
H > M, the ratios varying from 1.6 to 15.6.
The hit-rates as between neurotic and psychotic
cases do not differ in seven of the samples,
but the 19% higher identification of neurotics
in Sample G is at the .001 level.

Whether one is encouraged or discouraged
by these results will depend both upon his
clinical expectations and his methodological
orientation. If we concentrate upon the fact
that the percent of correct classifications
in the entire sample is only 53.3%, we may
not be much impressed. If, however, we accept
the fact that some curves are ambiguous
and ought not to be given any appreciable
weight in making a decision (an attitude
toward laboratory tests to which our medical
colleagues have become thoroughly accustomed),
and therefore attend chiefly to
the subset of cases for which the rules provide
a decision, we see that the ratio of hits
to misses among decided curves is 3.2 to 1.
When a curve is classified on the basis of the
rules, the classification has attached to it a
confidence of .76. Assuming the MMPI to be
in use in a clinic for whatever (multiple) purposes
it is deemed appropriate, the minute or
less of clerical time required to apply the
Meehl-Dahlstrom rules is not an unjustifiable
expenditure of effort to obtain this much additional
information on a sizable subset of
cases. That the concurrent validity compares
favorably with the pooled weighted judgments
of 29 Minnesota clinicians (and is better than
any of them taken individually) would seem
to justify substitution of the rules for impressionistic
profile assessment with respect to the
psychoticism variable. These patterns also discriminate
better than any of five other “actuarial”
methods, including the linear discriminant
function (Meehl, 1959a).

TABLE 2

COMPARATIVE RESULTS BY DIAGNOSIS IN THE EIGHT CROSS-VALIDATION SAMPLES
(N = 988)

Diagnosis                                                 N      %     H%     M%     I%  H/(H+M)

Neurotics
Psychoneurosis or psychosomatic, unspecified            185   35.0   46.5   17.8   35.7      .72
Anxiety                                                 135   25.5   57.0   17.8   25.2      .76
Conversion or somatization                               99   18.7   69.7    8.1   22.2      .90
Depression                                               65   12.3   52.3   13.8   33.8      .79
Mixed                                                    23    4.3   47.8   13.0   39.1      .79
Other (e.g., passive-aggressive, neurotic character)      9    1.7   77.8    0.0   22.2     1.00
Obsessive-compulsive                                      8    1.5   37.5   12.5   50.0      .75
Hypochondriasis                                           4     .8   50.0   25.0   25.0      .67
Phobic reaction                                           1     .2  100.0    0.0    0.0     1.00
Total                                                   529  100.0

Psychotics
Schizophrenia paranoid                                  174   37.9   59.2   14.9   25.9      .80
Schizophrenia mixed or unspecified                       79   17.2   41.8   22.8   35.4      .65
Schizophrenia latent or in remission                     59   12.9   54.2   18.6   27.1      .74
Schizophrenia hebephrenic                                 6    1.3   33.3    0.0   66.7     1.00
Schizophrenia simple                                      6    1.3   50.0   33.3   16.7      .60
Schizophrenia catatonic                                   3     .7   33.3    0.0   66.7     1.00
Psychosis unspecified                                    92   20.1   47.8   17.4   34.8      .73
Manic-depressive depressed                               12    2.6   33.3   33.3   33.3      .50
Manic-depressive manic                                   10    2.2   50.0   30.0   20.0      .62
Manic-depressive mixed                                    2     .4   50.0   50.0    0.0      .50
Involutional psychosis                                    8    1.7   37.5   37.5   25.0      .50
Paranoid condition                                        6    1.3   66.7   16.7   16.7      .80
Psychosis with psychopathic personality                   2     .4  100.0    0.0    0.0     1.00

TABLE 3

COMPARATIVE RESULTS IN THE EIGHT CROSS-VALIDATION SAMPLES

                  Criterion
Sample      N   % psychotic    H%    M%    I%  H/(H+M)      p b
A a        92            64    55    16    28      .77   <.001
B a        77            52    45    29    26      .61   <.05
C         103            47    49    16    35      .75   <.001
D          42            52    40    21    38      .65   ns
E a       181            50    45    18    36      .71   <.001
F         166            43    47    20    33      .70   <.001
G         273            27    63    12    25      .84   <.001
K a        54           100    78     5    17      .93   <.001
Total     988            54    53    17    30      .76   <.001

a Essentially uncontaminated samples.
b Chi square or exact test used as appropriate.
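
The confidence column of Table 3, H/(H + M), and the hit:miss ratios quoted in the text can be recomputed from the printed H% and M% columns. The Python sketch below uses those rounded percentages, so a recomputed value may differ from the printed one by about .01 (Sample K comes out at .94 rather than .93); the variable names are illustrative only.

```python
from statistics import median

# (sample, H%, M%) as printed in Table 3.
TABLE_3 = [
    ("A", 55, 16), ("B", 45, 29), ("C", 49, 16), ("D", 40, 21),
    ("E", 45, 18), ("F", 47, 20), ("G", 63, 12), ("K", 78, 5),
]


def confidence(hits: float, misses: float) -> float:
    """Hit-rate among decided (determinate) curves: H / (H + M)."""
    return hits / (hits + misses)


conf_by_sample = {s: confidence(h, m) for s, h, m in TABLE_3}
ratio_by_sample = {s: h / m for s, h, m in TABLE_3}

median_conf = median(conf_by_sample.values())
lowest_ratio = min(ratio_by_sample.values())
highest_ratio = max(ratio_by_sample.values())
pooled_conf = confidence(53, 17)  # "Total" row of Table 3

for sample, value in conf_by_sample.items():
    print(f"Sample {sample}: H/(H+M) = {value:.2f}, H:M = {ratio_by_sample[sample]:.1f}")
```

On these inputs the median confidence is about .73, the H:M ratios run from about 1.6 (Sample B) to 15.6 (Sample K), and the pooled confidence is about .76.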

One of the present samples (Palo Alto, 
Sample B) does provide sufficient uncontami- 
nated information to make feasible certain 
within-diagnosis comparisons which shed a 
little light upon construct validity. Through 
the kind cooperation of Howard Hunt, we 
were able to obtain access to a one-page psy- 
chological summary sheet for each of the pa- 
tients in that sample. This summary, in addi- 
tion to the usual face-sheet data, also included 
varying amounts of information regarding 
diagnostic considerations (e.g., impression of 
the admission board, number of previous ad- 
missions, type of service discharge, compensa- 
tion, hospital status at the time of testing, 
Rorschach or Wechsler-Bellevue findings). 
One of us (PEM) read over each of these
summary sheets (uncontaminated by knowledge
of the associated MMPI profile) and
made a subjective judgment in six steps as
to the strength or clarity of evidence for and
against psychotic tendencies. To get a rating
of ++ for “very clear psychosis,” the data
had to include the admission board impression
and a Rorschach diagnosis, neither of
which considered any alternative. Semiobjective
rules were set up for lesser degrees of
clarity or amount of information running
through +, ±, ∓ (“neurotic but debatable”),
−, and −− (“very clear neurosis, considerable
data given”). Any case marked “in remission”
was automatically considered questionable.
Cases with disharmony between admission
board impression and final diagnosis
were automatically considered questionable.
For the 44 cases judged as either “very clear”
or “clear,” the hit-rate was 68% (67% and
69% for the two levels of clarity, respectively);
whereas for the 13 cases considered
“doubtful,” only four were correctly classified,
corresponding to a hit-rate of 31% (χ²
= 5.84, p < .02). That is, when the admission
board was impressed with some behavior
not reflected in the formal diagnosis, or the
psychiatric staff was moved to record a secondary
diagnosis, or the psychologist giving
the Rorschach found evidence running counter
to the diagnosis administratively assigned,
such a patient was more likely to produce an
MMPI profile out of harmony with the official
diagnosis than was true for patients in
whom no such inconsistencies were in evidence.
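
The chi-square reported for the clarity comparison can be approximately reproduced from a 2 × 2 table. The Python sketch below assumes that the 68% hit-rate among the 44 clear cases corresponds to 30 hits and 14 misses (the text gives only the percentage), and it uses the uncorrected Pearson formula; both points are assumptions of this illustration rather than details stated in the paper.

```python
def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


# Rows: clarity of the nontest evidence; columns: hit, miss.
clear_hits, clear_misses = 30, 14        # assumed split giving 68% of 44
doubtful_hits, doubtful_misses = 4, 9    # 4 of 13 correct, as stated in the text

chi2 = chi_square_2x2(clear_hits, clear_misses, doubtful_hits, doubtful_misses)
print(f"chi-square = {chi2:.2f} on 1 df")
```

With these counts the statistic comes out at about 5.84 on 1 df, matching the reported value.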

The relation between construct and concur- 
rent validity in situations of this type is too 
complex and involves too much methodological
controversy to be developed here (see
Meehl, 1959b). From the standpoint of con- 
current validity, the Palo Alto findings just 
described can be interpreted either favorably 
or unfavorably, depending upon one’s prag- 
matic emphasis. From a traditional viewpoint, 
the subjective rating based on the face-sheet 
data may be viewed as a “criterion” (al- 
though highly unreliable); the diagnostic re- 
versals given by the pattern rules in this in- 
termediate range must then be conceived as 
validly reflecting the “mixed” character of the 
patient’s behavior, certain aspects of which 
unduly influence the psychiatric staff respon- 
sible for the formal diagnosis while other as- 
pects are conveyed by the face-sheet informa- 
tion and therefore reflected in the “clarity” 
ratings of the case reader. From another point 
of view, one can argue clinically that the prac- 
tical utility of a psychometric instrument 
varies inversely as the clarity with which the 
patient’s diagnosis can be pegged without the 
instrument, so that the finding of high agree- 
ment for clear-cut cases and poor agreement 
for borderline cases is precisely the reverse of 
what might be desired in clinical practice. The 
rationale of this second position involves an 
implicit commitment to the idea of construct 
validity and brings us to our own preferred 
mode of thinking about such internal rela- 
tionships. The construct validity of the pro- 
file is its power to reveal the internal psycho- 
logical structure and state of the patient, 
which it does fallibly and probabilistically, as 
do the other indicators available (including 
the social impact of the patient upon the 
diagnosing psychiatrist). To say that one 
wants the test precisely for cases where the 
diagnosis is otherwise difficult, is to say that 
“mixed” behavior output leaves one in doubt 
as to the inner psychological condition, and 
it is desired to use the test as an aid in as- 
sessing this inferred internal state of affairs. 
This way of conceptualizing the situation 
makes it unfeasible to refine the validation 
procedure further unless the quantitative and 
qualitative character of the additional data 
(other than formal diagnosis) is superior to 
that routinely available. 

Since the kind and degree of construct validity
remain to be established and the present
investigation deals only with the most
obvious among the indefinite family of con- 
current and predictive validities which the 
patterns may possess, a vexing problem of 
terminology obtrudes itself. How shall the 
decidable profiles be labeled? In the present 
state of the evidence, we would favor a rather 
noncommittal language. The adjectives “psy- 
chotic” and “neurotic” might be carefully de- 
fined as referring merely to curve types, warn- 
ing against an automatic classification of the 
patient, but one realizes that such hygienic 
semantic provisions do not always achieve 
their aim in practical usage. Furthermore, the 
term “psychotic” has itself a rich network of 
associations, valuable for theoretical purposes 
and in the design of subsequent research but 
which in daily clinical practice should not be 
linked too closely to the curve form itself. 
We propose as a terminological convention
that the neurotic curve type as identified by
these rules be called an “N-curve,” and the
psychotic curve type be referred to as a “P-curve.”
The construct, concurrent, and predictive
validities associated with these types
and the subtypes (rules) by which the broader 
groups are identified then remain to be de- 
termined. This approach is consistent, at the 
profile level, with our preference for the use 
of digits in designating individual MMPI 
keys. An alternative and theoretically neutral 
locution would be “first zone” (=N) and 
“third zone” (=P), reserving “second zone” 
for the conduct disorder types, adopting a 
“zone” language as is done, e.g., in the colloidal
gold test of clinical neurology.

We hope that a “bootstraps effect” has 
been achieved by the identification of these 
configurations. From a construct validity view- 
point, the phenomenon of a “test miss” can 
be very instructive and in the long run may 
be more productive of understanding and im- 
provement in assessment procedures than an 
obvious “hit.” Suppose that a patient pre- 
sents, on formal mental status examination 
and other behavior data gathered from in- 
formants and ward personnel, a nonpsychotic 
clinical picture. Yet, as happens with consid- 
erable frequency, the MMPI pattern is a 
schizophrenic P-type. Assume exclusion of ob- 
viously invalid or essentially “chance” re- 
sponse patterns readily detectable by present 
methods. What are we to think about this patient?
Setting aside questions of evaluating
the test, one can contemplate this situation in 
terms of the patient’s psychology. The pa- 
tient, when presented in a standard manner 
with 550 verbal stimuli, has responded to 
them systematically, in a way which is “statistically
congruent” with the response patterns
of patients who were not clinically clear
of psychotic behavior but rather identified by
nontest criteria as diagnosable schizophrenics.
This systematic pattern of responding cannot 
be dismissed. In any adequate account of the 
psychometric situation, it will have to be fitted 
into some kind of consistent causal analysis. 
When out-and-out faking or lying has been 
excluded, the verbal responses “true” and 
“false” to this standardized set of verbal 
stimuli must be psychologically construed as 
phenomenological reports of varying ade- 
quacy. It is of course not assumed that the 
reports, insofar as they have a content refer- 
ring to behavioral dispositions or the facts of 
the world or of other people’s conduct, are 
“correct” (Meehl, 1945). What is, however, 
assumed—once we have excluded clear fake 
records—is that the patient is reporting, 
within the limitations of a prespecified domain 
and fixed-response context, the character of 
his phenomenology at the time of testing. We 
then have a paradox. Here is a patient whose 
MMPI-phenomenology is “statistically char- 
acteristic” of clinically obvious schizophrenia 
but whose behavior in a diagnostic interview, 
on the ward, and in recent extramural con- 
texts is not clinically schizophrenic. Such a 
patient must be psychologically different from 
both the clinically schizophrenic patient with 
a third zone curve and the neurotic patient 
with a first zone curve. His verbal response 
to these items is not something detached from 
the world and utterly uninterpretable without 
nontest data. After all, the test has elicited a 
sizeable mass of verbal behavior, and if previ- 
ous evidence enables us to say “what is a 
schizophrenic way of responding to these 
items,” the fact that a patient has responded 
thus is a weighty piece of evidence regarding 
him and must be “explained” rather than, as 
is so often done in staff conferences, “ex- 
plained away.” 

There are dangers involved in this kind of
thinking for those who would irresponsibly
defend an instrument to which they are 
strongly committed, effectively cutting them- 
selves off from all possibility of refutation; 
but this danger should not mislead us into 
undervaluing the evidential weight of a psy- 
chometric pattern. An adequate understand- 
ing of the class of patients defined as test- 
misses from the concurrent validity stand- 
point must include a psychological account of 
the presence of their schizophrenic phenome- 
nology, when their social and other behavior 
is apparently that of the neurotic group. The 
development of such an account would seem 
to hinge upon the accumulation of further 
data on as many different kinds of concur- 
rent, predictive, and content validity as the 
ingenuity of clinical investigators can devise. 
Examples which come readily to mind would 
be the following: In a group of patients all 
officially diagnosed as psychoneurotic, what 
is the relationship between the incidence of 
schizophrenic Rorschach indicators and the 
presence of P-curves of schizoid type? If we 
assume that some so-called “neurotic depressive
reactions” are actually phases of a
“damped” endogenous manic-depressive cy- 
cle, are there nontest indicators (e.g., a his- 
tory of recurring depressions in the patient, or 
the finding of depressions and suicides in his 
family history) associated with “neurotic” 
depressions manifesting P-profiles of the de- 
pressive type? What is the relationship be- 
tween curve type and such physiological indi- 
cators as the Funkenstein reaction (Funken- 
stein, Greenblatt, & Solomon, 1952) or the 
sedation threshold (Shagass & Jones, 1958)? 
If we classify patients on the basis of curve 
type, and then for each curve group, regard- 
less of the clinical diagnosis officially given at 
the time of initial study, plot cumulative subsequent
hospitalization over a long time span,
how do the slope and asymptote constants of 
these curves compare? Suppose that a group 
of patients seen in an outpatient setting are 
Q sorted and Q correlations with an idealized 
Q sort description of “pseudo-neurotic schizo- 
phrenia” are computed. Considering a sample 
of patients all diagnosed as neurotic, how do 
the distributions of these Q correlations com- 
pare as between neurotic patients having a 
P- versus those having an N-profile? It is by 
accumulation of investigations of this type,
preferably in the context of at least a sketch 
of theory as to the nature of the psychotic 
disorders, that an adequate picture of the 
construct validity of profile pattern signs will 
have slowly to be achieved. 


SUMMARY 


By a combination of statistical searching 
and codification of clinical experience, a set 
of objective profile signs has been evolved as 
an aid in the discrimination between “neu- 
rotic” and “psychotic” MMPI profile pat- 
terns. These signs were then applied to the 
profiles of eight cross-validation samples (N
= 988) from a diversity of clinical installations
involving varying degrees of contamination
from essentially none for four samples
to unspecifiably high. Approximately 30% of 
the cases present profiles which are classified 
as indeterminate with regard to the psychotic- 
neurotic distinction. Among classifiable cases, 
the concurrent validity hit-rate varied from 
a low of 61% to a high of 93% in the eight 
samples, with a median of 73% hits, and a 
hit:miss ratio of 3.2:1 for the total pooled 
sample. It is suggested that research with 
these patterns be directed to the elaboration 
of their construct validity and to the psycho- 
logical understanding of the phenomenology 
and dynamics of cases which are “test-misses” 
from the concurrent validity standpoint. It is 
also suggested that curves characteristic of 
those found in diagnosed psychoneurotics be 
designated simply as N-type (or “first zone”) 
profiles, in contradistinction to P-type (“third 
zone”) profiles manifesting the configuration 
typical of psychotic patients. The rules, while 
complex and of a somewhat forbidding as- 
pect, can, after a little practice, be applied by 
a clerk in less than a minute’s time. 

Attempts to determine the effectiveness of
psychotherapy have used many different cri- 
teria to assess the extent of change and there 
has been considerable discussion of what vari- 
ables constitute the most adequate measures. 
Some researchers have solved the problem of 
having to decide between a number of avail- 
able criteria, none of which is completely 
satisfactory, by combining several measures 
into a composite criterion. Assumptions im- 
plicit in this approach are that psychotherapy 
brings about a general and unitary person- 
ality change and that all criteria are imper- 
fect measures of this change. Other research- 
ers have insisted that the effects of psycho- 
therapy may be very specific and that the 
criterion for evaluation of therapeutic out- 
come should bear a direct theoretical relation- 
ship to the underlying theory of therapy. It 
is important to know the degree of generality 
of therapeutic outcomes, both in making de- 
cisions concerning appropriate criteria for re- 
search, and in determining whether specific 
therapies may be needed for specific prob- 
lems or whether general psychotherapeutic 
techniques may be applicable to a wide va- 
riety of problems. 

Two previous factorial studies of therapy 
change (Cartwright & Roth, 1957; Gibson, 
Snyder, & Ray, 1955) suggest that person- 
ality change is multidimensional and that the 
factors tend to be organized around the vari- 
ous instruments used to measure change. It 
is, however, erroneous to interpret this as 
indicating that the effects of psychotherapy 
are themselves multidimensional. The therapy 
is certainly not the only independent variable 
operating to produce change in a group of 
subjects (Ss) undergoing therapy. Thus, to 
study the dimensionality of therapy change,
it is necessary not only to determine the fac-
tor structure of the change measures, but also
to establish which, if any, of the change fac-
tors are related to the amount of therapy re-
ceived.

1 This study was supported by a grant (XR 1916)
from the Purdue Research Foundation.

The ideal design for dealing with this prob- 
lem would be to obtain a variety of measures 
of personality change from a large number of 
Ss undergoing psychotherapy and to obtain 
the same change measures from a matched 
group of control Ss over a comparable wait- 
ing period. A comparison of the factor struc- 
ture of the change measures in the therapy 
group with that in the control group would 
indicate whether or not therapy has a general 
or group effect which would tend to increase 
the correlations among groups of change 
measures in the therapy group. Then com- 
parisons of the amount of change on the vari- 
ous factors in therapy and control groups 
would indicate which change factors are af- 
fected by therapy and which are not. Un- 
fortunately, this paper is not a report of the 
results of this ideal study, but rather is an at- 
tempt to answer some of the same questions 
with somewhat less adequate data. 

In this study, a number of therapy change 
scores are factored in order to determine the 
dimensions along which change occurs in the 
measures used. Changes along these dimen- 
sions in the therapy group are compared with 
changes in a comparable control group in or- 
der to determine which of the changes can be 
attributed to the therapy. Additional evidence 
on this point is obtained from the relation- 
ship of the changes in the therapy group to 
the amount and intensity of the therapy. 


METHOD 
Subjects 


During the 1957-58 and 1958-59 academic years, 
75 undergraduate students were seen at the Purdue
Psychological Clinic for at least five psychothera-
peutic interviews and completed the California Psy- 
chological Inventory (CPI) and a sentence comple- 
tion test before the first and after the last interview. 
At the end of therapy they also completed a brief 
rating scale, a portion of which is shown in Table 1, 
and the therapist filled out a somewhat longer rating 
scale which included four items corresponding to 
those shown in Table 1. There were 57 male and 18 
female clients. Approximately half were self-referred 
to the clinic while the other half were referred from 
the dean’s office, the student health service, or other 
campus agencies. Difficulties presented in the intake 
interview ranged from alcoholism and homosexuality 
to lack of motivation to study. The most common 
complaints were difficulties in interpersonal relation- 
ships, difficulties in school work, and family or au- 
tonomy problems. None of the cases included in this 
study were so sick that they had to leave school 
during the course of therapy. The mean number of 
therapy interviews was 14.7 and the standard devia- 
tion was 9.2. 

These Ss were seen for therapy by third- and 
fourth-year graduate students who were enrolled in 
a two-semester practicum course. There were 41 
therapists, most of whom saw two of the research 
Ss. Most of the therapists had had some previous 
experience with psychotherapy, but for a few this 
was their first direct therapy experience. All of the 
therapy was closely supervised, typically one hour of 
supervisory conference with a faculty member for 
each hour of therapy. Recordings and observation 
through a one-way screen were occasionally used as 
an aid to supervision. Most of the clients were seen 
once weekly, but a few were seen two or three times 
a week. The goals of therapy varied considerably 
from therapist to therapist, but for the most part 
were in the direction of the establishment of a close 
interpersonal relationship with the client and the dis- 
cussion of feelings and problems with a view to the 
client’s gaining insight. 

When a client came to the clinic he was seen for 
an initial intake interview by a graduate assistant
and an appointment was made to take the CPI and 
the sentence completion test. After the tests were 
completed the client was assigned to a therapist who, 
with his supervisor, was then in complete charge of 
the case. The clinic closed for the summer at the end 
of the spring semester, which caused the termination 
of therapy for 58 of the 75 cases included in the 
present study. The remaining cases terminated volun- 
tarily some time before the end of the spring se- 
mester. Following the last therapy interview the 
client was contacted by letter or phone by the ex-
perimenters (Es) and asked to come in to take the
posttests. These tests were administered by the Es 
and the clients did not see their therapists at this 
time. 

As might be expected it was often difficult to get 
all of the required information on every client. Of 
157 clients who took the initial tests only 75 met 
the requirements of the study. Most of those not in- 
cluded were eliminated because of an insufficient
number of interviews. However, a number were not
included because they dropped out of school or for 
some other reason did not take all of the tests. 


Factor Analysis of Change Scores 


For each of the 75 Ss, 30 change scores were avail- 
able. The CPI was scored for the 18 standard scales 
described in the CPI manual (Gough, 1957) and 
change scores were calculated by subtracting the pre- 
test score from the corresponding posttest score. The 
sentence completion test consists of 55 stems de- 
veloped by Rosenberg (1957), each of which pre- 
sents a conflict situation to be reconciled. The test 
was scored according to an unpublished system de- 
veloped by the junior author in which the Ss’ recon- 
ciliation of the conflict presented by each stem was 
classified as either active or passive and as either 
firm (dealing directly with the conflict situation) or 
soft (avoiding the conflict situation). The combina- 
tions of these ratings summed over all items yielded 
four scores: active-soft (AS), active-firm (AF), pas- 
sive-soft (PS), and passive-firm (PF). Interscorer 
reliability was found to be .87 for AS, .75 for AF, 
.64 for PS, and .49 for PF by Ebner (1958) using
a shortened form of the test and another college stu- 
dent sample. Four sentence completion change scores 
were calculated by subtracting the pretest score from 
the corresponding posttest score. 

Four ratings of amount of change were made at
the end of therapy by both therapist and client on
the items shown in Table 1. These change scores
were intercorrelated and the resulting 30 × 30 matrix
was factor analyzed by the principal components
method with communalities estimated as the high-
est correlation in each row.² On the basis of the size
of the largest factor loadings and the size of the
latent roots it was decided to retain six factors which
were rotated analytically to the varimax criterion
of orthogonal simple structure.³ The rotated factor
matrix is shown in Table 2.
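The analytic rotation step can be illustrated with a small sketch. The code below is a hypothetical pure-Python version of raw varimax built from Kaiser's pairwise planar rotations. It is not the program that was run on the Datatron 205; the function names and the demonstration loadings are invented for illustration, and row normalization of the loadings is omitted for brevity.

```python
import math


def varimax_criterion(loadings):
    """Sum over factors of the variance of the squared loadings."""
    n = len(loadings)
    total = 0.0
    for j in range(len(loadings[0])):
        squares = [row[j] ** 2 for row in loadings]
        mean_sq = sum(squares) / n
        total += sum(s * s for s in squares) / n - mean_sq ** 2
    return total


def communalities(loadings):
    """Row sums of squared loadings; orthogonal rotation leaves them unchanged."""
    return [sum(v * v for v in row) for row in loadings]


def varimax(loadings, sweeps=50, tol=1e-10):
    """Rotate each pair of factors in turn until every angle is below tol."""
    rotated = [list(row) for row in loadings]
    n, k = len(rotated), len(rotated[0])
    for _ in range(sweeps):
        largest = 0.0
        for p in range(k - 1):
            for q in range(p + 1, k):
                u = [r[p] ** 2 - r[q] ** 2 for r in rotated]
                v = [2.0 * r[p] * r[q] for r in rotated]
                a, b = sum(u), sum(v)
                c = sum(ui * ui - vi * vi for ui, vi in zip(u, v))
                d = 2.0 * sum(ui * vi for ui, vi in zip(u, v))
                num = d - 2.0 * a * b / n
                den = c - (a * a - b * b) / n
                if num == 0.0 and den == 0.0:
                    continue  # criterion is flat in this plane
                phi = 0.25 * math.atan2(num, den)
                largest = max(largest, abs(phi))
                cos_phi, sin_phi = math.cos(phi), math.sin(phi)
                for r in rotated:
                    x, y = r[p], r[q]
                    r[p] = x * cos_phi + y * sin_phi
                    r[q] = -x * sin_phi + y * cos_phi
        if largest < tol:
            break
    return rotated


# Hypothetical example: a clean two-factor structure turned by 30 degrees.
simple = [[0.8, 0.0], [0.7, 0.0], [0.0, 0.6], [0.0, 0.9]]
angle = math.radians(30)
unrotated = [[x * math.cos(angle) + y * math.sin(angle),
              -x * math.sin(angle) + y * math.cos(angle)] for x, y in simple]
recovered = varimax(unrotated)
```

In this toy case the rotation returns the loadings to the simple structure they started from, and the communalities are the same before and after, which is why the communality column of a rotated matrix can be checked against the row sums of squared loadings.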


THE FACTORS 


The findings of this analysis are consistent 
with those of Cartwright and Roth (1957) 
and Gibson, Snyder, and Ray (1955) that the 
amount of change occurring with psycho- 
therapy depends to a large extent on the 
vantage point from which it is observed. Each
instrument used to assess change defined its
own factor or factors.


2 The correlation matrix and the unrotated factor
matrix have been deposited with the American Docu-
mentation Institute. Order Document No. 6405 from
ADI Auxiliary Publications Project, Photoduplica-
tion Service, Library of Congress; Washington 25,
D. C., remitting in advance $1.25 for microfilm or
$1.25 for photocopies. Make checks payable to:
Chief, Photoduplication Service, Library of Con-
gress.

3 These calculations were done on Purdue’s Data-
tron 205 computer.


TABLE 1


RATING SCALE FILLED OUT BY THE THERAPY GROUP AT THE END OF TREATMENT


We are trying to evaluate the effectiveness of the work of the Psychological Clinic in an attempt to improve the
services offered. Your frank answers and comments to the items below will greatly help us in doing this.

On the following four items rate the amount of change, if any, which has occurred in you since you first came
to the clinic. Consider all changes whether due to coming to the clinic or not. Indicate your rating by a check
mark by the appropriate number.


1. Amount of change in the symptoms or complaints which brought you to the clinic.

   1                  2            3                4                5
   worse than when    no change    some             considerable     no longer bothered
   I came                          improvement      improvement      by symptoms


2. Amount of change in your understanding of yourself and your behavior.

   1                  2            3                     4
   more confused      no change    understand myself     understand myself
   about myself now                some better           much better now


3. Amount of change in feeling and general outlook on life. (Happiness, sense of well being, etc.)

   1                  2            3                4                5
   feel worse         no change    feel some        feel             feel a great
                                   better           considerably     deal better
                                                    better


4. Overall rating of amount of change considering everything that is important to you.

   1                  2            3                4                5
   change for         no change    some change      considerable     a great deal
   the worse                       for the better   change for       of change for
                                                    the better       the better


Note. Therapists and control subjects completed scales very similar to the above. The direction of the scale for
Items 2 and 4 was reversed on the actual rating scale so that low scores indicated improvement; however, all scores
reported here are in terms of the scales in the above form.

Factor A is clearly a CPI factor with high 
loadings on Wb, Re, Sc, To, Gi, and Ac. 
Since the Wb and Gi scales were constructed 
to detect faking, this factor may be tenta- 
tively interpreted as representing change in 
the tendency to present oneself in the best 
light on inventory items. The other scales 
loading on this factor represent changes in 
self-description in regard to maturity, con- 
scientiousness, permissiveness, and depend- 
ability. However, the profiles presented by 
Gough (1957) of Ss instructed to fake bad 
show a marked lowering of the scores on 
these scales. As a matter of fact, the differ- 
ences between the T scores of Ss faking good 
and Ss faking bad reported by Gough (1957) 
are, with the exception of the Cm scale, pro-
portional to the corresponding factor loadings
on the present factor.

If the interpretation of this factor as a fak-
ing or social desirability factor is correct, it
is surprising that the client’s direct ratings of 
change do not load on it. Perhaps this is be- 
cause the CPI scores are difference scores be- 
tween pretest and posttest and would thus be 
sensitive to any change in tendency to pre- 
sent oneself in a good light, while the client 
ratings are made only at the end of therapy 
when the client may not be aware of this 
change in self-description or may not consider 
it in his ratings. 

Factor B is clearly a therapist rating fac- 
tor. In addition to the therapist ratings, the 
client rating of change in understanding loads 
on this factor. What is surprising is that the 
therapist ratings identify a factor which is so
little related to test score changes and client
ratings. The loading of client rating of change 
in understanding may represent some real 
agreement between therapist and client. How- 
ever, since any changes achieved were pre- 
sumably discussed in the final interviews, one 
would reasonably expect more agreement than 
this. 

Factor C is another CPI factor on which
Do, Cs, Sy, Sp, and Sa have the major load-
ings. There are lower loadings for Wb, To,
Ac, Ai, Ie, and PS. The highest loading CPI
scales on this factor are grouped together by
Gough (1957) as measures of poise, ascend-
ency, and self-assurance. They differ from
those loading primarily on Factor A in that
they all seem to be concerned with ease of
social interaction and comfortableness with
others, whereas the scales loading on Factor A
are more concerned with responsibility and
self-control.

TABLE 2
ROTATED FACTOR MATRIX


                                                      Factors
Change Scores                          A      B      C      D      E      F     h²
CPI Change
  Do Dominance                       .16   -.10    .57    .03    .24    .34    .53
  Cs Capacity for status             .12    .06    .62    .18   -.10   -.08    .45
  Sy Sociability                     .19   -.13    .71    .05    .25    .18    .65
  Sp Social presence                 .08   -.03    .75    .19    .14   -.01    .63
  Sa Self-acceptance                 .03    .10    .70   -.04    .10    .09    .52
  Wb Sense of well-being             .53   -.03    .36    .00    .08    .54    .71
  Re Responsibility                  .54   -.06    .20   -.02   -.42   -.05    .51
  So Socialization                   .34   -.19    .04    .14   -.09    .28    .26
  Sc Self-control                    .74    .04   -.20    .08   -.02   -.09    .60
  To Tolerance                       .68    .02    .43    .16   -.10    .07    .69
  Gi Good impression                 .56    .27   -.03    .10    .36   -.03    .53
  Cm Communality                    -.02   -.12    .04   -.04   -.11    .66    .47
  Ac Achievement via conformance     .55   -.13    .33    .11    .30    .41    .70
  Ai Achievement via independence    .49    .01    .32   -.03    .07    .13    .37
  Ie Intellectual efficiency         .41   -.11    .41    .08   -.03    .51    .62
  Py Psychological mindedness        .35   -.04    .11    .12    .14    .13    .19
  Fx Flexibility                    -.08      ?      ?      ?      ?    .09    .15
  Fe Femininity                     -.10   -.06   -.06   -.10   -.39    .02    .18
Sentence Completion Change
  AS Active-soft                     .27      ?    .15    .04    .39    .16    .32
  AF Active-firm                    -.15   -.04    .19    .05    .43   -.01    .25
  PS Passive-soft                    .01    .00   -.34   -.22   -.52    .23    .49
  PF Passive-firm                    .11   -.16    .07    .08   -.43   -.47    .45
Client Ratings of Change
  Symptoms and complaints            .16    .07    .17    .77    .05   -.09    .66
  Understanding of self              .04    .44    .27    .41    .09    .01    .44
  Feeling and outlook on life        .13    .24    .08    .79    .04    .14    .73
  Overall rating                     .06      ?    .02    .85      ?    .02    .77
Therapist Ratings of Change
  Symptoms and complaints           -.02      ?   -.10      ?    .19   -.04    .61
  Understanding of self              .08    .74    .12   -.01   -.03    .05    .60
  Feeling and outlook on life       -.05    .64   -.05    .30    .24   -.25    .62
  Overall rating                     .01    .81   -.07    .20      ?    .17      ?


Factor D is a client rating factor and has
generally negligible loadings for all other vari-
ables. There is a low loading for the therapist
rating of change in feeling. Surprise was ex-
pressed when the therapist ratings seemed
largely independent of the other measures of 
change, but such surprise is even more appro- 
priate in regard to the client ratings because 
the test score changes (especially the CPI) 
represent a kind of client rating in themselves. 
As was mentioned in discussing Factor A, the 
test score changes represent differences in per- 
formance at two different times and thus are 
sensitive to changes that the client may not 
be aware of at the end of therapy, since his 
rating of change depends on memory of his 
earlier state. It is interesting to note that the 
client’s rating of change in understanding has 
only a moderate loading on this factor, sug- 
gesting that improved understanding is not so 
important in determining the client’s evalua- 
tion of the outcome of therapy. Going back to 
Factor B, the therapist rating factor, for a 
moment, it is interesting that the therapist’s 
rating of change in feeling has the lowest 
loading among the therapist ratings and that 
it does have a small loading on Factor D. 
This suggests that part of the lack of rela- 
tionship between the therapist and client rat- 
ings may be due to a difference in values 
rather than complete lack of agreement on 
what changes have occurred. The therapists 
seem to consider self-understanding more im- 
portant in their concept of improvement than 
does the client and conversely the client seems 
to value change in the way he feels somewhat 
more highly than does the therapist. Change 
in presenting symptoms seems to be about 
equally important for therapist and client. 

Factor E is defined mainly by loadings of 
the sentence completion scores and seems to 
represent change in the ratio of active to pas- 
sive solutions to the conflict presented in the 
sentence stem. The negative loading for Fe
and the positive loading for Gi are consist-
ent with the tentative interpretation of those
sentence completion scores, but the negative 
loading of Re does not seem to fit into any 
reasonable interpretation. 

Factor F has Cm, Ie, Wb, Ac, and PF as 
high loading tests. Since Cm, the highest load- 
ing variable, is a rare response scale and Wb, 
the second highest loading variable, is a re- 
sponse distortion scale, perhaps this factor 
represents change in the care taken in re-
sponding to the test items. This would be
consistent with the negative loading for PF 
since the PF sentence completions represent 
easy avoidance of a conflict situation and are 
fairly obviously not desirable responses. This 
interpretation is supported by the fact that 
on the profile presented by Gough (1957) of 
CPI items answered at random plotted on the 
male norms, the four scales deviating most 
from the mean are the Cm, Ie, Wb, and Ac 
scales, and the rank order of their deviation 
is the same as the rank order of their factor 
loadings on the present factor. 

Nature of the test score change factors. 
One question that is of interest is whether or 
not the factors identified in the test score 
changes are the same as those that would be 
isolated from the original test scores. An un- 
published factor analysis of the 18 CPI scales 
done independently on different Ss was com- 
pared with the two CPI factors of the present 
study. Factor A is practically identical to the 
first factor of the CPI analysis and the cor- 
relation between factor loadings based on the 
18 CPI scales is .90. Factor C is very similar 
to the second factor of the CPI analysis with 
a correlation between factor loadings of .91. 
Thus, there is clear-cut evidence that the two 
CPI change factors represent changes in the 
same influences that are responsible for co- 
variance among the CPI scales in the first 
place. 

Correlations between the two CPI change 
factors (A and C) and the corresponding fac- 
tors in the pretests were estimated by averag- 
ing the correlations between the pretest scores 
and the corresponding change scores for the 
scales with the highest loadings for the fac- 
tor using the z transformation. These correla- 
tions were —.39 for Factor A and —.44 for 
Factor C. This indicates that the lower (more 
maladjusted) the original score the greater 
the change tends to be. Since most of the Ss 
in this study had initial scores below the mean 
for college students, these correlations could 
be interpreted as indicating regression toward 
the mean due to unreliability of the test 
scores. Alternatively these negative correla- 
tions could be interpreted as indicating the 
greater tendency of the more maladjusted 
cases to improve, either because of therapy 
or spontaneously. The same regression effect
is present in the sentence completion scores
as is indicated by the estimated correlation 
of —.46 between pretest score and change 
score on Factor E. 
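The averaging of correlations by way of the z transformation, used above to estimate the factor-level correlations, can be sketched as follows. This is a minimal illustration only; the function name and the three input correlations are hypothetical and are not values from the study.

```python
import math


def average_correlations(correlations):
    """Average r values through Fisher's z transformation."""
    z_values = [math.atanh(r) for r in correlations]  # r -> z
    mean_z = sum(z_values) / len(z_values)            # average in z units
    return math.tanh(mean_z)                          # mean z -> r


# Hypothetical pretest-versus-change correlations for three scales of one factor.
scale_correlations = [-0.30, -0.45, -0.40]
factor_estimate = average_correlations(scale_correlations)
```

Averaging in z units rather than averaging the r values directly keeps the estimate from being pulled toward zero when the individual correlations differ in size.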


CHANGES OCCURRING WITH THERAPY 


The above factor analysis has indicated the 
dimensions along which change occurs. How- 
ever, it does not tell us whether or not the 
mean changes in the Ss receiving therapy are 
different from zero nor does it tell us whether 
or not these changes have been brought about 
or facilitated by the therapy. 


Evaluation of Mean Change 


The mean change in the salient variables 
for each of the factors is shown in Table 3. 
Every high loading variable for Factors A, 
B, C, and D shows significant change in a 
direction that would be considered to reflect 
improvement. The patterns of change for 
Factors E and F are not so clear-cut. Three 
of the four variables loading on Factor E 
show significant change but one of these, Re, 
is in the wrong direction. Only half of the 
high loading variables on Factor F show sig- 
nificant change, and this may be due to the 
loadings of these scales on other factors. 
Thus, Factors E and F cannot be said to 
show consistent changes. 

These results show that the 75 Ss, all of 
whom received therapy, when considered as 
a group show definite improvement on the 
two CPI factors and on the therapist and the 
client rating factors. 
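The significance tests of mean change can be checked from the reported means and standard deviations. The sketch below assumes a one-sample t statistic with the standard error taken as SD divided by the square root of N, and N = 75; the article does not state the exact formula, so small differences from the printed values are to be expected. The function name is hypothetical.

```python
import math


def one_sample_t(mean, sd, n, null_value=0.0):
    """t statistic for the difference of a sample mean from null_value."""
    standard_error = sd / math.sqrt(n)
    return (mean - null_value) / standard_error


# Wb change score in the therapy group: mean 2.12, SD 5.05, N = 75.
t_wb = one_sample_t(2.12, 5.05, 75)

# Therapist rating of symptoms: mean 3.27, SD .79, tested against
# a rating of 2, the "no change" point of the rating scale.
t_symptoms = one_sample_t(3.27, 0.79, 75, null_value=2.0)
```

For the test scores the null value is zero change; for the ratings it is the scale point labeled "no change".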


TABLE 3


SIGNIFICANCE TESTS OF CHANGE SCORES OF THE
SALIENT VARIABLES ON EACH FACTOR


Variable                        Meanᵃ      SD        tᵇ
Factor A
  Wb                             2.12     5.05      3.63**
  Re                             1.16     3.02      3.32**
  Sc                             1.24     4.70      2.62*
  To                             1.68     4.14      ?
  Gi                             1.21     4.68      2.18*
  Ac                             1.64     4.01      3.54**
Factor B (therapist ratings)
  Symptoms                       3.27      .79     13.90**
  Understanding                  2.96      .56     12.20**
  Feeling                        3.21      .89     11.73**
  Overall                        3.17      .74     13.70**
Factor C
  Do                             2.33     4.70      4.29**
  Cs                             1.93     3.60      4.64**
  Sy                             2.05     3.96      4.48**
  Sp                             2.89     5.26      4.75**
  Sa                             1.05     2.95      3.08**
Factor D (client ratings)
  Symptoms                       3.43      .84     14.70**
  Feeling                        3.35      .98     11.92**
  Overall                        3.37      .85     13.95**
Factor E
  PSᶜ                           -2.47     9.42      2.27*
  PFᶜ                           -0.35     9.70      0.31
  Reᶜ                            1.16     3.02     -3.32**
  AF                             3.32     9.45      3.02**
Factor F
  Wb                             2.12     5.05      3.63**
  Cm                             0.13     3.01      0.37
  Ie                             1.63     4.93      2.86**
  PFᶜ                           -0.35     9.70      0.32


a For test variables scores were obtained by subtracting pre-
test raw score from posttest raw score.

b Test of significance of difference of mean from zero. In
the case of ratings (Factors B and D) t test is of difference from
a rating of no change.

c These variables had negative loadings on the respective
factors.

* Significant at .05 level.

** Significant at .01 level.


Changes Related to the Therapy


Although the above results show significant
improvement on four of six change factors,
they do not indicate whether or not the
improvement was produced by the therapy.
There are, however, some comparisons within
the present data that bear on this important
question. One possible indication of the ef-
fect of therapy on the various change fac-
tors would be the correlation of the factors
with the number of interviews on the assump-
tion that the larger the number of interviews,
the greater should be the therapeutic effect.
These correlations can only be suggestive,
however, since the number of interviews is
confounded with time between tests for the
test score changes and probably is confounded
with the placebo effect for the ratings. Also
there is probably a nonlinear relationship be-
tween number of interviews and success as
has been suggested by Cartwright (1955).
Nevertheless, there was considerable variance
in number of interviews (mean = 14.7, SD =
9.2) and correlations between each factor and 
number of interviews were estimated by av- 
eraging the correlations of the highest loading 
variables with number of interviews using the 
z transformation. These combined correlations 
are shown in Table 4. With correlations based 
on 75 cases the correlation with Factor B 
(therapist rating) is the only one significant.

Another indication of the amount of ther-
apy received might be based on the skill of
the therapist. There were 41 therapists, each 
of whom was carefully supervised. Correla- 
tions between supervisors’ rating of thera- 
peutic ability of the therapist and the vari- 
ous change factors were estimated by the 
procedure used above. These correlations are 
shown in Table 4. Rated skill of the thera- 
pist is significantly related to both the thera- 
pist and client rating of change. Some con- 
tamination of the supervisors’ rating is a pos- 
sibility since the supervisor’s judgment of the 
therapist may have been influenced by the 
progress of the case, but this artifact is at- 
tenuated by the fact that each of the thera- 
pists carried several cases. Even though all 
the cases were not included in this study, 
they were all considered by the supervisor in 
making his rating. Number of interviews
correlated .21, a borderline significant value, with
supervisors’ rating of skill of the therapist.


TABLE 4


CORRELATIONS OF THE CHANGE FACTOR SCORES WITH
NUMBER OF INTERVIEWS AND SKILL OF THERAPIST


                  Correlation with        Correlation with
Change Score      Number of Interviews    Skill of Therapist
Factor A                  ?                       ?
Factor B                  ?                       ?
Factor C                 .02                     .07
Factor D                  ?                      .27**
Factor E                 .05                     .16
Factor F                 .09                     .01


** Significant at .01 level with one-tailed test.


COMPARISON WITH A CONTROL GROUP


On the basis of the above approximations
it appears that the therapist and client rat-
ings are the only measures of change that are
influenced by the psychotherapy. However,
comparisons within a therapy group leave
much to be desired as a means of evaluating
therapy, and data for a nontherapy control
group were collected to serve as a baseline for
the evaluation of changes in Factors A, C,
and D (three of the four factors showing sig-
nificant changes in the therapy group).


Considerations in Selecting Control Subjects 


One of the most difficult problems in the 
evaluation of the results of psychotherapy has 
been the securing of adequate untreated con- 
trol cases. Many investigators have omitted 
controls entirely. Some, such as Barron and 
Leary (1955), have used waiting list cases, 
and others, such as Rogers and Dymond 
(1954), have used a normal group of non-
clinic Ss.

Although these two studies represent the 
most adequate controls available for changes 
in self-ratings with therapy, some objections 
can be raised to both. Barron and Leary have 
pointed out that there may be considerable 
therapeutic effect in an initial intake inter- 
view and in being on a clinic list. They also 
suggest that self-rating tests may be affected 
by the relationship to the clinic and thera- 
pist, and these relationships are quite differ- 
ent for clinic and waiting list controls at the 
time of the posttest. The nonclinic control 
group used by Rogers and Dymond has the 
disadvantage of not being comparable to the 
therapy group in terms of maladjustment. On 
both of their self-rating measures, the self- 
ideal correlation and the Q sort adjustment 
score, the nonclinic control group achieved 
higher mean scores on the pretest than any 
mean score ever achieved by the therapy 
group. Thus, if the tests have any ceiling ef- 
fect, the control group will be more affected 
than the therapy group. 

In the present study an attempt was made 
to secure a nonclinic control group that was 
comparable to the therapy group in a num- 
ber of important respects. The control group 
used in this study had the following charac-
teristics: (a) each therapy case was matched
with a corresponding control case for sex
and for initial scores on the criterion test
(the CPI), (b) time between pretest and
posttest for each control case was equivalent
to the time taken by therapy in the corre-
sponding therapy case, (c) both therapy and
control groups were drawn from the same
student population which is fairly homogene-
ous with regard to age, (d) the control Ss
were told at the time of the posttest that the
effects of a course in which they were en-
rolled, introductory psychology, were being
studied. This was an attempt to produce a
similar test-taking attitude to that of the 
therapy Ss who knew that therapy was be- 
ing evaluated when they took the posttest. 


Procedure for Obtaining Control Data 


Data for the therapy cases were collected over a 
two-year period. During the second year of the 
study a control group was obtained for the 42 ther- 
apy cases obtained during the first year. No control 
cases for the second-year therapy Ss were collected, 
so the N for all control group comparisons is 42. 

Early in the fall semester of the second year of the 
study, 183 introductory psychology students signed 
up to participate in an experiment, which was a 
course requirement, and were given the CPI. The 
tests were administered in small groups. The CPI 
profiles for these 183 Ss were plotted on transparent 
paper and matched visually with the profiles of the 
therapy Ss. The control S of the same sex and with 
the most similar CPI profile was selected as the 
matched control case for each of the 42 therapy Ss. 
In those instances where several control Ss had simi- 
lar profiles to a given therapy S, the one with the 
smallest D score (Cronbach & Gleser, 1953) was se- 
lected. This procedure yielded reasonably good match- 
ing in every case. Since the therapy and control 
groups were to be compared on the two CPI change 
factors it would have been desirable to match the 
groups on the basis of factor scores rather than pro- 
files. However, the control group had to be col- 
lected before the factor analysis was done so this 
was not possible. Reasonably good matching on fac-
tor scores was obtained by the procedure used. The
mean pretest factor scores for Factors A and C (the
two CPI factors) of the therapy and control groups
are shown in Table 5. Although fairly good matching
of therapy and control pairs was obtained, this table
shows a nonsignificant tendency for the control group
to have higher scores.


TABLE 5


COMPARISON OF INITIAL CPI FACTOR SCORES
OF THERAPY AND CONTROL GROUPS


              Therapy           Control
Factor      Mean     SD       Mean     SD        t
A           47.2    9.7       48.9    8.8     0.87
C           49.8    9.1       52.3    9.5     1.30
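The tie-breaking rule in the matching procedure, the smallest D score among candidates of the same sex, can be sketched as below. This is a simplified illustration: the original matching was done visually on transparent profile sheets, with D used only when several candidates looked similar. The record layout, the identifiers, and the three-scale profiles are hypothetical; D is taken here as the square root of the summed squared differences between corresponding scale scores.

```python
import math


def profile_distance(profile_a, profile_b):
    """D score: square root of the summed squared scale differences."""
    return math.sqrt(sum((profile_a[s] - profile_b[s]) ** 2 for s in profile_a))


def best_match(therapy_case, candidates):
    """Return the same-sex candidate whose profile has the smallest D."""
    same_sex = [c for c in candidates if c["sex"] == therapy_case["sex"]]
    return min(
        same_sex,
        key=lambda c: profile_distance(therapy_case["profile"], c["profile"]),
    )


# Hypothetical records with three scales each.
therapy_case = {"id": "T1", "sex": "M", "profile": {"Do": 40, "Sy": 45, "Wb": 38}}
candidates = [
    {"id": "C1", "sex": "F", "profile": {"Do": 40, "Sy": 45, "Wb": 38}},
    {"id": "C2", "sex": "M", "profile": {"Do": 42, "Sy": 44, "Wb": 39}},
    {"id": "C3", "sex": "M", "profile": {"Do": 50, "Sy": 45, "Wb": 38}},
]
match = best_match(therapy_case, candidates)
```

Candidate C1 has an identical profile but is excluded by the sex restriction, so the closest eligible profile is selected instead.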

The 42 matched control Ss were called in indi- 
vidually for retesting after a period of time equal to 
that between pretest and posttest for the correspond- 
ing therapy S. At this time the control Ss were told 
that the Es were evaluating the effectiveness of the 
introductory psychology course in bringing about 
personality changes. They were retested with the 
CPI and were given a slightly altered form of the 
rating scale shown in Table 1. References to the 
clinic were changed to make them appropriate to 
the introductory psychology course. Throughout the
testing of the control group an attempt was made
to structure the testing situation in a manner as
similar to that of the therapy group as possible.

Factor scores for the two CPI factors were ob- 
tained by calculating the mean standard score change 
obtained from the CPI profile form for the highest 
loading variables on each factor. Thus, the factor 
score for Factor A is the mean standard score change 
for Scales Wb, Re, Sc, To, Gi, and Ac; and for 
Factor C is the mean standard score change for 
Scales Do, Cs, Sy, Sp, and Sa. The factor score for 
Factor D, client rating, is the mean rating for Scales 
1, 3, and 4 of the rating scale shown in Table 1. 
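The factor scores described above are unweighted means of standard score changes over the highest loading scales. A minimal sketch of that calculation follows; the scale lists are the ones named in the text, while the function name and the pretest and posttest standard scores for the single client are hypothetical.

```python
FACTOR_SCALES = {
    "A": ["Wb", "Re", "Sc", "To", "Gi", "Ac"],
    "C": ["Do", "Cs", "Sy", "Sp", "Sa"],
}


def factor_change_score(pretest, posttest, scales):
    """Mean of (posttest - pretest) standard scores over the given scales."""
    changes = [posttest[s] - pretest[s] for s in scales]
    return sum(changes) / len(changes)


# Hypothetical CPI standard scores for one client.
pretest = {"Wb": 40, "Re": 45, "Sc": 50, "To": 42, "Gi": 38, "Ac": 44,
           "Do": 48, "Cs": 46, "Sy": 50, "Sp": 52, "Sa": 55}
posttest = {"Wb": 46, "Re": 45, "Sc": 53, "To": 48, "Gi": 41, "Ac": 44,
            "Do": 53, "Cs": 46, "Sy": 55, "Sp": 62, "Sa": 55}

score_a = factor_change_score(pretest, posttest, FACTOR_SCALES["A"])
score_c = factor_change_score(pretest, posttest, FACTOR_SCALES["C"])
```

Because every scale in a factor receives equal weight, these are approximations to factor scores rather than estimates weighted by the loadings.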


TABLE 6 

COMPARISON OF THERAPY AND CONTROL GROUPS ON CHANGE FACTORS A, C, AND D 

                                                          Difference between
                                                          therapy and control (t)
          Therapy                  Control
Factor    Mean   SD     t(a)       Mean   SD     t(a)     Variances(b)   Means(b)
A         3.26   6.35    3.33**    4.07   4.79    5.52**    1.83          -0.56
C         5.14   7.58    4.39**    1.84   5.66    2.10*     1.89           2.38**
D         3.50   0.67   19.35**    2.73   0.53   11.90**    1.64           6.18**

a Test of significance from zero or from rating of no change in the case of Factor D. 
b Test of significance of differences between variances and between means of the change scores 
of therapy and control cases, taking into account the correlation of change scores brought about 
by the matching of therapy and control cases. 
* Significant at .05 level. 
** Significant at .01 level. 

TABLE 7 

COMPARISON OF THERAPY AND CONTROL CHANGE SCORES ON 
INDIVIDUAL CLIENT RATINGS AND CPI SCALES 

                                                                                    Test of difference (t)
                                     Therapy (N = 42)         Control (N = 42)      between therapy and control
Scale                                Mean    SD     t(a)      Mean    SD     t(a)      Variances(b)   Means(b)
Factor A
  Wb  Sense of well-being            2.64   5.74    2.98**    2.69   3.31    5.27**      3.68**        -0.48
  Re  Responsibility                 0.60   3.07    1.26      1.19   4.42    1.74       -2.38**        -1.20
  Sc  Self-control                   0.76   4.53    1.09      3.81   6.01    4.11**     -1.83          -2.08*
  To  Tolerance                      1.88   4.66    2.62*    -3.17   6.45   -3.18**     -2.11*          3.38**
  Gi  Good impression                1.19   4.43    1.74      6.81   5.95    7.42**     -1.90          -4.66**
  Ac  Achievement via conformance    1.57   4.32    2.36*     2.26   3.71    3.95**      0.99          -0.78
Factor C
  Do  Dominance                      2.88   5.20    3.59**    1.59   5.32    1.94       -0.14           1.20
  Cs  Capacity for status            2.07   4.08    3.29*     0.24   3.25    0.48        1.56           3.08**
  Sy  Sociability                    2.40   4.48    3.47**    1.09   4.18    1.69        0.44           1.31
  Sp  Social presence                3.71   5.52    4.36**    1.62   4.80    2.19*       0.90           1.80
  Sa  Self-acceptance                1.26   3.16    2.58*     0.14   3.32    0.27       -0.30           1.60
Factor D (client ratings)
  Symptoms                           3.5     .55   17.02**    2.7     .47    9.64**      1.59           5.66**
  Feeling                            3.5     .79   12.30**    2.6     .53    7.28**      1.93           4.70**
  Overall                            3.5     .55   17.62**    2.9     .43   13.53**      1.89           3.74**
Not Scored on Any Factor
 CPI Scales
  So  Socialization                  0.60   4.27    0.90      1.88   5.34    2.28*      -1.45          -1.27
  Cm  Communality                    0.48   2.91    1.06      0.60   2.00    1.93        2.52           1.10
  Ai  Achievement via independence   1.62   3.53    2.97**    1.24   3.54    2.27*      -0.01           0.72
  Ie  Intellectual efficiency        1.67   5.06    2.14*     1.60   4.86    2.13*      -0.28           0.33
  Py  Psychological mindedness       0.26   3.44    0.49      0.98   2.31   [illegible]  2.64*         -1.15
  Fx  Flexibility                    0.60   2.35    1.64     -0.02   2.85   -0.05       -1.26           1.29
  Fe  Femininity                    -0.50   2.86   -1.13     -0.19   2.16   -0.57        1.80          -0.51
 Client Rating
  Understanding of Self              3.2     .65   11.95**    3.0     .34   19.02**      2.78**         1.56

a Test of significance of difference from zero, or from a rating of no change in the case of client ratings. 
b Test of significance of difference between variances and between means of the change scores of 
therapy and control groups, taking into account the correlation of change scores brought about by 
the matching of therapy and control cases. 
* Significant at .05 level. 
** Significant at .01 level. 


Comparison of Therapy and Control Groups 


The change scores for therapy and control 
groups on the three factors to be compared 
are shown in Table 6. This table shows that 
both therapy and control groups show signifi- 
cant changes on all three factors, all in the 
direction of improvement. When therapy and 
control groups are compared the therapy 
group shows significantly greater improvement 
than the control group on Factors C and D. 
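
The within-group t values in Table 6 can be checked from the tabled means and standard deviations. The sketch below assumes the usual one-sample formula, t = mean / (SD / sqrt(N)), with N = 42 per group; the function name is illustrative, and the formula is an assumption, since the text does not state how the values were computed.

```python
import math

def t_from_summary(mean_change, sd, n, no_change=0.0):
    """One-sample t for a mean change score against a 'no change' reference value."""
    return (mean_change - no_change) / (sd / math.sqrt(n))

# Factor A entries from Table 6, assuming N = 42 in each group.
t_a_therapy = t_from_summary(3.26, 6.35, 42)  # close to the tabled 3.33
t_a_control = t_from_summary(4.07, 4.79, 42)  # close to the tabled 5.52
```

For Factor D the reference value is the rating that denotes no change rather than zero, which is what the `no_change` argument is for.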


Other studies (such as Cartwright, 1956) 
have found therapy groups to have greater 
variance on change measures than control 
groups. This is also true of the present ther- 
apy and control groups on all three factors, 
but these differences are not statistically sig- 
nificant unless one is willing to use a one- 
tailed test for this comparison. 
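
Because therapy and control cases were matched, the comparisons of means and of variances must allow for correlated scores. The text does not name the tests used; the sketch below assumes a paired t test for means and the Pitman-Morgan t test for correlated variances, and all data shown are hypothetical.

```python
import math
from statistics import mean, stdev

def pearson_r(x, y):
    """Product-moment correlation between paired scores."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def t_paired_means(x, y):
    """Paired t for the difference between means (df = n - 1)."""
    d = [a - b for a, b in zip(x, y)]
    return mean(d) / (stdev(d) / math.sqrt(len(d)))

def t_paired_variances(x, y):
    """Pitman-Morgan t for the difference between correlated variances (df = n - 2)."""
    n = len(x)
    s1, s2 = stdev(x), stdev(y)
    r = pearson_r(x, y)
    return (s1**2 - s2**2) * math.sqrt(n - 2) / (2 * s1 * s2 * math.sqrt(1 - r**2))

# Hypothetical change scores for matched therapy-control pairs (not study data).
therapy = [5, -2, 9, 0, 12, 3, -4, 8]
control = [2, 1, 3, 0, 4, 2, 1, 3]
t_means = t_paired_means(therapy, control)
t_variances = t_paired_variances(therapy, control)
```

A positive `t_variances` indicates greater variance in the first (therapy) series, the direction reported by Cartwright (1956).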

A comparison of therapy and control groups 
on the individual scales is shown in Table 7. 
Here it can be seen that the significant differences 
between therapy and control groups 
on Factors C and D are due to consistent differences 
in all of the scales loading on these 
factors. This supports the contention that it 
is on the underlying dimension common to 
all scales loading on the factor that change 
occurs. This consistency is not found in the 
change scores loading on Factor A. The con- 
trol group shows greater change than the ex- 
perimental group on most of the scales load- 
ing on Factor A, but shows a significant 
change in the wrong direction on one scale, 
To. The inconsistent performance of the To 
change scores in the control group is puzzling, 
but a recheck of the data indicates that it is 
not due to recording or calculating error. The 
change scores with no high loadings on any 
factor (for the most part scales with low com- 
munalities in the factor analysis) show no sig- 
nificant differences between therapy and con- 
trol groups. This suggests that the dimensions 
in the measures used in this study that are 
affected by psychotherapy are pretty well ac- 
counted for by the factors isolated. 


Some Implications for Research Design 


It may be appropriate to mention at this 
point the great clarification of the therapy 
and control comparison that was brought 
about by the factor analysis. The CPI data 
shown in Table 7 were available before the 
factor analysis was done, and without the or- 
ganizing influence of the factors, the data 
showed the confusing result of the therapy 
group improving more on Cs and To and the 
control group improving more on Sc and Gi. 
Since the number of comparisons made was 
relatively large, it was difficult to evaluate 
the statistical reliability of this pattern of 
differences. However, when the scales are 
grouped according to their factor loadings as 
in Table 7, an orderly pattern emerges.4 

The control group changes shown in Table 7 
indicate the necessity of using a control group 
in the evaluation of psychotherapy change. It 
cannot be said on the basis of the present 
data whether the control group changes are 
due to a general optimism leading to improved 
self-ratings or whether they represent 
regression effects due to the selection of the 
more maladjusted cases by the matching process. 
However, the marked tendency of the 
therapy cases with the lower test scores to 
show the greatest improvement, which was 
discussed above, suggests that it is necessary 
for an adequate control group to be matched 
with the therapy group on the criterion measures 
to be used. 

4 Harrison Gough (personal communication) has 
pointed out that the grouping of CPI scales brought 
about by the factor analysis was already implicit in 
the four clusters of scales described in the manual. 
The first cluster is very similar to our Factor C and 
the second is very similar to our Factor A. 


Discussion 


The above results appear to show that psy- 
chotherapy is effective in producing person- 
ality changes that are reflected in changes on 
certain CPI scales and in behavior discernible 
by both therapist and client. However, the 
factor analysis has shown that these changes 
occur along three independent dimensions. 
Most investigators seem to assume that psy- 
chotherapy has a unitary effect in improving 
the adjustment of the client, and that lack of 
agreement among criteria is due to error in 
measurement of some sort. The present find- 
ings can be interpreted in a way that is con- 
sistent with this assumption of unidimensional 
change. Therapist and client ratings and CPI 
measures of sociability may all be sensitive to 
a unitary change in adjustment brought about 
by therapy, but each of these types of meas- 
ures may in addition be affected by special 
biases or other errors which are responsible 
for the finding of independent factors. If this 
were the case, one would expect to find some 
correlation between the change scores on the 
three factors due to the common influence of 
the personality change in a group receiving 
therapy. An inspection of the correlation 
matrix of change scores reveals that client 
ratings had a low positive relationship with 
both the test score changes loading on Factor C 
and the therapist ratings. But the therapist 
ratings and test score changes were not 
correlated. However, the client ratings show 
similar correlations with the test score changes 
loading on Factor A and with the other scores 
that do not change with therapy. This sug- 
gests that the relationship between client rat- 
ings and the other variables is due to com- 
mon biasing effects rather than the influence 
of a general psychotherapy change. The alternative 
interpretation of the present results 
is that the effects of psychotherapy are multi- 
dimensional. In view of the lack of correla- 
tion among change scores shown to be due to 
therapy, this interpretation seems somewhat 
more consistent with the data. 

If the effects of psychotherapy are multi- 
dimensional, the psychotherapy itself must 
have several independent aspects which have 
differential effects on the various criterion 
measures. For example, if therapist ratings 
and test score changes both reflect improve- 
ment due to therapy, and yet they are un- 
correlated in a group receiving therapy, they 
must be due to different aspects of the therapy. 

The CPI change (Factor C) differentiates 
therapy and control groups, but does not ap- 
pear to be related to either amount or in- 
tensity of the therapy as represented by num- 
ber of interviews and skill of therapist. This 
suggests that the mere fact of having entered 
into a therapeutic relationship is sufficient to 
produce this change. The first association one 
is likely to have to this is that perhaps the 
CPI change is a manifestation of the tendency 
to present oneself as in need of help at 
the beginning and as not needing help at the 
end of therapy (the old hello-goodbye effect). 
There are, however, two findings that suggest 
that the CPI changes represented by Factor 
C are not due to increased tendency to pre- 
sent oneself in the best light at the end of 
therapy. The first is that therapy and control 
groups did not differ on Factor A, which is the 
factor that should be most sensitive to changes 
in tendency to present oneself in a good light. 
The second is the Es’ nonquantitative ob- 
servation that the therapy Ss did not feel any 
particular obligation to the clinic. The ther- 
apy group was much more difficult to sched- 
ule for posttests than the control group and 
many more of the therapy Ss missed posttest 
appointments and expressed resentment at the 
imposition than was the case with the gen- 
erally cooperative control Ss. 

Thus, the CPI change with therapy seems 
on the one hand to be an attitude change 
brought about by simply being in therapy, 
while on the other hand, it does not appear 
to be simply a change in tendency to present 
oneself in the best light. Perhaps the CPI differences 
between therapy and control groups 
are due to differences in the changes expected 
by the Ss to accrue from therapy and in- 
troductory psychology. Or, alternately, per- 
haps they reflect genuine personality changes 
brought about by therapy. The latter inter- 
pretation would necessitate the assumption 
that the supervisors’ ratings do not corre- 
spond to the therapists’ ability to produce this 
particular change. 

Therapist and client ratings were both re- 
lated to the skill of the therapist as rated by 
the supervisor, and the difference between the 
correlation of these two variables with number 
of interviews was not significant. They 
were also significantly correlated with each 
other. (Correlations ranged from .23 to .36 
on corresponding scales.) Thus, the assump- 
tion that they are reflecting independent 
changes is not as well justified as in the case 
of the CPI changes. The lack of higher cor- 
relation between them is probably due to dif- 
ferent biases inherent in the different points 
of view. For example, data not presented in 
this study indicate that the therapists tended 
to rate as improved those with high (well ad- 
justed) initial CPI scores, suggesting that 
their rating of change may be influenced by 
the general level of adjustment, while this was 
not observed in the client ratings. 

One possible interpretation of the relation- 
ship of client and therapist ratings with skill 
of therapist is that therapist, client, and su- 
pervisor may all be reacting to the ability of 
the therapist to form pleasant, lasting rela- 
tionships. However, if this were the case the 
clients would be expected to show evidence of 
presenting themselves in a good light in mak- 
ing the favorable ratings that produced the 
very significant difference from the control 
group on Factor D. Evidence discussed above 
suggests that this was not the case; thus it 
appears more likely that both therapist and 
client are rating real personality changes in 
the client. 

SUMMARY 


A number of measures of change with psy- 
chotherapy including several therapist and 
client ratings and change scores for 18 CPI 
scales were factor analyzed, using 75 therapy 
cases as subjects. The six factors obtained 
from this analysis were largely identified by 
the measuring instruments used. Of the six 
factors four showed mean changes represent- 
ing significant improvement. The effects of 
therapy on the various change factor scores 
were studied by comparison with a matched 
control group and by correlating the change 
factor scores with number of interviews and 
with rated skill of therapist. These compari- 
sons indicated that psychotherapy has signifi- 
cant effects as measured by therapist and 
client ratings of change and by change in a 
group of CPI scales reflecting poise and inter- 
personal effectiveness. 
In an earlier article, Vernier, Whiting, and 
Meltzer (1955) reported certain signs ob- 
tained from the house and person drawings of 
the House-Tree-Person (H-T-P) Test which 
differentiate significantly the MHB (maximum 
hospital benefit) discharged patient from the 
AMA (against medical advice) discharged pa- 
tient upon his admission to a Tuberculosis 
Service. They found significant differences be- 
tween MHB and AMA groups on 7 of 14 test 
signs selected for analysis (see Table 1, Orig- 
inal Data). It was concluded by the authors 
that: 


Despite the small samples available for analysis, a 
sufficient number of reliable differences in test scores 
were found to permit accurate prediction for individ- 
ual patients. The data for one of the tests, the draw- 
ing of the house, justified the development of a spe- 
cific index. 


PROBLEM 


Because of the theoretical implications as- 
cribed by the authors to the predictive signs, 
and because of their possible use in planning 
therapeutic programs for TB patients, the 
need for cross-validating the original results 
was deemed essential. The present study, then, 
represents an attempt to provide cross-valida- 
tion information on comparable groups of TB 
patients. In fact, our second cross-validation 
sample represents, in part, the same patient 
population from which their original sample 
was drawn. 


METHOD AND RESULTS 


H-T-P Test protocols were obtained from a 
total of 397 patients admitted to our Tuberculosis 
Service during the period of December 
1954 through June 1956. For purposes of 
analysis 397 “house” drawings and 383 “per- 
son” drawings were available. One hundred 
thirty-six of these patients received eventual 
AMA discharges, while 261 patients were discharged 
MHB. Because the Ns of both groups 
were greater than 100, the formula recom- 
mended by McNemar (1949, p. 76) was 
utilized to test the significance of the differ- 
ences between groups on each of the 14 test 
signs. 

As noted in Table 1 (First Cross-Validation 
Sample) none of the differences between the 
groups reported significant in the earlier study 
was found in this current analysis. Indeed, a 
reversal in one of the signs, significant at the 
.05 level, was noted in the current sample. 
The findings suggested the need for further 
corroborative efforts. 

Consequently, the H-T-P protocols of a 
second sample of TB patients admitted during 
the period January 1953 to January 1954 
were studied. All available patients receiving 
either AMA (N = 74) or MHB (N = 77) discharges 
were used. With Ns below 100 in both 
groups, McNemar’s (1949, pp. 76-77) recom- 
mended formula for “small sample” differences 
between proportions, and the rule-of-thumb 
criterion which indicates when it is safe to 
compare groups by the D/sp technique were 
used. Again, as noted in Table 1 (Second 
Cross-Validation Sample) no significant dif- 
ferences between the AMA and MHB test pro- 
tocols were found. 
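
The comparison of two groups on a single test sign can be sketched as a critical ratio for the difference between proportions. The pooled-proportion formula below is the standard large-sample form and is assumed, not quoted, to match McNemar's; the rule-of-thumb check is a hypothetical stand-in (smallest expected frequency of at least 5), since the exact criterion is not given in the text.

```python
import math

def pooled_proportion(p1, n1, p2, n2):
    """Proportion showing the sign when both groups are combined."""
    return (p1 * n1 + p2 * n2) / (n1 + n2)

def critical_ratio(p1, n1, p2, n2):
    """Difference between two proportions divided by its standard error."""
    p = pooled_proportion(p1, n1, p2, n2)
    se = math.sqrt(p * (1 - p) * (1 / n1 + 1 / n2))
    return (p1 - p2) / se

def meets_rule_of_thumb(p1, n1, p2, n2, minimum=5.0):
    """Assumed criterion: the smallest expected frequency must reach `minimum`."""
    p = pooled_proportion(p1, n1, p2, n2)
    return min(n1, n2) * min(p, 1 - p) >= minimum

# "Window detail drawn as +", first cross-validation sample (Table 1 percentages).
cr_window = critical_ratio(0.40, 136, 0.31, 261)

# "Smoke from chimney", original data: 26% of 50 AMA vs. 0% of 30 MHB.
smoke_ok = meets_rule_of_thumb(0.26, 50, 0.0, 30)  # False under the assumed threshold
```

A critical ratio below 1.96 falls short of the .05 level on a two-tailed test, which is the case for `cr_window`.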

As a further step, the statistical findings of 
the original study were re-evaluated. The 
formula for small sample differences and the 
rule-of-thumb criterion again were employed. 
Significant differences between the two groups 
were found on six of the seven test signs 
originally reported as significant. However, as 
noted in the Reanalysis of Original Data 
section of Table 1, four of these differences 
did not meet the rule-of-thumb criterion, suggesting 
their very limited predictive value. 
The two remaining signs, which were reported 
as significantly different, did fulfill the rule-of-thumb 
criterion. However, as noted earlier, 
these significant differences were not confirmed 
in our two cross-validation samples. 

TABLE 1 

COMPARISON OF AMA AND MHB GROUPS ON THE HOUSE-TREE-PERSON TEST 

                                                                  Reanalysis    First Cross-           Second Cross-
                                      Original Data               of Original   Validation Sample      Validation Sample
                                                                  Data
                                      AMA        MHB                            AMA         MHB        AMA        MHB
Test Signs                            (N = 50)   (N = 30)                       (N = 136)   (N = 261)  (N = 74)   (N = 77)
                                      %          %          CR    CR            %           %          %          %
Door on left side of page only        6          27         2.3*  2.64[?]       25          22         14         26
Door details, 2 or more present       24         30         0.6   0.6           15          18         24         23
Window detail drawn as +              62         23         3.9** 3.39**        40          31         34         22
Number of windows 4 or more           32         53         1.8   1.85          38          35         36         47
Smoke from chimney                    26         0          4.3*  3.05[?]       21          13         22         18
Walk present                          20         10         1.2   1.17          13          8          12         22
Steps drawn                           18         27         1.0   0.96          19          16         28         22
Road indicated                        12         0          2.4*  1.97[?]       2           1          15         [illegible]
Shrubbery and/or trees drawn          2          10         2.0   1.5           3           4          4          9b
Small side of house on right          [illegible] 20        2.4*  2.17[?]       4           9a         7          5b
  side of page
Ground line drawn                     22         3          [illegible] 2.30    10          11         18         12
                                                                                (N = 134)   (N = 249)
Head facing to right of page          20         7          1.6   1.57          13          16         15         18
Neck either omitted or drawn          50         37         1.2   1.13          49          51         39         43
  wider than long
Hands omitted                         44         23         2.1*  1.91          54          57         26         29

* Significant at .05 level. 
** Significant at .01 level. 
a Reversed. 
b These cannot be safely interpreted as significant. 

DISCUSSION 


It is difficult to account for the large dis- 
crepancies between results of the original 
study and the results obtained in the cross- 
validations. Investigation of the character- 
istics of the original and current patient sam- 
ples revealed minimal differences between the 
groups with regard to sex, age, educational 
level, and personality traits. In addition, at 
least 50% of the second cross-validation sam- 
ple included patients used in the original 
study. Since the scoring method is rather clear-cut, 
the scoring of the data does not appear 
to be an area which contributed significantly 
to the obtained large discrepancies. 

Had the authors of the original study ap- 
plied McNemar’s rule-of-thumb criterion a 
more cautious interpretation of the results of 
the original study would have been offered. 


SUMMARY 


The H-T-P protocols of two samples of 
patients who had obtained either MHB dis- 
charges or AMA discharges were analyzed in 
an effort to cross-validate the findings of an 
earlier study (Vernier, et al., 1955). None of 
the seven signs of the original study which 
were reported to differ significantly between 
the groups was found to hold up in the cross- 
validation. 
The seven signs reported in the original 
study as significantly different were reanalyzed 
by appropriate statistical techniques; 
five of these signs were found to be statistically 
untenable. 

The results of the present study suggest 
that the implications reported in the original 
study for projective test theory and for the 
problem of AMA and MHB behavior prediction 
should be utilized with extreme caution. 
CHANGES IN RORSCHACH PERFORMANCE AND 
CLINICAL IMPROVEMENT IN SCHIZOPHRENIA 1 

The concept of withdrawal as descriptive 
of the behavior of the schizophrenic patient 
(Bleuler, 1916; Coon, 1944; Henderson & 
Gillespie, 1950; Masserman, 1946; Strecker, 
Ebaugh, & Ewalt, 1951) has been over- 
shadowed by the controversies and differences 
in opinion about the consequences, etiology, 
and mechanisms of schizophrenia. The com- 
plex behavior known as withdrawal involves 
the total organism and manifests itself in a 
variety of ways, both in psychologic and 
physiologic behavior (Angyal, Freeman, & 
Hoskins, 1940). With clinical improvement 
following a schizophrenic episode, patients 
show greater interaction with others, more re- 
sponsiveness to the environment, and greater 
expression of emotion (Greenblatt & Solomon, 
1953). These changes are evidence of a re- 
turn to “normal” behavior. We can use these 
changes in behavior to identify three areas 
in which withdrawal, so important in under- 
standing schizophrenia, occurs. These can be 
grouped as the areas of (a) interpersonal relations, 
(b) attitudes towards or perception 
of the world, and (c) the emotional life. Each 
area may be related to the special interests of 
a different psychiatric school. The Sullivan 
school concerns itself particularly with with- 
drawal from people. Meyer (Cameron, 1944) 
and Campbell (1935) emphasize withdrawal 
from the world (environment). The Freudian 
school emphasizes withdrawal of emotions 
from external objects as libidinal energy is 
centered on or in the self. 


1 This study was carried out at the Boston Psy- 
chopathic Hospital while the writer was a USPHS 
Fellow. The complete report of the study is on file 
at Boston University Library, as the writer’s unpub- 
lished dissertation of the same title (1955). 

2 The writer is indebted to Chester C. Bennett for 
his invaluable critical assistance. 


In this study the assumption was made that 
the Rorschach test has satisfactory indices to 
reflect reactions in the areas of interpersonal 
relations, perception of the world and ability 
to deal with emotions, and that these indices 
are sensitive to changes within individuals. 
The Rorschach indices reflecting responses in 
the three areas were derived from Rorschach 
theory and were not empirically determined 
single scores. The indices were essentially 
those combinations of scores and ratios 
weighed and evaluated according to their 
theoretical significance by the clinician when 
he interprets a Rorschach protocol. 

The Rorschach scores reflecting the empa- 
thy, rapport, and feeling toward others and 
which were called indices of interpersonal re- 
lations are (a) the Movement and Human 
responses (M + H) and (b) the ratio of 
form-color responses to the sum of color- 
form and pure color responses (FC:CF + C). 
The patient’s perception of the world was re- 
flected in four sets of Rorschach indices: (a) 
the perception of stimuli as others see them 
(F + %), (b) conformity with the thinking 
of the group (the number of Popular re- 
sponses), (c) evidence that the world is a 
source of fear and danger (responses with 
destructive and tension-laden content), and 
(d) distortion in the distance maintained by 
the individual between himself and the world 
(Rejections, Denials, Self-References). The 
expression and control of emotion were re- 
flected in four sets of Rorschach indices: (a) 
the ratio of intellectually determined responses 
to the total number of responses (F%), (b) 
the affective energy available for response to 
external stimuli (8-9-10%), (c) emotional 
maturity and control (FC:CF:C), and (d) 
the ratio of the sum of the emotional responses 
which are controlled to those responses 
where control is secondary or lacking (FC + 
FY + FV:CF + C + YF + Y + VF + V). 


PROCEDURE 


The general hypothesis tested was that patients 
who show clinical improvement have significantly 
greater incidences of changes in the direction of 
healthier Rorschach responses than patients who do 
not show clinical improvement. Changes in the di- 
rection of healthier responses as these are judged in 
Rorschach practice were taken as evidence of dimi- 
nution of withdrawal. 

The Rorschach protocol of each patient which was 
obtained when he was hospitalized as acutely ill was 
compared with his protocol obtained just preceding 
his departure from the hospital, regardless of con- 
dition at time of discharge. Predictions for the 
changes in each set of Rorschach indices accompany- 
ing changes in clinical condition were: 

1. In the area of interpersonal relations, clinically 
improved patients would show changes in the direc- 
tion of a greater number of healthy movement and 
human responses (M +H) and a relative increase 
in the ratio of FC responses to the sum of CF+C 
responses. 

2. In the area dealing with perception of the 
world, the clinically improved patient would show 
changes in the F+% in the direction of the opti- 
mum range; the number of Popular responses would 
increase; Content would contain fewer signs of ten- 
sion; and there would be fewer Rejections, Denials, 
Self-References. 

3. In the area dealing with the expression and 
control of emotion, patients showing clinical im- 
provement would show change in the direction of 
the optimum range in the F% and in the 8-9-10% ; 
the FC:CF:C distribution would be away from the 
“regressive” shift; and the form-dominated responses 
(FC + FY + FV) would increase relative to the responses 
where form is secondary or lacking (CF + 
C + YF + Y + VF + V). 

Only cases diagnosed as having an acute schizo- 
phrenic reaction, regardless of type, and whose ill- 
nesses were of less than six months’ duration, and 
who were between the ages of 16 and 40 were in- 
cluded in this study. This approximates the range 
Henderson and Gillespie (1950) give as the ages 
within which two-thirds of the schizophrenias have 
their onset. Patients with chronic alcoholism, epi- 
lepsy, mental defect, physical handicaps from birth, 
psychosurgery, and those who were already under- 
going somatic therapy were excluded. The average 
age for the total group was 24.02 years; average 
education was 12.06 years of schooling. Only one pa- 
tient in this study was over 35 years of age. The 
present hospitalization was his second, with a remis- 
sion period of more than five years. Two other pa- 
tients also had been hospitalized previously; and in 
each instance an interval of more than two years 
had elapsed between hospitalizations, during which 
time the patients had made economic and social adjustments 
at their premorbid levels. In all, the data 
were obtained from the Rorschach records of 45 
cooperative acutely ill schizophrenic patients. The 
45 cases were consecutive hospital admissions selected 
only on the basis of diagnosis, age, and recency 
of onset of illness. Each patient in this study 
was examined both when he was acutely ill and 
when he left the hospital either as improved or un- 
improved. Each individual was both an experimental 
subject (S) and his own control. 

Since the general hypothesis of this study involved 
comparison of groups of improved and unimproved 
patients, changes in clinical condition had to be es- 
tablished. Two psychiatrists rated the patient, both 
on his admission to the hospital and again upon his 
departure from the hospital, for severity of illness. 
There was complete agreement between the psychia- 
trists that 12 of the patients showed no improve- 
ment in their clinical condition in the intervals be- 
tween the administration of the two tests. These 
cases became the unimproved group. There was 
agreement in the psychiatric judgments that 14 pa- 
tients showed “marked” improvement. The remain- 
ing 19 cases showed moderate or slight improvement 
in the opinions of both doctors. Thus, 33 cases were 
judged to show clinical improvement during their 
hospitalizations. 

In order to compare these groups on the experi- 
mental data, hypotheses that the improved and un- 
improved groups came from the same population 
were tested. When the statistical tests were made 
for age, education, length of hospitalization, kind of 
therapy, and length of time between examinations, 
the null hypotheses could not be rejected and the 
assumption that all patients came from a single 
population was held tenable. 

The changes in Rorschach indices between the 
first and second examination were the experimental 
data. A change indicating greater interest in others, 
or in the world, or better control of emotions was 
considered a positive change. If positive change was 
not shown, the case was classified as showing no 
change on a particular index. The direction of the 
change rather than the amount of change consti- 
tuted the experimental data. 

The positive changes for each index of withdrawal 
in each patient’s pair of Rorschach protocols 
were analyzed separately and the incidences of 
positive changes for the improved and unimproved 
groups were compared. In addition, the changes in 
Rorschach indices for each area of withdrawal were 
combined and analyzed as a single score for incidence 
of multiple changes in the improved and unimproved 
groups. Three multiple incidence values 
were thus possible for the area dealing with the attitude 
toward others: changes in both the M + H 
constellation and the FC:CF:C ratio, changes in 
only one of these indices, or no change in either 
index. In each of the two other areas dealing with 
withdrawal (attitude toward the world, expression 
and control of emotion) there were five possible 
multiple incidence values. Using the chi square statistic 
(McNemar, 1949) the incidences of positive 
Rorschach changes in the total group of improved 
cases (33 patients) were compared with the incidences 
of these changes in the unimproved group 
(12 cases). In the same way, the incidences of positive 
changes in the 14 cases showing marked improvement, 
and the 19 cases showing “some” improvement 
were compared with the incidences of 
these changes in the 12 cases showing no improvement 
(Table 1, Columns 2 and 3). A p value less 
than .05 was the level set in this study for rejection 
of a null hypothesis. Since only direction of change 
(positive, in the direction of “healthier” responses) 
was studied, the p values for the one-tailed test 
(Jones, 1952) were used. 

TABLE 1 

VALUES FOR SIGNIFICANCE OF CHANGES IN RORSCHACH INDICES OF WITHDRAWAL 
IN CLINICALLY IMPROVED VS. UNIMPROVED CASES 

                                          All Improved   Cases Showing          Cases Showing
                                          Cases          Marked Improvement     Some Improvement
Rorschach Index                           (N = 33)       (N = 14)               (N = 19)
Attitude toward People
  M + H                                   <.04           <.03                   <.15a
  FC:CF + C                               <.04           <.04                   b
  Combined Indices                        <.01           <.01                   <.05
Attitude toward Environment
  F+%                                     <.03           <.02                   b
  Content                                 <.02           [illegible]            <.01
  Popular                                 [illegible]    [illegible]            [illegible]
  Rejection-Denial, Self-Reference        <.15a          <.15a                  [illegible]
  Combined Indices                        <.01           <.01                   <.01
Expression and Control of Emotion
  F%                                      <.04           <.02                   <.15a
  8-9-10%                                 <.04           <.02                   <.15a
  FC:CF:C                                 <.01           <.01                   <.02
  FC + FY + FV:CF + C + YF + Y + VF + V   <.10a          <.03                   [illegible]
  Combined Indices                        <.02           <.01                   <.15a

a Trend toward significance. 
b Not significant. 
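
The one-tailed chi square comparison of incidences described in the Procedure can be sketched as follows. This is an illustration under stated assumptions: the cell counts are hypothetical, no continuity correction is applied (the text does not say whether one was used), and the one-tailed p is taken as half the two-tailed p when the difference lies in the predicted direction.

```python
import math

def chi_square_2x2(a, b, c, d):
    """Pearson chi square (1 df, uncorrected) for the fourfold table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

def one_tailed_p(chi2, in_predicted_direction):
    """Halve the two-tailed p when the observed difference is in the predicted direction."""
    two_tailed = math.erfc(math.sqrt(chi2 / 2))  # upper tail of chi square with 1 df
    return two_tailed / 2 if in_predicted_direction else 1 - two_tailed / 2

# Hypothetical counts for one Rorschach index (not study data):
# rows are improved / unimproved, columns are positive change / no change.
improved_pos, improved_none = 24, 9      # 33 improved cases
unimproved_pos, unimproved_none = 4, 8   # 12 unimproved cases

chi2 = chi_square_2x2(improved_pos, improved_none, unimproved_pos, unimproved_none)
predicted = improved_pos / 33 > unimproved_pos / 12
p_one = one_tailed_p(chi2, predicted)
```

Only the direction of change enters the table, which matches the study's decision to analyze incidence rather than amount of change.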


RESULTS 


The significance of the incidences of posi- 
tive changes in the groups showing improve- 
ment when compared with the unimproved 
group for each index and the combined in- 
dices is given in Table 1. The greater dis- 
criminatory power of the combined Rorschach 
indices corroborates Rorschach theory and 
practice which holds that individual scores 
must be combined and patterned out for 
meaningful interpretation. Incidences of posi- 
tive changes in the entire group of 33 im- 


proved cases were significantly greater for 7 
of the 10 individual Rorschach indices when 
compared with the incidences of positive 
changes in the 12 unimproved cases. The cases 
showing marked improvement showed signifi- 
cant changes on 8 of the 10 indices. In con- 
trast to this, the cases showing only “some” 
improvement showed significant changes on 
only 2 of the 10 indices. The greater clarity 
and higher instances of positive changes in 
the markedly improved group suggest that 
“pure” cases give findings more significant 
than less definitive cases even though the 
number of cases in a sample is considerably 
reduced by using only the pure cases. 


DISCUSSION 


The Rorschach changes which most signifi- 
cantly differentiated between the improved 
and unimproved cases were seen in the re- 
versals of the regressive shift in the FC:CF:C 
“ladder.” The increase in the FC responses 
suggests that with clinical improvement the 
ego functions more effectively and there is a 
greater control over emotions. Use of the 
qualitative changes in each patient’s record, 
such as changes from minus to plus signs in 
a given score, may have contributed to the 
high significance consistently shown for the 
association between the reversal of the re- 
gressive shift and clinical improvement. 

Cases evaluated as markedly improved clini- 
cally showed increased control of both emo- 
tional responsiveness and emotional tensions 
(increases in FC + FY + FV). The combined 
improved cases showed only a trend toward 
this change. The inference can be made that 
the more disturbing emotions indicated by 
shading and vista responses are especially 
difficult to deal with, since only the cases 
showing marked improvement can handle 
them effectively. 

The decrease in pathological content which 
was found to be significant in all improved 
groups suggests that the world becomes less 
hostile or barren to the clinically improved 
patient regardless of the degree of improve- 
ment he shows. The significant positive 
changes in the F + %, indicating greater 
ability to deal with reality, may be dynami- 
cally interlocked with the significant change 
found in decreased pathological Content. To- 
gether, these changes suggest that the world 
is “tested” more realistically by the patient 
because he is less afraid, and that he may be 
less afraid because of greater intellectual con- 
trol and ego strength. 

Two studies dealing with retest changes 
report negative findings when using Popular 
responses. Kelley, Margulies, and Barrera 
(1941) found fluctuations in the number of 
Popular responses given by amnesic patients 
when almost all other factors on retest were 
stable. Siegal (1948) found that Popular re- 
sponses “occurred diffusely” in patients show- 
ing improvement as well as in those failing to 
show improvement. Molish (1951) found that 
three-fourths of the failures to give Popular 
responses by schizophrenic patients occurred 
on cards which usually elicit human percepts. 
In light of the relationship found in the pres- 
ent study between the M + H index and clini- 
cal change, his findings suggest that we might 
have seen meaningful changes in Popular re- 
sponses if the analysis had been made in 
terms of their content rather than in terms 
of their total number in a protocol. 
Although prognostic signs were not the 
concern of this study, relevant data are avail- 
able from the analysis of the initial test rec- 
ords of the improved and unimproved cases. 
These groups did not differ significantly, upon 
admission to the hospital, in the instances of 
healthy and unhealthy responses on any of 
the Rorschach indices used in this study. 
F+% was slightly more disturbed in the 
patients who showed marked improvement 
later, but this index showed no trend toward 
discriminating between those cases who later 
improved and those who did not improve 
when the degree of improvement was undif- 
ferentiated. 


SUMMARY 


To study changes in withdrawal which oc- 
cur with clinical improvement, Rorschach in- 
dices of responsiveness to people, the environ- 
ment, and to the patient’s own affective life 
were selected. The indices reflecting activity 
in these three areas were derived from Ror- 
schach theory and were not empirically de- 
termined scores. The indices were essentially 
those combinations of scores and ratios 
weighed and evaluated according to their 
theoretical significance by the clinician when 
he interprets a Rorschach protocol. 

The Rorschach records of 45 cooperative, 
acutely ill schizophrenic patients between the 
ages of 16 and 40 were studied. These cases 
were consecutive hospital admissions selected 
only on the basis of diagnosis, age, and re- 
cency of onset of illness. Each patient in this 
study was tested both when he was acutely ill 
and when he left the hospital as either im- 
proved or unimproved. Each individual was 
both an experimental subject and his own 
control; changes between first and second 
protocols were analyzed. 


The findings in this study show that the 
Rorschach technique brings the functioning 
personality into focus so that the assumption 
that the instrument is sensitive to changes 
within an individual is justified. Combined 
indices were more significantly related to 
clinical improvement than the separate in- 
dices, except for the FC:CF:C ratio. This is 
in accord with Rorschach theory and prac- 
tice which hold that individual scores must 
be combined for meaningful interpretation of 
a protocol. 
Recent reports of impaired verbal abilities 
in psychomotor or temporal lobe epileptics 
have given new impetus to the continuing 
search for patterns of cognitive functioning 
associated with abnormal brain conditions. 
Quadfasel and Pruyser (1955) found that 
psychomotor epileptics had significantly lower 
Wechsler-Bellevue Verbal IQs in comparison 
to Performance IQs than did a group of 
grand mal epileptics. They further reported 
that memory for verbal material as measured 
by the Wechsler Memory Scale was signifi- 
cantly impaired for the psychomotor group 
while memory for designs was intact. Meyer 
and Jones (1957) confirmed the findings on 
the Wechsler-Bellevue in English patients and 
demonstrated that the deficit became greater 
subsequent to temporal lobe operation. Clini- 
cal evidence has mounted over many years to 
indicate that speech and communicative skills 
are associated with the temporal lobes, espe- 
cially that of the left or dominant hemisphere. 
Since most psychomotor epileptics have left 
temporal lobe foci (Gibbs & Gibbs, 1952), 
the clinical and experimental findings of ver- 
bal impairment in this group appear con- 
sistent. 

1 This report is based in part on a thesis sub- 
mitted by the junior author in partial fulfillment of 
the degree of Bachelor of Arts with distinction at 
Duke University, under the direction of the senior 
author. The authors are grateful to Joseph B. Parker, 
Chief, Psychiatric Service, Durham VA Hospital; 
and Bernard Stotsky and Paul Daston, Chiefs, Psy- 
chological Service, Durham VA Hospital, for their 
aid in carrying out this study. This research was 
supported in part by a grant (B-1459) to the senior 
author from the National Institute of Neurological 
Diseases and Blindness, USPHS; and was carried 
out in conjunction with a larger VA study, The 
Social and Emotional Adjustment of Epileptics, un- 
der the direction of Joseph B. Parker, William Wil- 
son, and Lever Stewart. 

2 Parsons is now at the Department of Psychiatry, 
Neurology, and Behavioral Sciences, University of 
Oklahoma Medical Center. 

The above results not only bear on the 
question of specificity and localization of 
brain functions but also seem to hold promise 
for the clinician who is faced with many 
perplexing diagnostic problems in connection 
with psychomotor epilepsy, e.g., differentiation 
from anxiety states, schizophrenia, psycho- 
pathic behavior, etc. However, before warrant 
may be given for generalization, the possible 
restrictions and conditions should be explored. 
One such restriction has already been raised 
by Meyer and Jones (1957), who found that 
only left temporal lobe epileptics manifested 
the verbal deficits; right temporal lobe sub- 
jects (Ss) did not. On the other hand, Quad- 
fasel and Pruyser report the same Verbal- 
Performance discrepancy in both right and 
left temporal lobe epileptics. Examination of 
previous studies suggested that variables such 
as unilaterality vs. bilaterality of lesion, 
“pure” psychomotor vs. psychomotor mixed 
with generalized seizure patterns have been 
confounded. One purpose of the present study, 
then, is to examine those cognitive abilities 
represented by the Verbal and Performance 
Scale scores of the Wechsler Adult Intelli- 
gence Scale (WAIS) in a population charac- 
terized by strictly unilateral temporal lobe 
EEG foci and pure psychomotor seizure pat- 
terns. 

A second aspect of the study is the ex- 
ploration of possible differences in pattern of 
cognitive abilities as represented by subtest 
scoring on the Wechsler among psychomotor 
epileptics, generalized seizure (grand mal) 
epileptics, and control Ss. That such a pat- 
tern may exist for Ss with temporal lobe 
dysfunction is suggested by Milner’s (1954) 
report of disturbances in visual comprehen- 
sion, as measured by Picture Arrangement 
type tests; in Reitan’s (1957) proposition 
that the Similarities subtest of the Wechsler 
is particularly affected by temporal lobe dam- 
age; and in Penfield’s stimulation studies 
(Penfield & Jasper, 1954) which have led 
him to postulate that the temporal lobes have 
son, and generalization. The specific pattern, 
then, may take the form of lower Similarities 
(generalization and comparison function), In- 
formation and Digit Span (remote and im- 
mediate memory), and Picture Arrangement 
(visual comprehension) for the psychomotor 
group in comparison to control (nonCNS 
damaged) Ss. 

Again, however, relevant variables must be 
considered. Degree of personality disturb- 
ance, a variable often ignored in studies of 
the brain damaged, seems pertinent. It is 
recognized that psychomotor epileptics mani- 
fest more personality disorders than other 
epileptic groups (Gibbs & Gibbs, 1952) and 
that personality disturbances may affect cog- 
nitive functions (Wechsler, 1958). It would 
appear that a nonepileptic, nonCNS damaged 
control group, which was equated for degree of 
personality disturbance, would give the clearest 
test for patterns on the Wechsler associated 
with temporal lobe epilepsy. A generalized 
seizure group who had foci in the same area 
as the psychomotor group would provide a 
test for the uniqueness of any pattern found 
in the latter group. However, it might be 
more reasonable to expect that groups with 
similar localization of lesions (as defined by 
EEG foci) would manifest a similar pattern 
of cognitive difficulties. In summary, the first 
two predictions to be tested are: 

1. Psychomotor and generalized seizure epi- 
leptics with EEG foci in the same general 
area (unilateral) have lower Verbal than 
Performance Scale scores on the WAIS. 

2. These epileptic patients also have a pat- 
tern of cognitive functioning characterized by 
lower Information, Similarities, Digit Span, 
and Picture Arrangement subtest scores, in 
comparison to a group of nonepileptic con- 
trols equated for degree of personality dis- 
turbance. 

A third purpose of this research is the in- 
vestigation of the effect of degree of abnor- 
mal EEG activity on the WAIS performance 
of the epileptic groups. Recent findings by 
Hovey and Kooi (1955) suggest that fluctua- 
tions in the level of responding (with conse- 
quent lowering of quality of response) on the 
Wechsler are related to the presence of sub- 
clinical paroxysmal EEG activity, the latter 
occurring in the absence of overt or clinical 
manifestations of seizures. It is anticipated 
that the relationships stated above will be 
more clearly established in those epileptics 
having a greater degree of abnormal EEG ac- 
tivity. 
METHOD 
Subjects 


The Ss in this study were 46 male, right-handed 
patients of the Durham VA Hospital. Patients were 
placed into three groups, Psychomotor, Generalized, 
and Control, on the basis of criteria stated below. 
These three groups were equated for age, race, and 
education and were not significantly different on 
Full Scale WAIS IQs (Table 1). 

Fifteen Ss were placed in the Psychomotor group 
on the basis of three criteria: (a) the presence in 
two or more electroencephalograms of interictal epi- 
leptiform discharge localized in only one of the tem- 
poral lobes (5 Ss with right temporal foci and 10 
with left foci comprised this group); (b) the pres- 
ence of seizures which were clearly epileptic and only 
psychomotor in nature, as determined by clinical ob- 
servation and/or histories elicited from the patient 
and relatives or acquaintances familiar with the pa- 
tient’s seizures;3 and (c) an age of onset of epileptic 
symptoms of 20 years or older. 

Sixteen Ss were selected for the Generalized group 
on the basis of type of seizure (generalized, tonic- 
clonic seizures) and the presence of focal epileptiform 
activity in two or more EEGs. Seven of these pa- 
tients had psychomotor seizures in addition to their 
grand mal symptoms. The foci of this group were 
mixed, although predominately temporal lobe (14 
out of 16 patients): 5 Ss had left temporal foci, 2 
right temporal, 3 bitemporal, 2 left frontotemporal, 
1 left parietal-temporal, 1 right temporal-occipital, 
1 left frontocentral, and 1 right frontal pole. Onset 
of seizure symptoms also was after 20 years of age. 
The differences between the Psychomotor group and 
the Generalized group on mean age of onset and on 
duration of disorder are not significant (Table 1). 

All epileptic patients were classified as to degree 
of abnormal EEG activity. “Abnormal” was defined 
as the presence of any of the following: spikes, sharp 
waves, sharp and slow waves complexes, multiple 


3 The authors wish to thank William P. Wilson 
and Lever F. Stewart, certified electroencephalog- 
raphers, who furnished the diagnostic information 
reported in this study. The EEG criteria were based 
on Penfield and Jasper (1954). 
TABLE 1 
DESCRIPTION OF THE CONTROL, PSYCHOMOTOR, AND GENERALIZED GROUPS 

                                                  Age of Onset   Duration 
               Age     Education   Full Scale IQ  Symptoms       of Disorder 
Control 
(N = 15) 
  Mean         31.5    9.2         94.1           -              - 
  SD           6.3     3.6         13.4           -              - 
Psychomotor 
(N = 15) 
  Mean         32.1    9.3         93.7           25.1           5.1 
  SD           10.4    5.9         13.6           10.0           4.2 
Generalized 
(N = 16) 
  Mean         32.2    9.0         88.5           29.3           4.6 
  SD           6.5     3.4         11.3           7.8            3.0 


Note.—None of the differences among groups is significant, as measured by t tests. 


spike bursts, and spike and wave discharges. Records 
were rated on a three-point scale ranging from 
minimal activity (occasional or rare occurrence) 
through moderate to marked (abnormal activity oc- 
curring almost all the time) by a board electro- 
encephalographer. In all other respects the neuro- 
logical examination of these patients was negative. 

The Control group consisted of 15 patients who 
had no evidence of CNS pathology and who were 
referred to the Psychological Service for psychodiag- 
nostic testing. These Ss were equated with the Psy- 
chomotor and Generalized groups on pertinent 
variables and for the degree and pattern of person- 
ality disturbance as manifested on the Minnesota 
Multiphasic Personality Inventory (MMPI). The 
MMPI T scores of the three groups, Control, Psy- 
chomotor, and Generalized, do not differ significantly 
(Table 2). The collective profile is marked by an 
elevated neurotic triad and psychasthenia and schizo- 
phrenia subscales. 


Procedure 


The Ss in the two epileptic groups were tested by 
the junior author on the WAIS, as part of a larger 
battery of tests. The WAIS was administered ac- 
cording to standard procedure (Wechsler, 1955). 


TABLE 2 
MEAN MMPI T SCORES OF THE CONTROL, PSYCHOMOTOR, AND GENERALIZED GROUPS 


Since another block design test was used in the 
larger battery, the Block Design subtest was omitted; 
therefore, the Performance Scale and Full Scale IQs 
reported for these Ss were prorated.4 The MMPI 
(Group Form) was administered to these Ss as part 
of the routine intake procedure of the Psychological 
Service. 

Control Group Ss were administered the WAIS 
and the MMPI by various examiners from the psy- 
chological service. 


RESULTS 


The means and standard deviations for the 
Performance Scale IQs, Verbal Scale IQs, and 
Full Scale IQs are presented in Table 3. A 
Lindquist Type I analysis of variance of the 
Performance Scale and Verbal Scale IQs re- 


4 As members of the general class of brain injured, 
epileptic Ss would be more likely to score lower on 
the Block Design than nonepileptic groups and thus 
reduce discrepancies between the expected lower 
Verbal than Performance scores. Thus, omission of 
the B-D should favor the predicted higher Perform- 
ance Scale. 


L F K Hs D Hy Pd Mf Pa Pt Sc Ma Si 
Control 56 60 5 7 T 5 O 5 5 6o 7 60: a 
Psychomotor 53 61 5 720 75 -B n SL 6 n B 6 a 
Generalized 50 55 51 75 72 69 64 55 56 64 65 63 4 


Note.—None of the differences among groups is significant as measured by U tests. 
TABLE 3 

MEAN PERFORMANCE IQ, VERBAL IQ, AND MEAN “PERFORMANCE MINUS VERBAL SCORE” 
FOR CONTROL, PSYCHOMOTOR, AND GENERALIZED GROUPS 

               Performance IQ   Verbal IQ   Performance-Verbal Score 
Control 
  Mean         94.2             93.7        +.5 
  SD           14.6             12.0        6.9 
Psychomotor 
  Mean         91.5             96.0        -4.5 
  SD           13.8             14.3        10.6 
Generalized 
  Mean         86.3             91.1        -4.8 
  SD           11.7             12.4        11.0 

Analysis of Variance 
                  df    MS            F 
Between groups    2     [illegible]   .93 ns 
Error (B)         43    [illegible] 
Between trials    1     [illegible]   [illegible] .05 
T X G             2     66.3          [illegible] ns 
Error (W)         43    46.9 


veals a significant trials effect. It is evident 
from inspection of the means that this dif- 
ference lies in the two epileptic groups, 
both of whom have approximately the same 
discrepancy, i.e., a lower Performance than 
Verbal IQ, which is opposite to the first pre- 
diction. Comparison of the left temporal and 
right temporal lesion cases in both epileptic 
groups indicates that the right temporal group 
(N = 10) has approximately the same Per- 
formance and Verbal IQs while the left tem- 
poral group (N = 16) manifests the discrep- 
ancy. 

To test the second prediction, a difference 
score was obtained by subtracting the mean of 
the scaled scores on Information, Similarities, 
Digit Span, and Picture Arrangement from 
the mean of the scaled scores for the remaining 
six subtests for each S. A positive score for the 
epileptic groups means a difference in the pre- 
dicted direction. In the last column of Table 4 
the difference score is presented for each 
group. The Generalized group differs signifi- 
cantly (t = 2.17, p < .05) from the Control 
group in the predicted direction, i.e., having 
a lower mean Information, Similarities, Digit 
Span, and Picture Arrangement score; the 
Psychomotor group does not. Examining the 
pattern of subtest scoring reveals that the 
weight of the difference between groups is 


TABLE 4 

MEAN WAIS SUBTEST SCORING OF THE CONTROL, PSYCHOMOTOR, AND GENERALIZED GROUPS 

             Infor-   Compre-   Arith-   Similar-   Digit   Vocab-   Digit    Picture      Picture       Object     Difference 
             mation   hension   metic    ities      Span    ulary    Symbol   Completion   Arrangement   Assembly   Score 
Control      9.5 10.3 9.3 9.8 9.2 8.5 6.8 10.0 8.9 = 
Psychomotor  Ot 10.5 10.1 9.1 8.7 8.2 6.5 9.7 7.6 30 73 
Generalized  7.9      9.8       9.7      7.2**      8.6     8.3      6.0      9.4          7.0†          7.2        +.7* 


* Generalized group significantly different from Controls, p < .05. 
** Generalized group significantly different from Controls, p < .01. 
† Generalized group different from Controls at .10 > p > .05. 
TABLE 5 

MEAN AGE, EDUCATION, PERFORMANCE, AND VERBAL IQs FOR CONTROL, MINIMAL, 
AND MODERATE-SEVERE ACTIVITY GROUPS 

                               Age    Education   Performance IQ   Verbal IQ 
Controls (N = 15)              31.5   9.2         94.2             93.7 
Minimal (N = 14)               31.8   7.9         92.6             91.3 
Moderate and Marked (N = 17)   36.1   9.9         85.6             95.6 


carried by Similarities (p < .01) and to a 
lesser extent by Picture Arrangement (.10 > 
p > .05). Information and Digit Span are not 
significantly different. 
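
The difference score defined above can be checked against the Generalized row of Table 4. The sketch below is only an illustration: it works from group means rather than the per-subject scores the authors used, it assumes the Generalized row reads 7.9, 9.8, 9.7, 7.2, 8.6, 8.3, 6.0, 9.4, 7.0, 7.2 (the decimal point in the first entry is a reading of the printed table), and the names `GENERALIZED`, `PREDICTED_LOW`, and `difference_score` are hypothetical.

```python
from statistics import mean

# Generalized-group subtest means as read from Table 4 (assumed reading).
GENERALIZED = {
    "Information": 7.9,
    "Comprehension": 9.8,
    "Arithmetic": 9.7,
    "Similarities": 7.2,
    "Digit Span": 8.6,
    "Vocabulary": 8.3,
    "Digit Symbol": 6.0,
    "Picture Completion": 9.4,
    "Picture Arrangement": 7.0,
    "Object Assembly": 7.2,
}

# The four subtests predicted to be lower in the epileptic groups.
PREDICTED_LOW = ("Information", "Similarities", "Digit Span", "Picture Arrangement")


def difference_score(scores, predicted_low=PREDICTED_LOW):
    """Mean of the remaining subtests minus mean of the predicted-low subtests."""
    low = [scores[name] for name in predicted_low]
    rest = [value for name, value in scores.items() if name not in predicted_low]
    return mean(rest) - mean(low)


score = difference_score(GENERALIZED)
print(f"Generalized difference score: {score:+.1f}")
```

Under these assumptions the six remaining subtests average 8.4 and the four predicted subtests average about 7.7, so the score is positive (the predicted direction) and agrees with the +.7 entered for the Generalized group.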

The effect of degree of abnormal EEG ac- 
tivity was tested by comparing the fourteen 
Ss, from both the Psychomotor and General- 
ized groups, rated as having minimal activity, 
to the 17 Ss falling in the moderate and 
marked categories. In Table 5 the means for 
the control variables and Performance and 
Verbal IQs are noted for the three groups. 
It is evident that the Minimal group tends to 
be lower in education and the Moderate- 
Marked group is somewhat older. The latter 
variable, however, is not likely to produce an 
effect since both Verbal and Performance 
IQs are adjusted for age (Wechsler, 1958). 
In light of a near significant F (.10 > p > 
.05) for differences between the two abnor- 
mal activity groups on education and the find- 
ing of a correlation (r) of +.59 between edu- 
cation and Performance IQ, an analysis of 
covariance was performed (Edwards, 1950). 
The result of this analysis for the Perform- 
ance IQ indicates a highly significant differ- 


TABLE 6 
ANALYSIS OF COVARIANCE OF PERFORMANCE 
AND VERBAL IQs IN ABNORMAL EEG 
ACTIVITY GROUPS 

                   df    MS        F 
Performance IQ 
  Total            29 
  Within           28    54.11 
  Adjusted Mean    1     1381.00   25.52** 
Verbal IQ 
  Total            29 
  Within           28    94.76 
  Adjusted Mean    1     44.9      .474 

** p < .001. 


ence between groups, the Marked-Moderate 
group having a much lower Performance IQ 
than would be expected for their educational 
level in comparison to the Minimal group. 

Analysis of covariance for the Verbal IQ, on 
the other hand, revealed no differences. 
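
The F ratios in Table 6 can be recomputed from the mean squares, since each is the adjusted-means mean square divided by the within-groups mean square. The following is a minimal sketch under that standard definition; `f_ratio` is a hypothetical helper, and the Verbal IQ entry is taken to be .474 on the assumption that the decimal point was lost in printing.

```python
def f_ratio(ms_effect: float, ms_error: float) -> float:
    """F statistic as the ratio of an effect mean square to its error mean square."""
    return ms_effect / ms_error


# Mean squares as printed in Table 6 (df = 1 for adjusted means, 28 within).
performance_f = f_ratio(1381.00, 54.11)
verbal_f = f_ratio(44.9, 94.76)

print(f"Performance IQ: F(1, 28) = {performance_f:.2f}")
print(f"Verbal IQ:      F(1, 28) = {verbal_f:.3f}")
```

Both ratios agree with the tabled values, and a Verbal IQ ratio below 1 is consistent with the statement that the covariance analysis revealed no difference on that scale.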


DISCUSSION 


The finding that the epileptic groups have 
lower Performance than Verbal IQs, contrary 
to the first prediction, raises a number of ques- 
tions concerning generalization from previous 
findings. Lateralization of lesion could not ex- 
plain these results since the discrepancy was 
more pronounced in the left temporal focus 
group as opposed to the right sided focus pa- 
tients. Equating the Control group to the epi- 
leptic groups on degree and pattern of per- 
sonality disturbance, while admittedly only 
an approximation, could have contributed to 
between group differences but not to the ob- 
tained within group differences in the epi- 
leptics. In seeking to illuminate the possible 
cause for the discrepancy between our results 
and others, three variables appear important. 

First, the cortical functioning of our popu- 
lation of temporal lobe epileptics was prob- 
ably less severely disturbed than other groups. 
In contrast to the present group, the Meyer 
and Jones (1957) epileptics were candidates 
for temporal lobe surgery. Quadfasel and 
Pruyser’s (1955) patients had a longer dura- 
tion of illness and earlier age of onset. While 
this might explain no difference between 
groups, it could hardly account for the sig- 
nificantly lower Performance Scale IQs. 

Second, our population has a lower Full 
Scale IQ than the previous studies. To esti- 
mate the possible effect of this variable, the 
published data of Meyer and Jones (1957) 
were analyzed (preoperative scores). By tak- 
ing the Full Scale IQ as the best overall meas- 
ure of the general level of intelligence, the 
left-sided lesion group of patients was di- 
vided at the median into High and Low IQ 
groups. The High IQ group (N = 10) aver- 
aged 9.8 points lower on the Verbal Scale 
relative to Performance IQs while the Low 
IQ group (N = 10) had a .2 difference. This 
difference between groups is significant at the 
.05 level by U test. Since the patients used by 
Quadfasel and Pruyser (1955) had an even 
higher average IQ than Ss used by Meyer 
and Jones and the present study, it may be 
that the verbal deficit reported in preoperated 
temporal lobe epileptics holds only in high IQ 
groups. The complexity of this issue is em- 
phasized by the recent report of the preop- 
erative Wechsler-Bellevue Scores for left and 
right sided temporal lobe lesion groups by 
Milner (1958). In this study, using compara- 
tively high IQ groups, the preoperative Verbal 
and Performance IQs did not differ for the 
left-sided group, but the right-sided group had 
significantly lower Performance IQs! 

The third variable is the difference in test 
instrument. The previous studies used the 
Wechsler-Bellevue, while the present study 
employed the WAIS. Studies cited by Wech- 
sler (1958, pp. 114-117) indicate the possi- 
bility of Verbal-Performance IQ discrepancies 
with the WAIS in comparison to the Wechsler- 
Bellevue. The one study reporting both tests 
on the same 50 Ss, drawn from a university 
counseling center, indicates that WAIS Per- 
formance IQs averaged 4.52 points lower than 
the Verbal Scale IQs. However, no difference 
existed between the WB Verbal and Perform- 
ance IQs. A second study found Performance 
IQs significantly lower (5.69 IQ points) than 
Verbal IQs in 154 psychiatric patients, but 
again found no difference between the WB 
Verbal and Performance IQs in 392 compa- 
rable patients. Similar Verbal-Performance IQ 
discrepancies are reported for a sample of 130 
normal, aged Ss on the WAIS (Eisdorfer, 
Busse, & Cohen, 1959). 

The average Performance minus Verbal IQ 
difference in the epileptic groups in this pres- 
ent study is —4.65, almost the same as that 
noted above. To check the possibility that 
our epileptic Ss may not differ significantly 
from the general population of the hospital, 
the WAIS records of 100 patients under 60 
TABLE 7 

MEAN AGE, EDUCATION, PERFORMANCE, AND VERBAL 
IQs OF 100 MALE HOSPITALIZED SUBJECTS WITHOUT 
DIAGNOSIS OF ORGANIC BRAIN DAMAGE 

        Age     Education   Performance IQ   Verbal IQ   Performance-Verbal 
Mean    33.98   9.01        86.69            90.19       -3.50* 

* Significant at the .01 level. 


years of age who had no diagnosis of brain 
damage, and who had not been used in the pres- 
ent study were selected at random from the 
Durham VA Hospital files. As indicated in 
Table 7, this group is comparable in age and 
education to the groups in the present study. 
The results are clear: the Performance IQ is 
3.50 points below the Verbal IQ, a significant 
difference (p < .01). The Performance-Verbal 
discrepancies of the two groups of epileptics 
do not differ significantly from this larger 
group. 
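
The Performance minus Verbal figures quoted in this passage follow directly from the tabled means. The sketch below simply redoes that arithmetic with the means from Tables 3 and 7; the dictionary and function names are hypothetical, and the epileptic average is taken as the unweighted mean of the two group differences, which is an assumption about how the reported figure was obtained.

```python
# (Performance IQ mean, Verbal IQ mean) as given in Tables 3 and 7.
GROUP_MEANS = {
    "Psychomotor": (91.5, 96.0),
    "Generalized": (86.3, 91.1),
    "Hospital sample": (86.69, 90.19),
}


def performance_minus_verbal(performance_iq: float, verbal_iq: float) -> float:
    """Signed discrepancy; negative values mean Performance IQ is the lower scale."""
    return performance_iq - verbal_iq


differences = {
    name: performance_minus_verbal(*means) for name, means in GROUP_MEANS.items()
}

# Unweighted average of the two epileptic groups (assumed averaging method).
epileptic_average = (differences["Psychomotor"] + differences["Generalized"]) / 2

for name, diff in differences.items():
    print(f"{name}: {diff:+.2f}")
print(f"Epileptic groups, average: {epileptic_average:+.2f}")
```

On these figures the epileptic groups average about 4.65 points lower on Performance than Verbal, against 3.50 points for the 100 hospitalized Ss, which is the comparison the text draws.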

From these results it is concluded that our 
epileptic groups do not manifest the previ- 
ously reported preoperative Performance-Ver- 
bal IQ discrepancies. Further, should other 
investigators using different populations sub- 
stantiate the lower Performance IQ relative 
to Verbal IQ reported in the four studies 
cited above, the implication for the clinician 
is obvious: some readjustment in the concept 
of the magnitude of the discrepancy neces- 
sary before differences in those abilities rep- 
resented by these scales can be inferred. 

Confirmation of Prediction 2 in the Gener- 
alized group vs. Controls suggests that pat- 
tern of subtest performance may reflect the 
effects of cortical dysfunction of an epileptic 
nature in patients who may not be manifest- 
ing behavioral signs of epilepsy at the time of 
intelligence testing. It is of interest to note 
that the two subtests which discriminated 
best, Similarities and Picture Arrangement, 
have been identified by previous investiga- 
tors as susceptible to temporal lobe malfunc- 
tion (Milner, 1954; Reitan, 1957). 

Further, the effect of degree of abnormal 
activity is noteworthy (Tables 5 and 6). The 
Moderate-Marked Activity group had a much 
lower Performance IQ than Verbal while the 
Minimal Activity group had practically identi- 
cal Verbal and Performance IQs. The pres- 
ence of subclinical neurophysiological dis- 
turbances may have a more telling effect on 
the timed tests constituting the Performance 
Scale. Indeed, all the Performance subtests 
were lower for the Moderate-Marked group 
than the Minimal Activity group despite the 
former’s educational advantage. Such results 
are in accord with the impaired attentiveness 
and deficits in sustained serial activity re- 
ported in similar groups by Hovey and Kooi 
(1955); Morrell (1956); and Rosvold, Mir- 
sky, Sarason, Bransome, and Beck (1956). 


SUMMARY 


Patterns of cognitive functioning on the 
WAIS were studied in three groups of Ss: 
15 psychomotor (temporal lobe) epilep- 
tics with unequivocal unilateral EEG foci; 16 
generalized seizure patients with temporal 
lobe foci; and 15 nonCNS damaged Con- 
trols. Ss were matched on relevant variables 
and were similar in degree and pattern of 
personality disturbance (MMPI). Contrary 
to previous reports, the Psychomotor group 
does not manifest a deficit in Verbal IQ rela- 
tive to Performance IQ. The Generalized 
group is significantly lower, as predicted, on 
a combined Information, Similarities, Digit 
Span, and Picture Arrangement Score. Sig- 
nificantly lower Performance IQs are found 
in the moderate and marked abnormal EEG 
activity group compared to the minimal ac- 
tivity group, regardless of seizure pattern. 

From data based on 100 hospitalized Ss in 
this study and studies cited by Wechsler, it 
is suggested that, unlike the Wechsler-Belle- 
vue, the Performance IQ on the WAIS tends 
to be three to five points lower than the 
Verbal IQ. 
ON THE BREAKDOWN OF THE SENSE OF REALITY: 
A COMMENT 


PHILIP A. GOLDBERG 
University of Buffalo 


Hozier (1959) has attempted to answer a 
difficult question. Accepting the breakdown of 
reality as the central fact of schizophrenia, 
she hypothesizes that, “the loss of the sense 
of reality in the schizophrenic individual in- 
volves a breakdown in the bodily self as a 
consequence of insufficient cathexis” (p. 186). 
As a test of this hypothesis Hozier investi- 
gated the spatial perception of schizophrenics 
by means of a Figure Placement task, a Doll 
task, and a Draw-a-Person task. Her results, 
which she accepts as confirming her hypothe- 
sis, show that schizophrenics make signifi- 
cantly more errors on these tasks than nor- 
mal controls. 

Unfortunately for the clarity of her results 
there are methodological and theoretical diffi- 
culties which she has either overlooked or too 
cavalierly dismissed. There has been much 
work reported in the literature to indicate 
that the generally poor performance of schizo- 
Phrenics can, in large part, be attributed to 
motivational variables. Cavanaugh (1958) 
found this to be the case with concept for- 
ħation tasks, and Lang (1959) i obtained 
similar results investigating reaction time. 
Lang (1959) believes that “Shakow’s concept 
[1950] of an inability to maintain set appears 
to fit the data” (p. 267). Hozier men-
tions Shakow’s concept but dismisses it on
the grounds that her experimental group con- 
sisted of patients judged to be cooperative by 
their ward psychiatrist.

One has no way of knowing or evaluating 
the adequacy of the ward psychiatrists’ rat- 
ings of cooperativeness. Hozier does not tell 
us the procedures used to obtain these rat- 
ings, nor does she specify the variables that 
entered into the determination. One might 
speculate that the demands of custodial care 
might lead ward psychiatrists to equate trac-
tableness with cooperativeness. Without pur-
suing this speculation, it would seem fair to
say that when discussing schizophrenic per-
formance, problems of motivation and set are
too important for the reader to be confident 
that Hozier’s solution is adequate. 

The literature demonstrating schizophrenic 
deficit is too voluminous to need citation here. 
Hozier did not merely attempt to indicate an- 
other area of deficit; her attempt was to 
demonstrate the relationship between reality 
loss and “the spatial problem of dealing with 
the body ... and the relationship of the 
body to the world” (p. 194). But how would 
her schizophrenics perform in other task situa- 
tions that do not specifically involve such 
spatial perception of the body? It is pos- 
sible, perhaps likely, that they would show 
deficit. But whatever the case, Hozier’s study 
does not have this control and the reader 
must question the correctness of her conclu- 
sion, which relates the schizophrenics’ im- 
paired performance to the specific problem 
of the spatial perception of the bodily self. 
Hozier has arbitrarily explored a single facet 
of a multifaceted problem to emphasize bodily 
self as the primary model of reality. A more 
parsimonious explanation of her data might 
be formulated in terms of attitudinal and at- 
tentional variables presumed to operate in a 
wide range of task situations. 

Hozier does raise tangentially this ques- 
tion of confounding when she states, “what 
part other variables may play in accounting 
for the significant results cannot be deter- 
mined by the present study” (p. 193). But 
here it seems her primary concern is with the 
adequacy of her task situations. She dis- 
misses the question even before she raises it 
when she says, “certainly, the tasks are re- 
lated to spatial relations and organization” 
(p. 193). Accepting on faith the adequacy 
of her tasks, Hozier’s recognition that other 
variables might play a role in her results 
emphasizes the point made above. It is not
necessary to question that some relationship
exists between her tasks and spatial organi- 
zation; it is necessary to question the con- 
trol of the “other variables” that may also 
be related to the tasks and which might have 
made Hozier’s results significant. “Other vari- 
ables” always exist potentially, and no one 
has the right to demand of an experimenter 
that he exhaustively anticipate these vari- 
ables. However, when the accumulated re- 
search of other workers implicates particular 
classes of variables as being significantly re- 
lated to a phenomenon, it is the responsibil- 
ity of the experimenter investigating this phe- 
nomenon to control and account for these 
variables. 

There are other questions that might be 
raised about this study. Hozier states that pa- 
tients who were receiving insulin coma or elec- 
tric shock treatment were excluded from her 
experimental group. Presumably this was done 
to insure that her subjects’ (Ss’) performance 
would not be hampered by extraneous fac- 
tors. But it is widely recognized that one of 
the major obstacles in doing research with 
hospital patients is the difficulty in getting 
patients who are not “on drugs.” Hozier does 
not report whether the schizophrenics in her 
experimental group were receiving drugs. If 
they were, we can only speculate as to the 
probable effect on performance. 

Another question might be raised about 
Hozier’s use of one-tailed tests. Goldfried 
(1959) would question her criterion for the 
selection of a one-tailed test, carrying with 
it the possible increase in the level of signifi- 
cance. Nevertheless, this issue is controversial 
and though this writer would agree with Gold- 
fried, it is perhaps not a crucial criticism. 
However, other criticisms already made do 
seem to be crucial. 
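
The practical stake in the one-tailed question can be illustrated with a short calculation. The sketch below assumes a test statistic that is approximately standard normal under the null hypothesis and uses a hypothetical value of z = 1.80 (not taken from Hozier's data). Under those assumptions the same statistic falls short of the .05 level on a two-tailed test yet passes it on a one-tailed test.

```python
import math

def tail_probabilities(z):
    """Return (one_tailed_p, two_tailed_p) for a standard normal statistic.

    The one-tailed value assumes the effect was predicted in the positive
    direction, so only the upper tail counts as evidence.
    """
    upper_tail = 0.5 * math.erfc(z / math.sqrt(2))
    two_tailed = math.erfc(abs(z) / math.sqrt(2))
    return upper_tail, two_tailed

# Hypothetical statistic, chosen only for illustration.
one_p, two_p = tail_probabilities(1.80)
print(f"one-tailed p = {one_p:.3f}, two-tailed p = {two_p:.3f}")
```

For a statistic in the predicted direction the one-tailed probability is half the two-tailed one, which is the source of the "possible increase in the level of significance" at issue.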

But it would be unfair and scientifically un- 
profitable to ignore the value inherent in 
Hozier’s work because of what seem to be 
certain significant weaknesses. Those features 
believed to be methodologically weak are not 
intrinsic to the theoretical problem at issue. 
Had the Ss been asked to draw three dimen- 
sional geometric figures in addition to draw- 
ing a person, the study might have been 
tightened considerably. Such a task would
presumably not involve “the spatial problem
of dealing with the body,” but it would ap-
pear to tap essentially the same motoric and
spatial skills. Thus, it might provide one 
needed control of set variables. Certainly this 
is just one possible way, and undoubtedly not 
the best, to make Hozier’s study a more 
rigorous investigation. 

Hozier has proceeded with an interesting
theoretical approach to attack imaginatively 
a difficult problem. This is not meant to be 
the traditional sop with which critics con- 
clude their papers. Hozier’s study is worth 
criticizing because of its fundamental value. 
Should a more tightly designed study along 
essentially the same lines as Hozier’s work 
obtain similar findings, its implication for 
ego psychology and for an understanding of 
schizophrenic processes would be of undeni- 
able worth. Additionally, such a study might 
help to resolve the controversy over the na- 
ture of schizophrenic deficit. It has been sug- 
gested that schizophrenic deficit represents a 
withdrawal response to threatening stimuli 
(Rodnick & Garmezy, 1957). Cavanaugh 
(1958), Lang (1959), Hunt and Cofer (1944), 
and others have stressed the motivational 
variables involved in schizophrenic deficit. In 
short, we have yet to understand why the 
schizophrenic behaves as he does. Hozier’s 
work expands a new approach to an old 
question. We need new approaches.

A REPLY TO GOLDBERG


ANN HOZIER 


Veterans Administration Mental Hygiene Clinic, Los Angeles 


Goldberg, after a thoughtful and careful 
review of my article, raises some cogent criti- 
cisms. His main and perhaps his most crucial 
criticism is of my failure to control for moti- 
vational variables. In support of this criticism, 
he cites experimental work which concludes 
that the poor performance of the schizo-
phrenic can be largely accounted for by
motivational variables. Attractive as this ex-
planation may be, I see it as being tautologi-
cal, in that by definition we characterize
schizophrenics as being motivationally im-
paired. They are apathetic, withdrawn, dis-
interested, distractible, lacking in goal direc-
tion, etc. If we attempt to account for their
behavior in terms of motivational variables,
we are only attempting to affirm our defini-
tion. Such an interpretation is qualitatively
similar to saying that a mental defective’s 
lack of intelligence hampers his performance. 
Perhaps the kind of question we should ask 
is what underlies the impairment of the abil- 
ity to maintain set and attend? Do motiva- 
tional variables constitute the necessary and 
sufficient conditions for developing a stable 
sense of reality? Our hypothesis is that the
ability to attend, to direct one’s energies to 
a task, to make adequate use of stimuli, to 
think abstractly, etc., is dependent upon an 
adequate differentiation of body/world or, in 
our terms, on a stable, well-cathected bodily 
self. As long as this differentiation is not 
clearly delineated, then true subject-object 
relations cannot exist on a stable consistent 
level. Motivated behavior, in its true meaning, 
is predicated on the capacity of the subject 
(S) to relate, to direct himself to an object. 
We assume that the schizophrenic is incapable 
of sustained motivated behavior. That is, he 
does not experience himself as an S in rela- 
tionship to an object with any consistency as 
a consequence of insufficient cathexis of his
bodily boundaries. He does have affective
cathexis available to him, but it is attached 
to various and sundry, isolated things. 

I assumed that the schizophrenics would 
have no vested interest in the tasks—that 
they would not be motivated because of the 
very nature of their illness. The word co- 
operative, rather than motivated, was used 
to indicate that the interest of the schizo- 
phrenic was solicited rather than spontane- 
ously given. I further assumed that coopera- 
tion was not a global trait. We all have 
observed that the schizophrenic, like most 
people, will be cooperative with one person 
and not with another or will do one task and 
not another at request. The psychiatrists were 
asked to select patients who met the selection 
criteria (Hozier, 1959, p. 186) and whom 
they judged would be cooperative with the 
investigator in doing the particular tasks. The 
psychiatrists were aware of the nature of the 
tasks, but were not aware of the predictions. 
They selected 36 patients, some of whom re- 
quired maximum supervision because of their 
destructive and assaultive behavior. Other pa- 
tients resided on an open or semiopen ward. 
Some of these patients were either already 
working in the community or in the process 
of obtaining positions outside of the hospital.
The records of the 36 patients were reviewed, 
and it was found that 3 of them did not meet 
one of the selection criteria. The patients were 
seen in the order of their appearance on the 
list. Of the 26 Ss contacted, the cooperation 
of 1 S could not be obtained. 

Are clinical evaluations of cooperativeness 
adequate? Did the investigator fail to see 
evidences of uncooperativeness due to her 
own involvement in the research? Unfortu- 
nately, an unequivocal answer to these ques- 
tions cannot be given. It is the investigator’s 
impression that the schizophrenics’ failures or
inadequacies in dealing with the tasks were
not largely a consequence of their unwilling- 
ness to attempt to deal with them. 

Goldberg raises an important question when
he asks how would schizophrenics perform on
tasks not specifically involved in the spatial 
perception of the body. Theoretically, as long 
as the body is not sufficiently cathected to 
become bounded and differentiated from 
everything that is not the body, there exists 
no frame of reference from which to judge 
the reality of events in the external world or 
the reality of one’s own psychological experi- 
ences. The conventional dimensions of space 
are body dimensions. What is called up or 
down, to the left or right, or to the front 
or behind does not depend on the position of 
the object, but on the position of the body 
in relationship to the object. Thus, we would 
hypothesize that if the boundaries of the body 
are not sufficiently cathected, then there will 
be a disturbance in spatial perception of non- 
human as well as human objects. We have 
seen this, for example, in the schizophrenic’s 
perception of the vertical (Carini, 1955), in 
his handling of the Bender-Gestalt (Bender, 
1938), in his dealing with the Body Space 
Test (Zierer, 1950), etc. 

I would expect if he were given, for ex- 
ample, a formal test of spatial perception or 
geometry problems in which he can depend 
on his knowledge of rules, propositions, and 
theorems of handling space that he would 
have significantly less difficulty than if the 
task required that he rely on his experiencing 
his relationship to and in space. Perhaps it 
will make it clearer to illustrate the distinc- 
tion between reality testing and a sense of 
reality. Depersonalization, estrangement, and 
hypochondriasis, often early signs of schizo-
phrenia, exemplify this difference. The person
may experience himself as changed, experi- 
ence his world as changed, or experience him- 
self as physically sick, yet at the same time, 
know that he is the same person, that the 
world hasn’t changed, and that his physical 
health is unimpaired. Both reality testing and 
a sense of reality are necessary for dealing 
with the world, but in general, reality testing 
refers to testing something via the intellectual 
processes, and a sense of reality, of experi-
encing something affectively. The more the
task can be adequately dealt with by a pure
knowledgeable approach, the less, it is hy- 
pothesized, will the schizophrenic fail. Had I 
used the type of control mentioned above, it 
undoubtedly would have tightened the re- 
search considerably. 

Goldberg is correct that I was concerned 
with the adequacy of the tasks. When he says,
“It is not necessary to question that some
relationship exists between her tasks and spa- 
tial organization; it is necessary to question 
the control of the ‘other variables’ that may 
also be related to the tasks and which might 
have made Hozier’s results significant,” I be- 
lieve his concern here is again with the moti- 
vational variables. I have already attempted 
to indicate why I am not concerned about 
these variables in the same way as he. 

Recent ECT and Insulin Coma Therapy 
were controlled not because they were ex- 
traneous factors per se, but because of the 
confusion which so often accompanies their 
use. Confusion has not been noted as a con- 
sequent of tranquilizers unless used in toxic 
dosages. In those cases, the patients would 
have had neurological signs and on that basis 
have been excluded. The confusion the pa- 
tients had was part and parcel of their illness. 
Some of the patients were on drugs and some 
not; however, the differences in their perform- 
ances were not analyzed. I would predict that 
of the patients having an equal degree of 
disturbance those on tranquilizers would per- 
form more adequately than those not on 
them. 

As Goldberg indicates, there has been con- 
siderable controversy about the use of the 
one-tailed test. As to the criteria for its use, 
I continue to agree with Jones (1954, p. 586). 

Goldberg has raised some significant and 
crucial questions about my research. Although 
I disagree with him on his major point of the 
control of motivational variables, I would 
agree that some question may be raised about 
the adequacy of clinical judgments of co- 
operativeness. However, as I have thought of 
this criticism, I must admit that I find it 
difficult to conceive of an adequate way to
control this variable which at some level or
other does not lean heavily on clinical judg-
ment. His point about the use of a control
task is well made. I would expect the use of
drugs to exert its effect in the direction of
nullifying the hypotheses. Thus, I doubt that 
the lack of control of drugs contributes to 
the significant results. 

There is considerable need for further re- 
search not only to cross-validate my findings 
with some of the controls mentioned above, 
but to investigate other hypotheses derived 
from the theory. It is impossible for me to 
state the importance of the bodily self in a 
clearer way than Scott (1951), when he says, 


The body image is not just an object of study in 
the sense that science has made so many things 
objects of study. It is essentially something that one 
lives through and in. To-day when there is so much 
disembodied knowledge around, and so many psy- 
chological forces loose in the world belonging to 
no one, and when we are in very real danger of 
being devoured by our own creations, it seems sig-
nificant that the human image in the midst of all
of these should be brought to the forefront, for it
is in the last resort the only mediator of them 
(p. 266).


SEX-ROLE PREFERENCES IN THREE- AND
FOUR-YEAR-OLD CHILDREN 


WILLARD W. HARTUP AND ELSIE A. ZOOK
Iowa Child Welfare Research Station 


The rubric of sex-role differentiation has 
been used to cover both the processes leading 
to sex differences in overt behavior (e.g., boys 
are more physically aggressive than girls) 
and the introjection of values and attitudes 
that are more appropriate in a given culture 
for one sex than the other. A recent attempt 
at clarifying the concept of sex-role differen- 
tiation has been made by Brown (1956, 1957) 
with the assistance of Robert Sears, and Lynn 
(1959). These authors have distinguished 
among sex-role identification, sex-role prefer- 
ence, and sex-role adoption. The development 
of masculinity and femininity is thus defined 
in terms of at least three logically separated 
psychological processes. The present study 
was concerned only with sex-role preference 
which has been defined by Brown (1956) as 
“behavior associated with one sex or the other 
that the individual would like to adopt or that 
he perceives as the preferred or more desirable 
behavior.” 

The sex-role preferences of children have 
been measured in two ways. Rabban (1950) 
developed a technique which consisted of ask- 
ing children to state their preferences from a 
group of 16 toys, 8 commonly associated with 
girls and 8 associated with boys. The admin- 
istration of this test to 300 children from 
three through eight years of age showed that: 
(a) boys possess clear-cut preference patterns 
at an earlier age than girls; and (b) strong 
appropriate preference patterns appear earlier 
among lower-class children than among mid- 
dle-class children. 

Brown (1956) measured children’s sex-role 
preferences by means of a projective test 
called the It Scale for Children. In taking this 
test the child chooses between pictures of 
various objects commonly associated with one
sex or the other (e.g., toys, clothes, household
objects, games, etc.). The choices are not 
made for the child himself but for “It,” a 
drawing of a sexless figure. Brown has re- 
ported normative data for 146 kindergarten 
children from Denver, Colorado (1956) and 
613 children from kindergarten through fifth 
grade from Pleasanton, California (1957). 
These data have shown that: (a) distinctive 
sex-role preferences existed for boys and for 
girls at all ages studied; (b) kindergarten 
boys were masculine in their preferences but 
older boys were even more masculine in their 
preference scores; (c) kindergarten girls had 
“mixed” preferences and older girls slightly 
masculine preferences; (d) at all age levels 
girls’ preference scores were more variable 
than boys’ scores. 

The present study was designed to extend 
Brown’s work with the It Scale for Children. 
Preferences of the kind measured by the It 
Scale are undoubtedly acquired before the 
child is of kindergarten age. One purpose of 
this investigation was to obtain information 
concerning the sex-role preferences of three- 
and four-year-old children. 

A second purpose of the present study was 
to explore the effects of the instructions em- 
ployed with the It Scale. It has been men- 
tioned previously that the subject’s (S’s) 
choices are made for an ambiguous drawing 
of a child rather than for S himself. The simi- 
larity of the pictured figure to S has been 
shown to be a relevant variable to children’s 
performance on some projective tests (Arm- 
strong, 1954; Bills, 1950; Furuya, 1957) but 
irrelevant to performance on others (Biers-
dorf & Marcuse, 1953). If “stimulus-to-sub-
ject similarity” should affect performance on
the It Scale, it is obvious that any one pro-
cedure would not reveal all the significant
facts concerning the sex-role preferences of
children.


Three different sets of verbal instructions 
were employed with the It Scale in the pres- 
ent study. These instructions were designed 
to produce variations in the test situation 
along a dimension from low to high stimulus- 
to-subject similarity. 


METHOD 
Subjects 


The Ss for this investigation were 161 children 
between the ages of 3-0 and 5-0. Ss were obtained 
from four populations: a community-supported day 
care center, two cooperative nursery schools, and 
one university laboratory nursery school. This sam-
ple included all children enrolled in the four nursery
schools except six children who were absent during 
testing, one child who was a new enrollee, and one 
child who refused to leave the school group with 
the experimenter (E). The sample included 78 girls 
and 83 boys. Sixty-five of these Ss constituted the 
three-year-old group. Their ages ranged from 3-0 
to 4-0 with a mean age of 3-7. Ninety-six Ss, 
ranging in age from 4-0 to 5-0 with a mean age 
of 4-6, made up the four-year-old group. 

Ss from each school were assigned to three in-
struction groups (Groups A, B, and C). Assignment
of Ss was random except that the groups were
matched as closely as possible for sex and age.
Subsequent to the collection of the data, 11 of the
161 Ss were dropped in order to create proportional
cell entries in an analysis of variance table. Eight
of these Ss were girls; three were boys. These drop-
outs were selected randomly from the subgroups
containing an excessive number of Ss.


Procedure 


The It Scale for Children was administered indi- 
vidually by the second author to each S in a private 
room at the nursery school. E began each session 
by giving S the card showing the sexless child. The 
verbal instructions were as follows: 

Group A (low similarity to S):

We are going to play a game with this child,
O.K.? So this game will be about “It.” Here you
hold “It.” Now we are going to show this child,
whose name is “It,” some cards with pictures on
them.

1 The authors are grateful to the staffs of the
following schools for their cooperation: Day Care 
Service, Inc., Des Moines, Iowa; Plymouth Congre- 
gational Church Nursery School, Des Moines, Iowa; 
Parents’ Cooperative Preschool, Iowa City, Iowa; 
Preschool Laboratories of the Iowa Child Welfare 
Research Station, State University of Iowa, Iowa 
City, Iowa. 

2 These instructions were identical to those de- 
veloped by Brown (1956). 


Throughout the administration of the test E referred 
to the drawing as “It.” 


Group B (moderate similarity to S): 


We are going to play a game with this little boy 
(girl if S were a girl). This game will be about 
the little boy. Here, you hold him. Now we are 
going to show this little boy some cards with pic- 
tures on them. 


E continued to use the appropriate noun or pro- 
noun throughout the test. 


Group C (high similarity to S): 


We are going to play a game with this little boy
whose name is ______ (S’s name). His name is
______ and this game will be about ______. Here, you
hold ______. Now we are going to show ______
some cards with pictures on them.


E continued to use S’s name during the test. 


The various sections of the It Scale were adminis- 
tered according to Brown’s manual (1956). However, 
the 16 pictured toys from which S chose during the 
first section of the test were presented in groups of 
four rather than in one group of 16 pictures. The 
pictures were grouped in this manner to insure that 
the young Ss would attend to each picture. Each 
grouping included two boys’ toys and two girls’ toys 
arranged in alternating order and selected from the 
group of 16 pictures on the basis of their popularity 
with kindergartners (Brown, 1956). For convenience, 
all groups of cards, both in the section of the test 
just described and in succeeding sections, were at- 
tached to 9” by 12” cards. 


Scoring 


The weighted scoring procedure suggested by 
Brown (1956) was employed. Using this method, 
male choices were given a positive score, the amount 
varying according to the section of the test. Female 
choices were given a zero score. Under this scoring 
system the possible range for total scores was from 
zero (consistently feminine) through 84 (consist- 
ently masculine). 


Reliability 


Brown has reported test-retest correlations of .71 
and .84 for the It Scale administered to kindergarten 
Ss 30 days apart. An estimate of the reliability of 
the scale for Ss of three and four years of age was 
obtained in the present study by the same method. 
Forty Ss were given a second administration of the 
test. These Ss equally represented the age, sex, and 
instruction groups and were selected on a “first 
come” basis from the laboratory nursery school and 
one of the cooperative nursery schools. The mean 
interval between tests was 60.6 days. The product- 
moment correlation between tests was .66 for boys 
(p < .01) and .71 for girls (p < .01). These coeffi-
cients are fairly high for tests involving preschool
Ss with an interval between tests of approximately 
two months.

When mean scores from the two administrations
of the It Scale were compared it was found that 
girls moved toward greater femininity on the re- 
test (see Table 1). It is difficult to account for this 
change in terms of inter-S communication, practice 
effects, etc., considering the ages of the Ss and the 
length of the interval between tests. It is possible 
that the test-retest differences reflect new attitudes 
acquired by girls during the two-month interval be- 
tween tests. Interestingly, boys did not move to- 
ward greater masculinity on the retest. This would 
imply that the preschool period may not be so im- 
portant for the male as for the female as a transi- 
tional period in sex-typing. 
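
The significance levels attached to the test-retest correlations above can be checked with the usual t transformation of a product-moment correlation, t = r * sqrt(N - 2) / sqrt(1 - r^2). The sketch below assumes N = 20 per sex (the Ns shown in Table 1) and a two-tailed .01 critical value of 2.878 for 18 degrees of freedom, taken from standard t tables; the function name is illustrative.

```python
import math

def t_for_correlation(r, n):
    """t statistic (df = n - 2) for testing that a correlation is zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)

CRITICAL_T_01_DF18 = 2.878  # two-tailed .01 value from standard t tables

for label, r in (("boys", 0.66), ("girls", 0.71)):
    t = t_for_correlation(r, 20)
    beyond = t > CRITICAL_T_01_DF18
    print(f"{label}: r = {r:.2f}, t = {t:.2f}, beyond .01 level: {beyond}")
```

Under these assumptions both coefficients exceed the critical value, in agreement with the reported p < .01.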


RESULTS 


The It Scale scores of 150 Ss were ana- 
lyzed by means of an analysis of variance that 
involved three factors: age, sex, and instruc- 
tional procedure. The frequency distributions 
of the 12 subgroups appeared by inspection 
to approximate normality. The variances of 
these subgroups appeared to be homogeneous 
with two exceptions: (a) the variance for 
four-year-old girls in Group A was approxi- 
mately four times the average variance of the 
other subgroups; (b) the variance for four- 
year-old boys in Group C was approximately 
one-third the average variance of the other 
subgroups (see Table 2). While two vari- 
ances thus appeared to be deviant, the possi- 
bility remained that homogeneity might be 
true of the table as a whole due to the ap- 
parent homogeneity of variance in 10 of the 
cells. Accordingly, Bartlett’s test was applied 
which yielded a chi square of 17.86, df = 11, 
p < .10. Thus the hypothesis of homogeneity 
of variance could be sustained. It was de- 
cided, however, that no findings from the 


TABLE 1
Mean It Scale Scores on Test and Retest

                     Mean      SD        t        p
Girls (N = 20)                         2.675    <.02
  Test              34.60    14.32
  Retest            25.85    16.34
Boys (N = 20)                          [illegible]
  Test              60.30    15.11
  Retest            63.33    16.93


TABLE 2
Means, Ranges, and Standard Deviations of Scores on the It Scale
for Boys and Girls in Two Age Groups and Three Instruction Groups

                    Instruction
                    Group          N     Mean     Range      SD
Girls
  Three-year-olds   A             10    37.70    20-49    10.071
                    B             10    31.10     0-50    15.667
                    C             10    32.30    15-47    12.475
  Four-year-olds    A             15    31.27     0-84    25.058
                    B             15    11.67     0-40    11.948
                    C             15    15.80     0-50    14.386
Boys
  Three-year-olds   A             10    56.50    37-74    11.342
                    B             10    56.00    32-74    14.156
                    C             10    62.90    35-76    11.344
  Four-year-olds    A             15    60.80    30-84    13.668
                    B             15    63.33    31-84    17.117
                    C             15    71.13    48-84     8.155


analysis of variance would be accepted as sig- 
nificant unless the probability level reached 
was beyond .01. This decision was based on 
recommendations by Lindquist (1953). 
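
The two deviant variances can be checked directly against the standard deviations in Table 2. The sketch below squares each subgroup SD and compares the two flagged cells with the average of the remaining 11. It also places the reported chi square between the .10 and .05 critical values for 11 degrees of freedom (17.275 and 19.675, from standard chi-square tables). It does not recompute Bartlett's statistic itself, and the cell labels are illustrative.

```python
# Standard deviations from Table 2, keyed by (sex, age, instruction group).
SD = {
    ("girls", 3, "A"): 10.071, ("girls", 3, "B"): 15.667, ("girls", 3, "C"): 12.475,
    ("girls", 4, "A"): 25.058, ("girls", 4, "B"): 11.948, ("girls", 4, "C"): 14.386,
    ("boys", 3, "A"): 11.342, ("boys", 3, "B"): 14.156, ("boys", 3, "C"): 11.344,
    ("boys", 4, "A"): 13.668, ("boys", 4, "B"): 17.117, ("boys", 4, "C"): 8.155,
}

def ratio_to_other_cells(cell):
    """Variance of one cell divided by the mean variance of the other cells."""
    others = [sd ** 2 for key, sd in SD.items() if key != cell]
    return SD[cell] ** 2 / (sum(others) / len(others))

girls_ratio = ratio_to_other_cells(("girls", 4, "A"))  # "approximately four times"
boys_ratio = ratio_to_other_cells(("boys", 4, "C"))    # "approximately one-third"
print(f"four-year-old girls, Group A: {girls_ratio:.2f}")
print(f"four-year-old boys, Group C: {boys_ratio:.2f}")

# Reported Bartlett chi square against tabled critical values for df = 11.
CHI2_10, CHI2_05 = 17.275, 19.675
print(CHI2_10 < 17.86 < CHI2_05)
```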

The results of the analysis of variance may 
be found in Table 3. In this analysis the ef- 
fects of sex were highly significant (F = 
220.762, p < .001). While on the one hand 
this finding merely demonstrates the validity 
of the It Scale, it is also important because 
it demonstrates clear-cut sex-role differentia- 
tion among very young children. 
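
Each F in Table 3 is the effect mean square divided by the error mean square, which gives a quick consistency check on the table. The sketch below recomputes the four legible F ratios. It also rebuilds the error sum of squares from Table 2 under the assumption (not stated in the text) that the tabled SDs were computed with N rather than N - 1 in the denominator. As a further inference, not a reported figure, it obtains the Age X Sex sum of squares implied by its F ratio.

```python
MS_ERROR = 235.236  # error mean square from Table 3 (df = 138)

# Effect mean squares and the F ratios printed in Table 3.
effects = {
    "Instructions": (596.73, 2.537),
    "Age": (506.25, 2.152),
    "Sex": (51931.20, 220.762),
    "Instructions X Sex": (1406.73, 5.980),
}
for name, (mean_square, reported_f) in effects.items():
    print(f"{name}: F = {mean_square / MS_ERROR:.3f} (reported {reported_f:.3f})")

# Error sum of squares rebuilt from Table 2, ASSUMING each SD used N in its
# denominator, so that a cell's within-cell sum of squares is N * SD**2.
cells = [  # (N, SD) for the 12 subgroups
    (10, 10.071), (10, 15.667), (10, 12.475),
    (15, 25.058), (15, 11.948), (15, 14.386),
    (10, 11.342), (10, 14.156), (10, 11.344),
    (15, 13.668), (15, 17.117), (15, 8.155),
]
error_ss = sum(n * sd ** 2 for n, sd in cells)
error_df = sum(n - 1 for n, _ in cells)
print(f"error SS = {error_ss:.2f} on {error_df} df (Table 3: 32,462.63 on 138)")

# Age X Sex sum of squares implied by its F ratio (df = 1); an inference only.
implied_age_by_sex_ss = 16.464 * MS_ERROR
print(f"implied Age X Sex SS = {implied_age_by_sex_ss:.2f}")
```

The rebuilt error sum of squares agrees with the tabled value to within rounding of the SDs, which is what makes the N-denominator assumption plausible.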

The age by sex interaction was also signifi- 
cant (F = 16.464, p < .001) and may be in- 
terpreted as meaning that the effects of age 
on It Scale scores depended upon the sex 
of the child. The simple effects of age were 
studied by means of t tests in which the error
term from the analysis of variance was em-
ployed. The age difference for girls was sig-
nificant (t = 3.925, p < .001), indicating that
four-year-old girls were more feminine in their
sex-role preferences than three-year-old girls.
The age difference for boys, on the other
hand, reached only borderline significance (t
= 1.840, p < .10). Table 2 shows that the di-
rection of this trend was for four-year-old
boys to display more masculine preferences 
than three-year-old boys. 
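
The age comparisons can be approximated from the tabled values: the difference between two age-group means is divided by a standard error built from the analysis of variance error term. The sketch below pools the Table 2 cell means over instruction groups (30 three-year-olds and 45 four-year-olds per sex). Because it works from rounded means, it comes out near, but not exactly at, the reported values of 3.925 and 1.840; the helper names are illustrative.

```python
import math

MS_ERROR = 235.236  # error mean square from Table 3

def simple_effect_t(mean_a, n_a, mean_b, n_b, ms_error=MS_ERROR):
    """t for a difference between two means using the ANOVA error term."""
    return (mean_a - mean_b) / math.sqrt(ms_error * (1 / n_a + 1 / n_b))

def pooled_mean(cells):
    """Weighted mean over (n, mean) cells."""
    return sum(n * m for n, m in cells) / sum(n for n, _ in cells)

girls_3 = pooled_mean([(10, 37.70), (10, 31.10), (10, 32.30)])
girls_4 = pooled_mean([(15, 31.27), (15, 11.67), (15, 15.80)])
boys_3 = pooled_mean([(10, 56.50), (10, 56.00), (10, 62.90)])
boys_4 = pooled_mean([(15, 60.80), (15, 63.33), (15, 71.13)])

# Lower It Scale scores are more feminine, higher scores more masculine.
t_girls = simple_effect_t(girls_3, 30, girls_4, 45)
t_boys = simple_effect_t(boys_4, 45, boys_3, 30)
print(f"girls: t = {t_girls:.2f}; boys: t = {t_boys:.2f}")
```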

The effects of the instructions on It Scale 
scores also depended on the sex of the child. 
The instruction by sex interaction was signifi-
cant beyond the .005 level. Analysis of the

TABLE 3
Analysis of Variance of It Scale Scores

Source                        df    Sum of Squares    Mean Square        F
Instructions                   2          1,193.45         596.73       2.537
Age                            1            506.25         506.25       2.152
Sex                            1         51,931.20      51,931.20     220.762**
Instructions X Age             2            151.64          75.82
Instructions X Sex             2          2,813.46       1,406.73       5.980*
Age X Sex                      1       [illegible]    [illegible]      16.464**
Instructions X Age X Sex       2   457.[illegible]         228.61
Error                        138         32,462.63        235.236
Total                        149         93,388.83

* p < .005.
** p < .001.

simple effects showed: (a) girls in Group A
showed less feminine preferences than girls in
both Group B (“little girl”; t = 3.320, p <
.01) and Group C (S’s name; t = 2.637, p <
.01); (b) mean scores for girls in Groups B
and C did not differ; (c) boys in Group A did
not differ from boys in Group B, but were
less masculine than boys in Group C (t =
2.019, p < .05); (d) boys in Group B were
slightly less masculine than those in Group C
(t = 1.715, p < .10).
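
These instruction comparisons can be reproduced from Table 2 to within rounding. The sketch below pools each instruction group over the two ages within sex (10 three-year-olds and 15 four-year-olds, so 25 Ss per group) and tests the differences against the Table 3 error mean square; the variable and function names are illustrative.

```python
import math

MS_ERROR = 235.236  # error mean square from Table 3
N_PER_GROUP = 25    # 10 three-year-olds + 15 four-year-olds per sex

# (three-year-old mean, four-year-old mean) from Table 2.
cell_means = {
    ("girls", "A"): (37.70, 31.27),
    ("girls", "B"): (31.10, 11.67),
    ("girls", "C"): (32.30, 15.80),
    ("boys", "A"): (56.50, 60.80),
    ("boys", "B"): (56.00, 63.33),
    ("boys", "C"): (62.90, 71.13),
}

def group_mean(sex, group):
    """Mean for one instruction group, weighting the two ages by their Ns."""
    young, old = cell_means[(sex, group)]
    return (10 * young + 15 * old) / N_PER_GROUP

def instruction_t(sex, higher, lower):
    """t for the difference between two instruction groups within one sex."""
    standard_error = math.sqrt(MS_ERROR * 2 / N_PER_GROUP)
    return (group_mean(sex, higher) - group_mean(sex, lower)) / standard_error

comparisons = [
    ("girls", "A", "B", 3.320),
    ("girls", "A", "C", 2.637),
    ("boys", "C", "A", 2.019),
    ("boys", "C", "B", 1.715),
]
for sex, higher, lower, reported in comparisons:
    t = instruction_t(sex, higher, lower)
    print(f"{sex} {higher} vs {lower}: t = {t:.3f} (reported {reported:.3f})")
```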

Inspection of the raw data relative to the 
analysis of variance showed confounding of 
both birth order and size of family with the 
variable of age. A greater proportion of three-
year-old Ss as compared with four-year-olds
were first born children (p < .10). Also, four-
year-old boys had more siblings than three-
year-olds (p < .02). The same tended to be
true of girls, although only at the .20 level.
The results of a series of t tests, applied to
the age differences with both birth order and
family size held constant, were consistent
with the analysis of variance results. Four-
year-old girls were more feminine than three-
year-olds (p < .01); four-year-old boys were
more masculine than three-year-olds, but only
at a borderline level (p < .10).

The analysis just reported yielded one other 
finding of some importance. There were no 
significant differences between It Scale scores 
of first born and later born children. Thus 
birth order, per se, appeared to have no re-
lation to the early acquisition of sex-role pref-
erences.

Although this study was not designed to
relate sex-role preferences to socioeconomic
status, Ss were obtained from two diverse so- 
cial classes. However, 61% of the lower-class 
children (36 Ss from the community-sup- 
ported day care center) had fathers not liv- 
ing at home. When compared with the other 
Ss tested, who were from intact middle-class 
homes, no class differences were found in sex- 
role preference scores for either boys or girls. 
For heuristic purposes, the 14 lower-class Ss 
who came from intact homes were compared 
with the middle-class Ss. Once again, no class 
differences in It Scale scores emerged. 


DISCUSSION 
Sex Differences 


The sex difference found in It Scale scores 
suggests that at least some aspects of sex-role 
differentiation begin very early in life. Be- 
ginning in infancy, parents in the United 
States commonly prescribe different toys, 
modes of dress, etc., for boys and for girls. 
The findings of the present study reflect the 
success of such child training procedures. 

A sex difference in degree of appropriate 
sex-role preferences was also revealed by the 
data. In middle childhood, boys prefer the 
stereotyped masculine role more strongly than 
girls prefer the stereotyped feminine role
(Brown, 1957). The same appeared to be 
true of the three-year-olds in the present 
study (see Table 2). Among four-year-olds in 
Groups A and C boys also showed slightly 
stronger masculine preferences than girls 
showed feminine ones. Only in Group B were 
four-year-old girls more feminine than boys 
were masculine. Thus, the majority of these
data are in line with the findings for older 
children: boys more strongly prefer the stereo- 
typed male role than girls prefer the stereo- 
typed female role. It should be stressed, how- 
ever, that this tendency was considerably less 
marked among the preschool Ss of the pres- 
ent study than among the older Ss of other 
studies. Quite probably the preschooler is only 
beginning to experience: (a) the greater socio- 
cultural advantages and more consistent re- 
wards for sex-typed behavior accorded the 
male in United States culture; (b) the more 
clearly defined sex-role for the male than for 
the female; and (c) more frequent punish- 
ment of boys than girls for opposite-sex be- 
havior. 


Age Differences 

Although only significant at a borderline 
level, four-year-old boys obtained more mas- 
culine scores on the It Scale than three-year- 
old boys. A similar relation between age and 
sex-role preferences has been reported for 
older boys (Brown, 1957). In the present 
study three-year-old boys obtained a mean 
score of 58.47; the 11-year-old boys tested by 
Brown obtained a mean score of 76.73. Thus, 
for boys, there appears to be a steady change 
toward greater masculinity during the 3- 
through 11-year period. This change suggests 
that boys probably receive relatively consist- 
ent reinforcement throughout early and mid- 
dle childhood for adopting certain stereo- 
typed aspects of the male role. This finding 
could also reflect a condition wherein the 
early parental demands for appropriate male 
sex-typing are augmented in later years by 
the demands of other socializing agents (e.g., 
teachers, peers, etc.). 

The present findings showed that four-year- 
old girls made significantly more feminine 
choices than three-year-old girls. By itself, 
this finding is analogous to the age difference 
found for boys. However, Brown (1957) re- 
ported a change toward masculinity in older 
girls: kindergartners obtained a mean score 
of 38.40 while fourth-grade girls reached 
56.40. In the present study the three- and 
four-year-old girls in Group A (whose scores 
are most comparable to Brown’s) obtained 
mean scores of 37.70 and 31.27, respectively. 
Therefore, preschool-aged girls possess the
most feminine sex-role preferences of any 
group of girls in the 3- through 10-year pe- 
riod. Actually, the age differences found in 
It Scale scores for girls support an hypothe- 
sis of Lynn’s (1959): girls may be femininely 
oriented in early childhood due to a basic 
identification with their mothers, but later 
exposure to the masculine-oriented culture 
breaks down or covers over this identification 
during the years of middle childhood. It could 
be also that the mother, who is virtually the 
sole feminine model for the preschool girl, is 
a model that is particularly stereotyped (i.e., 
the mother’s femaleness is typified for the 
child by homemaking, child care, etc.). On
the other hand, the older girl has additional 
feminine models available to her (outside the 
home, on television, in fiction, etc.). These 
models are frequently observed in nonstereo- 
typed situations or in situations where their 
behavior is more typical of men than women. 
Such a broadened conception of the female 
role for the older girl could produce more 
masculine scores on the It Scale in which
femininity is defined in terms of highly stereo- 
typed choices. 


Effects of Instructions 


Instructions that employed S’s name in as- 
sociation with the drawing resulted in more 
masculine scores for boys than instructions 
that stressed lesser degrees of stimulus-to- 
subject similarity. It is possible that high 
stimulus-to-subject similarity elicited the most 
valid measure of boys’ sex-role preferences 
since the questioning was related to S him- 
self. On the other hand, the use of S’s name 
could have aroused anxiety concerning E’s
approval of inappropriate (i.e., female) pref- 
erences. Such anxiety could have elicited an 
inordinate number of safe, stereotyped male 
preferences. Thus the data indicate only that 
the instructions employed with the It Scale 
affect the sex-role preference scores of boys;
the data do not indicate which instructions 
produce the most valid scores. 

The finding that no difference existed be- 
tween scores for boys in Group A (“It”) and 
Group B (“little boy”) can be interpreted in 
two ways. First, boys in Group A may have 
perceived the figure as a boy.* If such were
the case, the test situation for Group A would
be very similar to the situation for Group B.
The alternative interpretation is that Ss in
Group B were indifferent to the label “boy.”
This is saying in effect that Ss in neither
group perceived the figure as a boy, a situa-
tion which seems improbable. Since the findings
demonstrate that stimulus-to-subject simi-
larity affects boys’ responses on the It Scale, 
the various post hoc hypotheses concerning 
the dynamics of these effects become impor- 
tant problems for future research. 

The instructions employed with the It 
Scale also influenced the scores of girls. For 
all girls, but particularly for four-year-olds, 
instructions that referred to the figure as “a
little girl” resulted in more feminine scores
than the label “It.” The use of S’s name pro-
duced little additional femininity in girls’
scores. Since labeling the figure “a little girl”
produced such marked femininity in girls’
scores, the question is raised as to whether
some girls in Group A perceived “It” as a
boy. The hypothesis that the sexless figure
was perceived as a boy by some Ss is sup-
ported by the finding that four-year-old
girls in Group A had much more variable
scores than girls in Groups B and C. How-
ever, variability among Group A three-year-
olds was not greater than among three-year-
olds in Groups B and C. If some girls per-
ceived “It” as a boy, why, then, did three-
year-olds make so few masculine choices?
Possibly the three-year-old girls did not pos-
sess adequate information concerning the com-
ponents of masculine behavior. If so, Ss of
this age who perceived “It” as a boy would
have been unable to give consistently mascu-
line choices. On the other hand, the four-year-
old girls may have been sufficiently sophisti-
cated concerning “boyishness” that they were
able to choose appropriately for the figure if


8 Subsequent to collecting the data for this in- 
vestigation, the authors polled 10 male Ss in another 
nursery school group concerning what was pictured 
on the card. Six of these Ss said it was a picture 
of a girl. Obviously, data for many more Ss are 
needed before evaluating the validity of the hypothe-
sis that the figure is a definitely “boyish” figure for
preschool aged boys. The results of the poll are
mentioned only to suggest that this hypothesis may
not be tenable.
they saw “It” as a boy. This interpretation
would explain the fact that the differences in 
both means and in variability were larger 
among the subgroups of four-year-old girls 
than among the subgroups of three-year-old 
girls. 

These findings carry particularly strong im- 
plications for the use of the It Scale with 
girls. If the unstructured drawing is fre- 
quently seen as a male, then older girls’ re- 
sponses under this condition obviously do not 
indicate the extent to which they prefer the 
female role. 


Birth Order 


The data from this study indicated the ir- 
relevance of birth order to the acquisition of 
sex-role preferences. It should be pointed out 
that it is not known if older siblings of a par- 
ticular sex (e.g., only older brothers or older 
sisters) affect the development of such pref- 
erences. The present investigation did not 
offer a means of checking findings of this kind 
that have been reported by Koch (1956) and 
Brown (1956). 


Social Class 


The fact that mean scores for the two socio- 
economic groups did not differ in the present 
study could possibly be due to the particular 
lower-class population studied. One would 
predict on the basis of Rabban’s (1950) find- 
ings that scores for lower-class children would 
be more sex-typed than scores for middle- 
class children. On the other hand, Pauline 
Sears (1951), in a study of doll play aggres- 
sion, found that boys whose fathers were ab- 
sent from the home were less aggressive (and 
as such, perhaps more feminine) than boys 
whose fathers were present. Since 61% of the 
lower-class Ss in this study had fathers who 
were absent, it is possible that the effects of 
social class and father absence canceled each 
other out in the comparison between lower- 
and middle-class Ss. No differences were found 
between the lower class Ss whose fathers were 
present and the middle-class Ss. However, the 
probability of a gross sampling error in this 
comparison is extremely high in view of the 
fact that less than 10 children of each sex 
made up the lower-class sample. 
SUMMARY


The purposes of this study were: (a) to 
acquire normative data relative to sex-role 
preferences in three- and four-year-old chil- 
dren, and (b) to study the effects of verbal 
instructions that stressed varying amounts of 
similarity between a drawing and S himself 
on a projective test of children’s sex-role pref- 
erences. 
Ss were 161 three- and four-year-old chil-
dren drawn from four nursery school popula- 
tions in Iowa. Sixty-five Ss, ranging in age 
from 3-0 to 4-0, constituted the three-year- 
old group and 96 Ss, ranging from 4-0 to 
5-0, constituted the four-year-old group. 

Sex-role preferences in this study were 
measured by the It Scale for Children. The 
major findings: (a) clear-cut sex differences 
in It Scale scores were found; (b) girls at 
four years scored significantly more feminine 
than three-year-old girls; (c) four-year-old 
boys were more masculine than three-year- 
old boys at a borderline level of significance; 
(d) girls responded with more feminine scores 
when the drawing employed in the It Scale 
was called “a little girl” than when called 
“It”; (e) boys responded with more mascu-
line scores when the figure was called by S’s
own name than when the figure was called
“It.”

While these findings imply that early child- 
hood is an important period in sex-role de- 
velopment, they also imply that the acquisi- 
tion of sex-role preferences by the male is a 
less complicated developmental process than 
for the female. The findings also suggest that
the measure of sex-role preferences provided 
by the It Scale is highly sensitive to varia- 
tion in the instructions given to the S. 
Finding causes for reading disabilities in
children has intrigued the members of almost 
every profession having contact with such 
children. Although varied explanations for 
reading disabilities have been offered, two
main approaches to the problem can be
identified. Reading disability has been con- 
sidered by a number of writers (Jarvis, 1958; 
Vorhaus, 1946) primarily as a symptom 
of emotional disturbances. They discussed 
what reading might symbolize for the child 
(i.e., conforming to environmental demands,
a “feminine” activity for boys, the uncon-
scious conflicts, etc.) and what the child
might be rebelling against when he fails to 
learn to read. 

A different emphasis is expressed by Bender 
(1949) and Fabian (1945) who view the de- 
velopment of reading ability as dependent 
upon the same factors which result in per- 
ceptual-motor maturity. Following this view-
point, Silver (1952) includes as specific and
integral parts of this syndrome functioning
on the Visual Motor Gestalt Test on a rather
primitive level, e.g., difficulty in constructing
angles and difficulty in figure background per-
ception. In their studies Fabian and Silver
found that retarded readers show inferior
gestalt test performance compared with nor-
mal readers. These results seem to confirm 
the hypothesis that reading disability is char- 
acterized by perceptual-motor immaturity and 


1 The present paper is a condensation of part of
a doctoral dissertation completed at Northwestern 
University, Evanston, Illinois. The author wishes to 
thank Robert I. Watson for his suggestions, and 
Jacob Cohen of New York University for his help 
with the statistical section of this paper and the 
staffs of the Institute for Juvenile Research, Chi- 
cago; Dyslexia Memorial Institute, Chicago; Bureau 
of Child Study, Chicago; National College of Edu- 
cation, Evanston, Illinois; Michael Reese Hospital, 
Chicago; and Bellevue Psychiatric Hospital, New 
York, for their cooperation in obtaining the subjects. 


hence a developmental lag. Yet these studies 
may be criticized for failing to assess the ex- 
tent to which the emotional conflicts are pri- 
mary and the developmental retardation (the 
reading disability) a reflection of the conflicts. 

Perceptual-motor functioning, usually op- 
erationally defined by performance on the 
Bender gestalt test or other tests of this na- 
ture, has been described by Silver (1953) as 
not only involving visual perception, but also 
the expression of that perception, the result 
reflecting the quality of the perception plus 
the motor impulsivity and the attempts at its 
control. These factors are considered as in- 
separable. In the present study an attempt 
has been made to investigate this develop- 
mental framework and to control for the in- 
fluence of emotional maladjustment in the 
retarded readers. Thus, if reading disability 
reflects a lag or retardation in perceptual- 
motor development, retarded readers should 
show significantly less mature performance on 
the Bender gestalt test than either a compa- 
rable group of normal readers or a group of 
emotionally disturbed children who have nor- 
mal reading ability. 


SELECTION OF SUBJECTS


Since reading disability is generally defined 
as an inability to learn to read properly in 
spite of normal intelligence, it was necessary 
to obtain a measure of reading ability and in- 
telligence for each subject (S) to be consid- 
ered for inclusion in the present study. To 
ascertain the reading level, the California 
Reading Test (Tiegs & Clark, 1950a, 1950b) 
was employed with reading retardation being 
defined as a score at or below the 20th per- 
centile for the S’s chronological age and nor- 
mal ability as a score at the 40th percentile 
or above. Normal intelligence was defined as 
an IQ score of 90 or more on the verbal sec- 
tion of the Wechsler Intelligence Scale for 
TABLE 1
DESCRIPTION OF GROUPS

                                    Diagnostic Groups

                            Reading          Emotionally
                            Disability       Disturbed        Normal
                            Group R          Group E          Group N
Variables                   RY      RO       EY      EO       NY      NO

Number of Subjects          20      20       20      20       20      20
Ratio of Boys to Girls      19:1    14:6     19:1    15:5     19:1    14:6
Age in Months
  Mean                      110     133      105     134      109     129
  SD                        5.7     7.5      7.0     7.1      6.5     6.4
Intelligence Level (IQ)
  Mean                      104a    104a     111b    106b     107c    106c
  SD                        9.9     9.7      11.9    10.6     6.8     10.7
Reading Level in Months
  Mean                      93      106      115     134      120     137
  SD                        7.0     12.0     9.9     9.1      6.1     11.4

a WISC Verbal IQ.
b WISC Verbal IQ or S-B.
c K-A IQ.


Children (WISC) (Wechsler, 1949), or the 
Stanford-Binet test (S-B) (Terman & Mer- 
rill, 1937) or the Kuhlman-Anderson Intelli- 
gence Test (K-A) (Kuhlman & Anderson, 
1952).²
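The selection criteria just stated can be written out as a small decision rule. The following is only an illustrative sketch in Python: the function names and group labels are hypothetical, and the treatment of scores falling between the 20th and 40th percentiles (returned here as ineligible) is an assumption, since the text defines only the two cutoffs.

```python
from typing import Optional


def reading_status(percentile: float) -> Optional[str]:
    """Classify a reading-test percentile for the S's chronological age.

    Cutoffs follow the text: at or below the 20th percentile counts as
    reading retardation, at or above the 40th as normal reading ability.
    """
    if percentile <= 20:
        return "reading_retarded"
    if percentile >= 40:
        return "normal_reader"
    # Assumption: scores between the two cutoffs meet neither definition.
    return None


def eligible(percentile: float, iq: float) -> Optional[str]:
    """Return the reading classification only if IQ is 90 or more."""
    if iq < 90:
        return None
    return reading_status(percentile)
```

For example, `eligible(20, 104)` yields `"reading_retarded"`, while a child at the 30th percentile meets neither definition.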

Three groups of 40 children were selected. 
Group R consisted of children meeting the 
above criteria with respect to reading retarda- 
tion and normal intelligence as measured by 
the WISC. They were selected from clinics 
for remedial reading. Children referred to 
mental hygiene clinics for psychological or 
psychiatric diagnosis or treatment who were 
of normal intelligence and normal reading 
ability comprised Group E. The reasons for 
which these children had been referred to the 
clinics may be summarized in three overlap-
ping categories: behavior problems in school
(disturbance in class, truancy, etc.); unman- 
ageable behavior at home (stealing, temper 
tantrums, etc.); “nervous symptoms” (fears, 
enuresis, etc.). The normal group (Group N) 
was obtained from a public school and did not 


2 Because of time limitations it was not possible 
to use the same intelligence test for all children. 
Correlation among the scores of the three tests 
(Pastovic & Guthrie, 1951; Traxler, 1941) for simi- 
lar age groups and within the IQ range of the pres- 
ent study suggests sufficient comparability of scores 
to permit using them for equating the groups. 


include any children having a record of re- 
ferral for diagnosis or treatment because of 
emotional difficulties.

Each of these groups was divided into sub-
groups of 20 Ss each according to age. The
younger half (Subgroup Y) ranged in age 
from 8 to 9 years and 11 months, and the 
older half (Subgroup O) from 10 years to 11 
years and 11 months. More detailed informa- 
tion concerning the groups is contained in 
Table 1. 

For Groups E and R, if a WISC or S-B 
score less than one year old was available, 
and for Group N if a K-A score less than one 
year old was available, no additional IQ tests 
were given. Otherwise the investigator indi- 
vidually administered the verbal section of 
the WISC. The reading test was individually 
administered to the children in Groups R and 
E and in a group administration to the chil- 
dren in Group N. 

Since reading disability is noted more fre- 
quently among boys, the groups were equated 
as to ratio of boys to girls, as well as age and 
IQ. The t tests computed between mean read-
ing scores at comparable age levels for Groups
E and N did not reach statistical significance.
An analysis of variance computed from the 
IQ scores of all six groups did not yield a sig- 
nificant F value. This suggests that the groups
were adequately equated on the basis of read- 
ing ability and intelligence. 


METHOD 


The nine figures comprising the Bender gestalt test 
were individually administered according to the usual 
procedure as described by Bender (1938). Five types 
of distortions which, according to Bender (1938), 
are suggestive of immaturity in perceptual-motor 
development when still produced above the age of
eight years, were then scored as being either present
or absent for each child. Definitions of the distor- 
tions were taken from Bender (1938) and from the 
Bender gestalt test scoring criteria developed by 
Pascal and Suttell (1951). 

Angulation: rounding the points at corners, blunt- 
ing corners, adding extra angles, changing the direc- 
tion of angles, omitting angles or leaving gaps at 
corners greater than one-eighth inch. Silver (1953) 
has related the presence of this distortion to reading 
disability. If two or more angles of Figures A, 3, 4,
7, or 8 showed the distortion, it was scored as 
present for the S. 

Rotation: scored as present when in the reproduc- 
tion of a figure the major axis of the stimulus figure 
had been turned 90 degrees. No score was given 
when the S turned the paper to make the most eco-
nomical use of it. Fabian (1945) and Silver (1953)
have related the presence of this distortion to read- 
ing disability. 

Primitivation: drawing circles or loops instead of
dots. This was scored as present when two dots or
more of Figures 1, 3, or 5 were distorted. De Hirsch
(1954) has related this and the following two dis-
tortions to reading disturbances.

Separation: separating the adjoining or overlap-
ping parts of figures by one-eighth inch or more. It
was scored present when noted in Figures A, 4, 7, or 8.

Slant: drawing the columns of circles in Figure 2
perpendicular to the horizontal axis of the figure or
slanting them in the opposite direction. If two col-
umns or more were distorted, a score of present was
given.

A tally was made of the number of children for
whom these distortions were scored present. Table 2
summarizes these data for each of the six groups.
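The present-absent scoring rules described above reduce to a few threshold checks per child. The sketch below is illustrative only: the dictionary keys are hypothetical names for an examiner's tallies, and judging whether an individual angle, dot, or column is distorted is assumed to have been done beforehand according to the Bender (1938) and Pascal and Suttell (1951) definitions.

```python
def score_distortions(tallies: dict) -> dict:
    """Convert one child's tallies into present (True) / absent (False) scores.

    Expected keys (hypothetical names):
      distorted_angles   - distorted angles in Figures A, 3, 4, 7, or 8
      rotated_figures    - figures whose major axis was turned 90 degrees
      distorted_dots     - dots drawn as circles or loops in Figures 1, 3, or 5
      separated_figures  - figures (A, 4, 7, 8) with parts separated by 1/8 inch or more
      distorted_columns  - columns in Figure 2 drawn perpendicular or slanted the opposite way
    """
    return {
        "angulation": tallies["distorted_angles"] >= 2,
        "rotation": tallies["rotated_figures"] >= 1,
        "primitivation": tallies["distorted_dots"] >= 2,
        "separation": tallies["separated_figures"] >= 1,
        "slant": tallies["distorted_columns"] >= 2,
    }


def tally_present(children: list) -> dict:
    """Count, for each distortion, the children for whom it was scored present."""
    totals = {}
    for tallies in children:
        for name, present in score_distortions(tallies).items():
            totals[name] = totals.get(name, 0) + int(present)
    return totals
```

Applied to every child in a subgroup, `tally_present` produces one row of counts of the kind reported in Table 2.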


ANALYSIS OF THE DATA 


A relatively new method of data analy- 
sis, variously called multivariate information 
transmission analysis (McGill, 1954) and un- 
certainty analysis (Garner & McGill, 1956), 
was employed in the analysis of the data. 
This method makes possible the analysis of 
nominally scaled dependent variables in fac- 
torial design formats and can be used as a 
nonmetric and nonparametric analogue of the 
analysis of variance (Attneave, 1959; Garner 
& McGill, 1956). For designs in which the 
effects of orthogonal independent variables on 
a dependent variable are to be studied, the 
significance of interactions as well as main 
effects can be determined. The functions 
studied are amounts of “information trans-
mitted” (or degree of uncertainty reduction)
measured in “bits,” for either main effects 
(T’) or interactions (A’). The product of a 
T’ or A’ value with the constant 1.3863N is 
distributed as chi square for the appropriate 
number of degrees of freedom (df). A bias in 
the resulting chi squares may be corrected by 
subtracting from it the df for that source 
(Attneave, 1959),³ as can be seen in Table 3.
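The conversion from bits to chi square described here is a one-line computation; the constant 1.3863 is 2 ln 2. The Python sketch below is illustrative, with hypothetical function names. The value N = 600 used in the example is an assumption (120 children by 5 distortion types); it is not stated in the text but reproduces the entries of Table 3.

```python
import math


def chi_square_from_bits(bits: float, n: int) -> float:
    """Chi square approximation for a transmitted-information value in bits.

    Implements chi square = 1.3863 * N * T, where 1.3863 = 2 * ln(2).
    """
    return 2.0 * math.log(2.0) * n * bits


def corrected_chi_square(bits: float, n: int, df: int) -> float:
    """Bias-corrected chi square: subtract the df for that source."""
    return chi_square_from_bits(bits, n) - df


# Age effect from Table 3: T' = .00917 bits with 1 df.
# N = 600 is an assumed value (120 Ss x 5 distortions), not given in the text.
age_chi = chi_square_from_bits(0.00917, 600)
age_chi_corrected = corrected_chi_square(0.00917, 600, 1)
```

Under that assumption the age effect comes out at about 7.63 uncorrected and 6.63 corrected, in agreement with the tabled values.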

The amount of information transmitted 
from each source, its df, and the resultant 


3 Attneave (1959) suggests the correction of the
T’ or A’ value for bias. It is, however, readily dem-
onstrable that the method used here is equivalent,
and for the purpose of significance testing,
computationally simpler.


TABLE 2
INCIDENCE OF BENDER GESTALT TEST DISTORTIONS FOR EACH DIAGNOSTIC GROUP

                                               Distortions

Diagnostic   Angulation        Rotation          Primitivation     Separation        Slant
Groups       Present  Absent   Present  Absent   Present  Absent   Present  Absent   Present  Absent
RY           18       2        4        16       7        13       9        11       13       7
RO           15       5        2        18       4        16       5        15       6        14
EY           15       5        2        18       4        16       5        15       12       8
EO           12       8        2        18       2        18       4        16       5        15
NY           11       9        3        17       5        15       4        16       6        14
NO           11       9        4        16       5        15       2        18       7        13
TABLE 3
MULTIVARIATE INFORMATION TRANSMISSION ANALYSIS OF THE EFFECTS OF AGE, DIAGNOSTIC GROUP,
AND TYPE OF DISTORTION ON THE PRESENCE-ABSENCE OF DISTORTION

                                                      Amount                            Corrected
Source of Transmission                                (Bits)   df   Chi Square   p      Chi Square   p
T’ (Age; P-A)                                         .00917    1      7.627    <.01       6.627    <.02
T’ (Diagnostic Group; P-A)                            .00887    2      7.378    <.05       5.378     .07
T’ (Type Distortion; P-A)                             .11844    4     98.516    <.01      94.516    <.01
A’ (Age, Diagnostic Group, P-A)                       .00497    2      4.134     ns        2.134     ns
A’ (Age, Type Distortion, P-A)                        .00380    4      3.161     ns          -       ns
A’ (Diagnostic Group, Type Distortion, P-A)           .01126    8      9.366     ns        1.366     ns
A’ (Age, Diagnostic Group, Type Distortion, P-A)      .00600    8      4.991     ns          -       ns
T (Age, Diagnostic Group, Type Distortion; P-A)       .16251   29    135.173    <.01     106.173    <.01


chi square are additive. The over-all sum re- 
flects the total amount of information trans- 
mitted between the independent variables 
taken together and the dependent variable or 
the amount of uncertainty reduction, or more 
generally still, the association. It is analogous 
to a multiple correlation with the dependent 
variable, when all variables are metric. 
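To make the quantity "information transmitted" concrete, the main effect of age can be recomputed from the counts in Table 2. The sketch below uses the standard definition of transmitted information: the uncertainty of the dependent variable minus its average uncertainty within levels of the independent variable. The paper does not spell out its computation, so this route is an assumption, the function names are hypothetical, and the counts are as transcribed from Table 2.

```python
import math


def entropy_bits(counts: list) -> float:
    """Shannon uncertainty, in bits, of a set of category counts."""
    total = sum(counts)
    return -sum(c / total * math.log2(c / total) for c in counts if c > 0)


def transmitted_bits(table: list) -> float:
    """Information transmitted between a grouping variable and presence-absence.

    `table` holds one (present, absent) pair per level of the grouping variable.
    """
    present = sum(p for p, _ in table)
    absent = sum(a for _, a in table)
    n = present + absent
    h_dependent = entropy_bits([present, absent])
    h_conditional = sum((p + a) / n * entropy_bits([p, a]) for p, a in table)
    return h_dependent - h_conditional


# "Present" counts from Table 2, summed over the five distortions.
younger_present = (18 + 4 + 7 + 9 + 13) + (15 + 2 + 4 + 5 + 12) + (11 + 3 + 5 + 4 + 6)
older_present = (15 + 2 + 4 + 5 + 6) + (12 + 2 + 2 + 4 + 5) + (11 + 4 + 5 + 2 + 7)
per_age_level = 60 * 5  # 60 children at each age level, 5 distortions each

age_table = [
    (younger_present, per_age_level - younger_present),
    (older_present, per_age_level - older_present),
]
t_age = transmitted_bits(age_table)
```

With these counts `t_age` comes to roughly .0092 bits, which agrees with the T’ (Age; P-A) entry of Table 3 to the precision reported there.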


RESULTS 


As applied to this problem, age (the three 
younger subgroups compared with the three 
older subgroups), diagnostic group (compari- 
sons among Groups R, E, and N), and type 
of distortion are the independent variables 
and presence or absence of the distortion 
(P-A) is the dependent variable. Table 3 sets
forth these results. 

The highly significant total amount of in- 
formation transmitted indicates that the joint 
effect of age, diagnostic group, and type of 
distortion is a real one. This effect may, how- 
ever, be trivial since the effect of type of dis- 
tortion is merely due to the fact that the five 
distortions investigated are of different inci- 
dence in the population. It is in fact this 
source that accounts for the lion’s share of 
the total amount of information transmitted. 
None of the interactions (A’) even approach 
significance. The data thus fail to indicate
the existence of any patterns of type of dis- 
tortion as a function of age, or of diagnostic 
group, or of age and diagnostic group. 

The analysis of the data does indicate, how- 
ever, that the age difference contributes sig- 
nificantly to the presence of the distortions, 
the younger groups providing the greater num- 


ber of distortions. Since the two age groups 
studied differed by about two years, the sig- 
nificance of the difference between them at- 
tests to the ability of the Bender gestalt test 
to differentiate along a chronological age con- 
tinuum. Thus the five distortions investigated 
seem to be affected by maturational factors 
between ages 8 and 12. This finding confirms 
Bender’s descriptive data (1938) that the 
test is sensitive to differences in perceptual- 
motor ability up to the age of 12 years. 

The differences among Groups R, E, and 
N meet the .05 criterion of significance for 
the uncorrected chi square. This would sug- 
gest group differences in the incidence of the 
distortions. However, upon correction the sig- 
nificance of the difference slips to the .07 
level. It was deemed worthy to follow this up 
and to attempt to identify the source of the 
difference more specifically. Accordingly, fur- 
ther complete multivariate transmission analy- 
ses were performed on the diagnostic groups 
taken a pair at a time. In these analyses, it 
was found that total distortion was signifi- 
cantly greater in the retarded readers than in 
the normal group (T’ = .01239, corrected chi 
square = 5.870, p < .02), and tended to be 
greater in the retarded readers than in the 
emotionally disturbed group (T’ = .00780,
corrected chi square = 3.325, p < .07). In 
these supplementary analyses, the interactions 
also failed to approach significance. 

The presence of these distortions readily
distinguishes children with reading disabili-
ties from normal children. However, when
emotionally disturbed, normal readers are 
compared with reading retarded children, the 
distortions do not distinguish so efficiently.
The difference here, just below significance,
suggests that though reading retardation may
well reflect immaturity in perceptual-motor
development, this developmental hypothesis
cannot be used to account in full for the re-
tardation. Emotionally disturbed, normal read-
ers offered these distortions too frequently to
permit their exclusive association to reading
retardation.

The findings of the present study lend sup-
port to the importance of the developmental
concept in understanding reading disability.
The question of the etiology of this develop-
mental lag should still be raised. The present
study can offer no evidence on this point.


SUMMARY AND CONCLUSIONS


The relationship between perceptual-motor
development and reading disability was in-
vestigated. The Bender gestalt test was ad-
ministered to children retarded in reading
ability, children defined as emotionally dis-
turbed but normal readers, and normal chil-
dren. These groups were matched on a num-
ber of pertinent variables.

It was hypothesized that the following
Bender gestalt test signs would discriminate
among the three diagnostic groups and be-
tween two age levels within these groups: (a)
difficulty in constructing angles, (b) rota-
tion of figures, (c) primitivation of figures,
(d) separation of adjacent parts of figures,
(e) inability to maintain slant of figure.

The presence or absence of these distor-
tions, the dependent variable in a four dimen-
sional information transmission analysis, was
affected by the type of distortion and by the
age variable. The younger Ss produced more
distortions than the older group.

A suggestive finding regarding the fre-
quency with which the distortions occurred in
the three diagnostic groups was explored. The
distortions were indeed offered more fre-
quently by the reading disability group
than by the normal children. When the read-
ing retarded and the emotionally disturbed
children were compared, the significance of
the difference fell just below an acceptable
level.

The developmental hypothesis has received
some support from these findings though it
cannot claim to explain fully the presence of
reading disorders. No etiological implications
can be drawn.
AN EXPLORATION OF THE RELATIONSHIP BETWEEN
HYPNOTIZABILITY AND ANXIETY
AND/OR NEUROTICISM ¹


FRED HEILIZER ²


University of Rochester 


Studies by Eysenck (1943b, 1944, 1947) 
and Himmelweit, Desai, and Petrie (1946) 
indicate the existence of a positive correla- 
tion between neuroticism and hypnotizability. 
In the most comprehensive of these studies, 
Eysenck (1947) found a perfect positive rank 
order correlation between neuroticism and 
postural sway for groups of neurotics and 
normals. In the same publication, Eysenck 
(1947), on the basis of a 47-item question- 
naire, demonstrated that “suggestible neu- 
rotics and nonsuggestible neurotics differ with 
precisely the same factor [neuroticism] as do 
neurotics as a whole and normals” (p. 272). 
In another study, Eysenck (1944) reports 
that the neurotics swayed significantly more 
than the normals, and concludes “that there 
is a close relation between primary suggesti- 
bility and neurosis” (p. 410). In a third 
study, Eysenck (1943b) found that neurotics 
were most responsive on the postural sway test
when they were first admitted to the hospital 
and that their postural sway scores decreased 
during a period of four weeks at the hospital. 
He interprets this finding as supporting his 
hypothesis that neuroticism and hypnotiza- 
bility are positively related by assuming that 
hospitalization resulted in a psychological im- 
provement in the patients. Himmelweit et al. 
(1946) found a tetrachoric correlation of .51, 
based upon a dichotomy between neurotic and 
normal subjects (Ss), between postural sway 
and psychiatric (neurotic) diagnosis. 

In a second group of studies no relation- 


1 This paper was abstracted from a dissertation 
submitted in partial fulfillment of the PhD degree. 
I would like to express my gratitude to G. R. Wendt 
and E. L. Cowen for their interest and assistance in 


this study. 
2 Now at the VA Hospital, Northampton, Mass.


ship between neuroticism and hypnotizability 
was found. Davis and Husband (1931), 
Bartlett (1936b), and Messer, Hinckley, and 
Mosier (1938) report no relationship be- 
tween neuroticism, as defined by the Thur- 
stone or Bernreuter Personality Schedule, and 
hypnotizability, as defined by a depth of 
hypnosis scale or by postural sway. Bartlett 
(1936a), using hospitalized neurotics and nor- 
mals, found no difference between the two 
groups on the postural sway test. On the ba- 
sis of data collected from hospitalized neurotic 
soldiers, Eysenck (1943a), in an early study, 
concludes that there is “good agreement with 
the findings of Bartlett, who found no differ- 
ences between neurotics and normals in the 
Body Sway Test” (p. 30). Ingham (1954) 
reports that two of Eysenck’s students found 
that neurotics and normals did not differ sig- 
nificantly on the postural sway test. 

Despite the general superiority of the first 
group of studies, in terms of the N utilized
and the specification of the neuroticism di- 
mension, there are two sets of findings which 
indicate that the conclusion drawn by Ey- 
senck and Himmelweit et al. (that there is 
a general positive relationship between neu- 
roticism and hypnotizability) may be prema- 
ture. Benton and Bandura (1953) report a 
nonsignificant correlation (r = .19) between
static ataxia (a neuroticism indicator) and 
postural sway when utilizing normal Ss. This 
is consistent with a rarely quoted datum (r =
.06 between static ataxia and postural sway
with normal Ss) reported by Eysenck (1947,
p. 277). The second set of findings is provided
by Ingham (1954, 1955) who reports
(a) that neurotics demonstrate significantly 
more postural sway and arm movement sug- 
gestibility than do normals, but (b) that 
there is no evidence of such a difference if
baseline activity (without direct repetitive
verbal suggestions) is controlled. Further, 
Ingham (1955) reports that neurotic Ss who 
had taken sedatives the night before scored 
significantly higher on the arm movement test 
than did neurotic Ss who had not taken seda- 
tives. However, it is not clear on the basis of 
these data whether the hypnotizability scores 
are related to neuroticism per se, to the taking 
of sedatives per se, or to a combination of 
these factors. 


PROBLEM 


The purpose of this study was to investigate
the relationship between neuroticism and
hypnotizability among normals, utilizing (a)
an adequate sample, (b) more precise measurement
of the postural sway and static ataxia
variables than is usually made, (c) the heat
illusion test of hypnotizability in addition to
postural sway, and (d) a sampling of measures
of neuroticism and anxiety. The neuroticism
and anxiety concepts were used interchangeably
on the theoretical grounds that
the two concepts are closely related (Fenichel,
1945; May, 1950; White, 1948). A supporting
datum for such usage has been gathered
by Franks (1954) who found a correlation
of .92 between the Maudsley Medical Questionnaire,
a widely used measure of neuroticism,
and the Taylor anxiety scale. Since
there is no exact counterpart of a psychiatric
interview with normal college students and
since the problem of selectively validating one
or a few measures of anxiety or neuroticism
has not yet been satisfactorily resolved, a
variety of measures were used. Although such
procedure may not be conducive to an immediate
decision to accept or reject the null
hypothesis, it would seem to be more conducive
to a fruitful line of investigation in
these circumstances.

There are two methodological usages in the
studies outlined above which are central to
this study. (a) Hypnotizability has been almost
invariably defined in terms of the postural
sway test. In only one study (Davis &
Husband, 1931) was hypnotizability defined
in terms of a depth of hypnosis scale.
(b) The studies which indicate the existence of a
positive correlation between neuroticism and
hypnotizability defined neuroticism in terms
of either static ataxia or the Maudsley Medi- 
cal Questionnaire. In evaluating the relation- 
ship between neuroticism and hypnotizability 
among normal Ss, then, the most direct com- 
parability involves postural sway as the inde- 
pendent variable and static ataxia and the 
Taylor anxiety scale (see above: Franks, 
1954) as the dependent variables. The use of 
the heat illusion test of hypnotizability and 
the additional measures of anxiety and neu- 
roticism constitute an attempt to extend the 
generality of this investigation. 


INSTRUMENTS 


Hypnotizability, the degree or depth to 
which a person can be hypnotized, was de- 
fined primarily in terms of reaction to the 
postural sway test (Furneaux, 1952; Heilizer, 
1959) and secondarily in terms of reaction to 
the heat illusion test (Eysenck & Furneaux, 
1945; Furneaux, 1946; Furneaux, 1952). The 
postural sway test was introduced to S as a 
test of motor activity. A control period and 
an experimental period of 2½ min. each were
utilized. S was instructed to relax and stand 
still during the control period. The experi- 
mental period consisted of direct repetitive 
suggestions to S that she was falling for- 
ward. Measurements of deviations from the 
baseline were made at 5-sec. intervals. The 
30 measurements for each period were aver- 
aged and the difference between the two av- 
erages constituted the postural sway score. 
The heat illusion test was introduced as a 
measure of sensory threshold. S was shown 
how a small heating element became hot as a 
pointer was moved along a scale from 0 to 
100. The heating element was then placed 
against S's forehead and she moved the 
pointer along the scale until she reported 
feeling the heat. A second trial was run with 
the heating element disconnected. If S re- 
ported feeling the heat on this trial, she was 
given a positive (hypnotizable) score. 
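
The scoring rule for the postural sway test can be sketched in a few lines. The sketch below is illustrative only: it assumes 30 deviation readings per period (one every 5 sec.), takes the score as the experimental average minus the control average (the direction of subtraction is an assumption), and uses a hypothetical function name and made-up readings in unspecified units.

```python
from statistics import mean


def postural_sway_score(control, experimental):
    """Mean deviation in the experimental period minus mean deviation
    in the control period (subtraction order is an assumption)."""
    if len(control) != 30 or len(experimental) != 30:
        raise ValueError("expected 30 readings per period")
    return mean(experimental) - mean(control)


# Hypothetical readings, for illustration only.
control = [0.2] * 30
experimental = [0.2 + 0.1 * i for i in range(30)]
score = postural_sway_score(control, experimental)
```

A larger score here would mean more movement away from the baseline under suggestion than during quiet standing.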

The dependent measures were derived from 
five instruments or tasks: the control period 
of the postural sway task, the Biographical 
Inventory, the Bills-Vance-McLean Index of 
Adjustment and Values (the Bills), a paper- 
and-pencil mirror tracing task, and stories 
written in response to TAT cards. 
The static ataxia measure (Eysenck, 1947;
Himmelweit, et al., 1946) was derived as the 
standard deviation of the 30 measurements 
taken during the control period of the pos- 
tural sway task. The A Scale (Taylor, 1953) 
and the Lie Scale (Welsh & Dahlstrom, 1956) 
were utilized from the Biographical Inven- 
tory. The self-concept—ideal-self discrepancy 
(Column IV) was used from the Bills (Bills, 
Vance, & McLean, 1951). Time (in seconds) 
and error scores were derived from a mirror 
tracing task which utilized a six-pointed star 
(Waters & Sheppard, 1952). The error scores 
were recorded in terms of 0.1-in. unit excursion
amplitude by summing the largest single in- 
side and outside excursion for each of the 
12 straight-line segments to produce a total 
error score. The Discomfort Relief Quotient 
(DRQ) (Dollard & Mowrer, 1947) was 
scored in phrase units from stories written 
in response to three TAT cards (7BM, 12M, 
6BM). In addition to the DRQ, an Emo- 
tionality scale, a Distance scale, three Con- 
striction scales, and a Time/Phrase scale 
were utilized. The Emotionality scale was derived
as the ratio of the number of phrases
expressing emotion (relief or discomfort) to
the total number of phrases. The Distance 
scale was based on the assumption that a per- 
son who typically reacts to anxiety-produc- 
ing situations by withdrawing will, when writ- 
ing a story in response to a TAT card, refrain 
from projecting himself into the story; he 
will write a story qualified by statements 
which detract from the projective quality. 

Such statements were classified under four
headings:

1. Personal references (e.g., “I would say
. . .;” “I think . . .;” “I believe . . .;” “As
you can see . . .;”)

2. Probability statements in which the outcome
is undecided or doubtful (e.g., “Either
. . . or . . .;” “Probably . . .;” “Perhaps
. . .;” “Seems . . .;” “Appears to be . . .;”
“Assuming . . .;”)

3. Concrete statements (e.g., “The characters
are . . .;” “The outcome is . . .;” “This
is a picture of . . .;” “In the first slide
. . .;” “In this he says . . .;”)

4. Vague statements, referring to a vague
“something” or to indefinite ages (e.g., “Something
is bothering him . . .;” “He is thinking
of something in his past . . .;” “He is 25
or 26 years old . . .;” “She is between 20
and 25 . . .;”).

The score was derived as the ratio of the 
number of statements reflecting distance to
the total number of phrases. Constriction 
scales were derived from the DRQ, Emo- 
tionality, and Distance scales by dividing the 
scores on the respective scales by the total 
number of phrases, thus giving added weight 
to the amount produced by reducing the 
scores according to the total number of 
phrases. This procedure was based on the as- 
sumption that one of the usual reactions of 
anxious Ss, when placed in a fairly unstructured
situation, is to withdraw by producing
as little as possible, that is, constriction.
The Time/Phrase scale, utilized only for the 
second TAT testing (see below), was derived 
as a rate-of-production indicator: the ratio of 
the amount of time, in seconds, to the total
number of phrases. Experimenter’s (E’s)
scoring reliability, based upon the same phrase 
units for each scoring of 50 randomly se- 
lected stories, was .95, .85, and .98 for the 
DRQ, Emotionality, and Distance scales, re- 
spectively. 
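
The derived measures described in this section reduce to a standard deviation and a few ratios, which may be sketched as follows. This is a sketch under stated assumptions: the function names and counts are hypothetical, the sample (n - 1) form of the standard deviation is assumed because the text does not say which form was used, and the DRQ itself is left out because its formula is not given here (its Constriction scale would be formed in the same way as the other two).

```python
from statistics import stdev


def static_ataxia(control_readings):
    # Standard deviation of the control-period readings.
    # Assumption: sample SD (n - 1 in the denominator).
    return stdev(control_readings)


def tat_scales(n_emotion, n_distance, n_phrases, seconds=None):
    """Ratio scales for one set of TAT stories, scored in phrase units."""
    if n_phrases <= 0:
        raise ValueError("at least one phrase is required")
    emotionality = n_emotion / n_phrases
    distance = n_distance / n_phrases
    scales = {
        "emotionality": emotionality,
        "distance": distance,
        # Constriction: the scale score divided again by total phrases.
        "emotionality_constriction": emotionality / n_phrases,
        "distance_constriction": distance / n_phrases,
    }
    if seconds is not None:
        # Time/Phrase: seconds of writing time per phrase produced.
        scales["time_per_phrase"] = seconds / n_phrases
    return scales


# Hypothetical counts, for illustration only.
example = tat_scales(n_emotion=6, n_distance=3, n_phrases=30, seconds=240)
```

Dividing by the number of phrases a second time means that, for the same ratio, an S who writes fewer phrases receives a larger Constriction value.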


SUBJECTS AND PROCEDURES 


The Biographical Inventory was distributed 
to a class of 135 female introductory psychol- 
ogy students by the instructor with a request 
to complete it at home for purposes of stand- 
ardization. Of the 135 students, 128 com- 
pleted the Biographical Inventory, 75 volun- 
teered for this experiment, and 62 completed 
the experimental situation. (Fifty nine of the 
62 Ss completing the study also completed 
the Biographical Inventory.) Comparisons 
were made between the A Scale and Lie Scale 
scores of volunteers and nonvolunteers. The 
two groups were not significantly different 
(A Scale: t = 0.26, F = 1.38; Lie Scale:
t = 0.24, F = 1.06). Thus, it appears that
the Ss utilized in this study were a representative
sample of the class of 135 students.

The Bills and the three TAT cards were
presented in group sessions. A 6-min. time
limit was allowed for each TAT card. The
following tasks were then presented in individual
sessions: postural sway, heat illusion,
the three TAT cards, and mirror tracing.
TAT cards were presented with no time limit
to 49 randomly selected Ss. Originally, we 
had expected Card 12M to elicit many stories 
concerning hypnosis which could be analyzed 
as to content and outcome. However, only 15 
stories about hypnosis were elicited during
the group testing and many of these were
only incidentally concerned with hypnosis.
Therefore we repeated the TAT presentations
with the incidental remark preceding
presentation of Card 12M: “This is an interesting
one. Did you know that almost everyone
makes up a story about hypnotism for
this card?” This resulted in 11 hypnosis
stories, several of which were only incidentally
concerned with hypnosis.


RESULTS AND DISCUSSION 


The relationship between postural sway and
each of the dependent variables was evaluated
by Pearson and curvilinear correlations.
(The curvilinear correlations were computed
with postural sway as the independent variable.)
Fisher z scores and phi coefficients
were computed utilizing a 65th percentile
division of the postural sway distribution
(Heilizer, 1959). Six of the 76 tests of significance
reached the .05 level: a Pearson correlation
of .29 for Emotionality (second TAT
testing); curvilinear correlations of .77 and
.53 for mirror tracing time and error scores,
respectively (in which the deviations from a
zero, linear slope were confined to the high
postural sway group, above the 65th percentile,
and were in the shape of an inverted
U); and phi coefficients for DRQ/Constriction
(-.38), Emotionality/Constriction
(-.30), and Time/Phrase (-.30), all
of the second TAT testing.

Neither static ataxia nor the Taylor anxiety
scale yielded a significant relationship
to postural sway despite the use of a more
precise measurement of the postural sway
and static ataxia variables than had previously
been made in the pertinent literature.
We conclude, then, that the positive relationship
between neuroticism and hypnotizability
reported by Eysenck and Himmelweit et al.
does not occur among normal Ss when
hypnotizability and neuroticism are defined in
terms of similar instruments.


If the remainder of the dependent variables
are divided into measures with currency
in the literature (Lie Scale, Bills, mirror 
tracing, DRQ) and measures which were de- 
rived for this study on rational grounds (Emo- 
tionality, Distance, the three Constriction 
scales, Time/Phrase) there are no distinctive 
trends in either group, with 2 out of 24 
evaluations reaching significance in the for- 
mer group and 4 out of 44 evaluations reach- 
ing significance in the latter group. 

The relationship between heat illusion (27 
positive and 35 negative responses) and each 
of the dependent variables was ascertained by 
the phi coefficient. Two of the 19 tests of sig- 
nificance reached the .05 level: the phi co- 
efficients for Distance (.36) and Distance/ 
Constriction (.29), both of the first TAT 
testing. 
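
For readers unfamiliar with the phi coefficient, a minimal sketch of its computation from a fourfold table follows. The cell counts are hypothetical (only the 27/35 split of heat illusion responses is taken from the text), the function name is illustrative, and the significance check assumes the usual relation chi-square = N x phi squared with 1 degree of freedom and a .05 critical value of 3.84.

```python
from math import sqrt


def phi_coefficient(a, b, c, d):
    """Phi for the 2 x 2 table [[a, b], [c, d]]."""
    denom = sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denom == 0:
        raise ValueError("a row or column total is zero")
    return (a * d - b * c) / denom


# Hypothetical split: rows are heat illusion positive (27) and
# negative (35); columns are high and low on some dependent measure.
a, b, c, d = 18, 9, 12, 23
phi = phi_coefficient(a, b, c, d)
n = a + b + c + d
chi_square = n * phi ** 2          # 1 degree of freedom
significant_05 = chi_square > 3.84
```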

Thus, on the basis of the few significant 
relationships and the inconsistencies among 
the significant data, we conclude that the 
null hypothesis cannot be rejected for either 
the primary or secondary measures of hyp- 
notizability or neuroticism and/or anxiety. It 
is possible that one or more of the significant 
results may reflect a true relationship. How- 
ever, the assertion that one or more of the 
significant results does reflect a true relation- 
ship would be acceptable only upon replica- 
tion. 
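
The caution about the small number of significant results can be made concrete with a rough chance expectation. The sketch below assumes, purely for illustration, that the tests are independent and that each has a .05 chance of reaching significance under the null hypothesis; the tests in this study share Ss and measures, so the figures are an approximation and not an exact probability.

```python
from math import comb


def prob_at_least(k, n, p):
    """Binomial probability of k or more successes in n trials."""
    return sum(comb(n, i) * p ** i * (1 - p) ** (n - i)
               for i in range(k, n + 1))


# Postural sway: 6 of 76 tests reached the .05 level.
expected_sway = 76 * 0.05
p_six_or_more = prob_at_least(6, 76, 0.05)

# Heat illusion: 2 of 19 tests reached the .05 level.
expected_heat = 19 * 0.05
p_two_or_more = prob_at_least(2, 19, 0.05)
```

Under these assumptions about four significant results would be expected by chance among the 76 postural sway tests and about one among the 19 heat illusion tests.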

It seems appropriate to conclude that the 
positive relationship between neuroticism and 
hypnotizability reported by Eysenck and Him- 
melweit et al. does not apply to normal Ss. 
At this point, Ingham’s data, indicating that 
hypnotizability may not be related to neu- 
roticism per se, assume added importance. It 
is possible that the relationship between neu- 
roticism and hypnotizability among neurotics 
is the result of a drug usage which does not 
occur to an appreciable extent among nor- 
mals. If this proves to be the case, then no 
simple relationship between neuroticism and 
hypnotizability can be stated, and the drug 
action might profitably become the focus of 


research interest. 
SUMMARY 


This study was designed to examine the relationship
between hypnotizability and neuroticism
among normal Ss, utilizing (a) an
adequate sample, (b) more precise measurement
of the postural sway and static ataxia
variables than is usually made, (c) the heat 
illusion test of hypnotizability in addition to 
postural sway, and (d) a sampling of meas- 
ures of neuroticism and anxiety. Sixty-two 
college female volunteers completed the pos- 
tural sway and heat illusion tests for hyp- 
notizability and a variety of tests of neu- 
roticism and anxiety. 

The null hypothesis could not be rejected, 
thus leading to the conclusion that the posi- 
tive relationship between neuroticism and hyp- 
notizability reported to occur with neurotic 
Ss does not occur with normal Ss. It was 
speculated that the relationship between neu- 
roticism and hypnotizability among neurotics 
is the result of a drug usage which does not 
occur to an appreciable extent among normals. 


PERSONALITY CHARACTERISTICS ASSOCIATED WITH
RESISTANCE TO CHANGE* 


CARROLL E. IZARD 
Vanderbilt University


A number of studies have shown that people
misperceive, change their opinions, or
otherwise behave so as to indicate that they
are yielding or conforming in response to the
pressure of the group, to authority, or to some
other form of influence (Asch, 1952; Hovland,
Janis, & Kelley, 1953; Hovland &
Weiss, 1951; Sherif & Harvey, 1952). These
researches have demonstrated that a number
of social conditions will bring about such
change. On the average, experimental subjects
(Ss) exposed to one of these conditions
will yield, misperceive, or change their stand
on an issue significantly more than a control
group. Since the individual members of a
group, even an experimentally manipulated
group, differ in important ways, it is not surprising
that psychologists are beginning to
study the personality differences among people
who yield in varying degrees to outside
influence (Boomer, 1959; McDavid, 1959).
These studies have not yielded data on specific
or well delineated personality variables.
One reason for this was the lack of personality
measures adequate to the task.

* This study was financed in part by contract
Nonr 2149(02), O. J. Harvey, principal investigator.
The opinions and conclusions do not necessarily
reflect those of the Navy Department. The author is
indebted to Harvey for valuable suggestions, particularly
with regard to the procedure for measuring the
effects of influence.

Recent research has pointed to Edwards
Personal Preference Schedule (EPPS) as a
possibility. Bernardin and Jessor
(1957), utilizing the EPPS to measure personality
characteristics, tested the hypothesis
that a group high on Autonomy and low on
Deference would be less dependent than a
group low on Autonomy and high on Deference.
The Ss in each group were exposed to
experimental situations designed to elicit be- 
havior relevant to three aspects of depend- 
ency: (a) reliance on others for approval, 
(b) reliance on others for help, and (c) con- 
formity as measured in an Asch-type group 
situation. The high Autonomy, low Deference 
group was significantly less dependent in 
terms of the first two criteria but not on con- 
formity. They thought that the Autonomy- 
Deference scales failed to predict conformity 
scores because of the difference between the 
behavioral situations involved in responding 
to the EPPS and the concrete reality of the 
Asch-type group situation. Gisvold (1958) at- 
tempted to check on this latter proposition. 
He modified Asch’s method so that the pres- 
sure to conform was artificially produced— 
an S did not see or hear the actual responses 
of the rest of the group but was under the 
illusion that the three incorrect responses exhibited
by the experimenter (E) were the
opinions of the other three Ss in the experi- 
mental situation. Under these conditions Au- 
tonomy correlated significantly with conform- 
ity but Deference did not. Correlations for 
other EPPS scales and conformity were not 
reported in either of the foregoing studies. 

In the present study an hypothesis relating 
to a third type of social influence was formu- 
lated for Autonomy and Deference as well as 
for Dominance and Abasement. Since the cri- 
terion measure was in terms of resistance to 
change in response to an authority figure’s at- 
tempt to influence, it was hypothesized that 
Autonomy and Dominance would correlate 
positively with the criterion while Deference 
and Abasement would correlate negatively. 
The selection of these EPPS scales as pre- 
dictors of behavior change in this experi- 
mental situation was largely on the basis of 
the judged relevance of the item content of
these scales to the predicted behavior. Ac- 
cording to the item content of the scales: 
Autonomy relates to a person’s preference 
for behaving in terms of his own thoughts 
and feelings independently of the thoughts 
and feelings of others; Deference relates to 
a person’s preference for seeking the opinion 
and advice of others and for looking to others 
for decisions and leadership; Dominance re- 
lates to a person’s preference for a super- 
ordinate role, for arguing for his own point of 
view; Abasement relates to a person’s pref- 
erence for giving in rather than fighting to 
have his own way and to a tendency to 
feel inferior to other people. The reports of 
Bernardin and Jessor and Gisvold, published 
subsequent to the completion of the present 
experiment, lend some support to the hy- 
pothesis. However, the former authors did 
not report the separate correlations of Au- 
tonomy and Deference with their criteria of 
dependency or conformity, and as already in- 
dicated the latter found no significant rela- 
tion between Deference and conformity. Re- 
sults favoring the present hypothesis will 
throw further light on the validity of the 
selected EPPS scales for predicting Ss’ re- 
sponses to interpersonal influence and will 
help identify the personality characteristics
associated with tendency to change.


PROCEDURE 


The 39 Ss were all the members of a class in psychology.
They were given the EPPS during a class
hour early in the semester. Following this, each S
was asked to meet with E individually to participate
in an experiment in judgment. On arrival at the
laboratory, S was told that he would be taken into
a dark room where he would estimate the distance
between two points of light. S was seated in the
dark room and given pencil and pad with which he
was to record his estimates in inches. The stimulus
lights were exhibited for 3 sec. with an interval of
4 sec. between them and with an [. . .] interval of 5 sec.
[. . .] were approximately eye
level, 15 ft. ahead of S. The horizontal distance between
[. . .]

You have just seen the standard series of lights
which varied from (S’s minimum estimate)
to (S’s maximum estimate) inches apart. I
shall continue to show you lights within this range,
except that occasionally I’ll show a pair of lights
definitely farther apart than any in the standard
series. For the rest of the experiment, you should
say “standard” if the pair of lights is within the
range . . . to . . . which you just saw. You
should say “longer” if the pair of flashes are distinctly
farther apart than any in the standard
series.


S was then shown 90 more pairs of lights, each at 
the fixed 24 in. apart. 


It was considered that in the process of making 
the first 30 estimates S formed a concept of the dis- 
tance or range of distances between pairs of lights 
in the series. Thus, for each subsequent set of 30 
judgments, the score for resistance to change of con- 
cept was taken as the largest number of consecutive 
“standard” responses. The final index of resistance to
change of concept for a given S was obtained by
summing the three resistance scores from the three
sets of 30 judgments.
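
The index just described can be sketched as follows. The function names are hypothetical, the responses are coded as the strings "standard" and "longer", and the sketch assumes that a run of "standard" responses is counted within each set of 30 and does not carry over from one set to the next.

```python
def longest_standard_run(responses):
    """Largest number of consecutive "standard" responses in one set."""
    longest = current = 0
    for response in responses:
        if response == "standard":
            current += 1
            longest = max(longest, current)
        else:
            current = 0
    return longest


def resistance_index(responses, set_size=30, n_sets=3):
    """Sum of the longest "standard" runs over successive sets."""
    if len(responses) != set_size * n_sets:
        raise ValueError("expected %d responses" % (set_size * n_sets))
    sets = [responses[i:i + set_size]
            for i in range(0, len(responses), set_size)]
    return sum(longest_standard_run(s) for s in sets)


# Hypothetical record of the 90 judgments following E's instructions.
record = (["standard"] * 30
          + ["standard"] * 10 + ["longer"] * 20
          + ["longer"] * 30)
index = resistance_index(record)
```

On this scoring an S who never says "longer" receives the maximum index of 90, and an S who says "longer" on every trial receives 0.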


RESULTS 


Table 1 shows the correlations of the four 
EPPS scales with the criterion of resistance 
to change. The correlations for men and 
women are reported separately, since the 
sexes have different means on the EPPS variables
under consideration. Since a directional
hypothesis was made, the one-tailed test was 
applied. 
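
The size of correlation needed for significance with samples this small can be sketched from the usual relation between r and t with N - 2 degrees of freedom, r = t / sqrt(t^2 + df). The critical t values below are taken from a standard one-tailed .05 t table and are an assumption of this sketch, as is the function name; nothing here is drawn from the author's own computations.

```python
from math import sqrt

# One-tailed .05 critical values of t, keyed by degrees of freedom,
# as given in standard t tables (assumption of this sketch).
T_CRIT_ONE_TAILED_05 = {20: 1.725, 15: 1.753}


def critical_r(n, t_table=T_CRIT_ONE_TAILED_05):
    """Smallest r reaching one-tailed .05 significance for sample size n."""
    df = n - 2
    t = t_table[df]
    return t / sqrt(t * t + df)


r_men = critical_r(22)     # 22 males, df = 20
r_women = critical_r(17)   # 17 females, df = 15
```

Because the female group is smaller, a larger correlation is required for women than for men to reach the same level of confidence.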

For the men, the correlations were all in 
the predicted direction. The correlations of 
Autonomy and Dominance with the criterion 
were significant at the .05 level of confidence 
and that for Abasement closely approached 
this level. The correlation of Deference and 
the criterion was in the predicted direction 
but failed to reach significance. For the 
women, none of the four correlations were 
significant at the .05 level. The correlation
for Deference and the criterion was in the
right direction and approached the magnitude
of the correlations for the men.


TABLE 1

CORRELATIONS BETWEEN THE PERSONALITY (EPPS)
CHARACTERISTICS AND THE CRITERION OF
RESISTANCE TO CHANGE

                 Resistance to Change
             Males (N = 22)   Females (N = 17)

Autonomy          .38*             .05
Dominance         .38*            -.13
Deference        -.24             -.34
Abasement        -.35             -.04

* The critical value for r at p = .05 by one-tailed test is .36
for men, .42 for women.


For both sexes, correlations were computed
for the 11 EPPS variables for which no hypotheses
were made. A two-tailed p of .01
was required for these correlations, in view
of the number computed and the lack of directional
hypotheses. On this basis, only the
correlation for men of -.53 between Order
and the criterion approached significance.
Only two other nonsignificant correlations
were of the order of magnitude reached by
the significant hypothesized correlations: for
Succorance r was .49 for women and for Endurance
r was -.35 for men.


DISCUSSION


It is difficult to compare these results with
those of the two previous studies relating
personality (EPPS) variables and change.
Neither of them reports separate correlations
for men and women. Both studies utilized
only the Autonomy and Deference scales and
one (Bernardin & Jessor, 1957) used the two
scales as a multiple criterion for grouping
without giving any indication of the contribution
of the separate scales to the prediction
of hypothesized behavior. All three studies
used different methods of exerting social influence
or pressure to change.

If both Autonomy and Deference contributed
to the differentiation of dependent
and independent Ss in the Bernardin and
Jessor study and if susceptibility to change
is indeed a part of the dependency construct,
then Autonomy is the one EPPS variable that
had a measure of validity in all three studies.
Bernardin and Jessor did not get a difference
between their dependent and independent
groups in an Asch-type situation, but Gisvold
did when he modified the Asch technique so
that Ss could conform without publicly announcing
a response contrary to a concrete
reality situation. In the present study Ss could
likewise conform or be influenced in private.

The difference between the relationships of
EPPS variables and the criterion for men and
women could mean that the personality variables
lead to different responses in the two
sexes, that the experimental situation had a
different effect, or that the interaction of personality
and situational variables was different
for males and females. It is entirely possible
that judging the distance between lights
in a dark room could have quite different
meaning and different degrees of relevance for
the sexes.

Although none of the correlations for the
EPPS variables judged irrelevant to the criterion
actually reached significance at the required
level, two of them were between the
.05 and .01 levels and were equal to or larger
than the hypothesized correlations. Further,
the correlations for these variables were consistent
in direction for both sexes. These variables
and the correlations with the criterion
for males and females, respectively, were: Order,
-.53, -.28; and Succorance, .19, .49. It
is conceivable that Ss characterized by high
Order may have felt considerably disoriented
and insecure in the relatively unstructured,
ambiguous experimental situation. If so, it is
reasonable that they should be more susceptible
to outside influence, that is, to instructions which
gave additional structure and order to the
situation. A possible, though highly speculative,
explanation for the correlations of Succorance
with the criterion is that Ss high on
Succorance perceive attempts to change or influence
them as a threat to their need for acceptance,
affection, and encouragement.

The interpretation of the four variables in- 
volved in the hypothesis of this study is sim- 
pler, at least for male Ss. Autonomy and 
Dominance have a degree of validity for pre- 
dicting resistance to interpersonal influence 
as measured in the experimental situation. It 
can be said that each of these scales measures, 
in part, a facet of behavior on the autonomy- 
heteronomy dimension. The two previous 
studies support this generalization with re- 
spect to Autonomy. One of these studies, and 
the present one to a lesser degree, found that 
Deference also measured an aspect of behavior
on this dimension for both men and
women.

The fact that the male Ss varied in the extent
to which they changed their perceptual-cognitive
response to a stimulus situation under
influence of E’s instruction and the fact
that this change tended to be associated with
certain operationally defined personality characteristics
have implications for various interpersonal
situations. Such situations include
some approaches to psychotherapy and coun- 
seling, where the advice, information, sugges- 
tion, or interpretation of one person is in- 
tended to influence another. Males low on 
Autonomy and Dominance and high on Abase- 
ment might be expected to change to a greater 
extent. There was some evidence that high 
Deference would contribute to susceptibility 
to change in both males and females. 


SUMMARY 


It was hypothesized that the personality 
characteristics of Autonomy, Dominance, Deference,
and Abasement, as measured by the
EPPS, would correlate with change of concept
under influence of E’s instructions. The
39 Ss, all the students in one undergraduate 
class in psychology, were first given the EPPS. 
Ss were then taken individually to the labo- 
ratory where they were given the opportunity 
to form a concept of the distance between 
two stimulus lights exposed in a dark room. 
Following this, E gave instructions calculated 
to influence S to change his concept of the 
distance, although the actual distance re- 
mained the same. 

For male Ss the hypothesis was confirmed 
for two of the four personality characteristics 
at less than the .05 level of confidence and 
for another at approximately the .05 level. 
Autonomy and Dominance correlated posi- 
tively with resistance to change and Abase- 
ment correlated negatively. The correlation 
of Deference and resistance to change was 
not significant but was in the predicted direc- 
tion for both men and women. The implica- 
tions of this finding for the validity of the 
EPPS and for certain interpersonal interac- 
tions, including psychotherapy and counsel- 
ing, were discussed briefly. 

The hypothesis was not confirmed for female
Ss. It was considered a possibility that
the differences in correlations for men and 
women were artifacts relating to the experi- 
mental situation. 

None of the 22 correlations for the EPPS
variables not included in the hypothesis was
significant at the required .01 level. However,
two of the variables had correlations which
were between the .05 and .01 levels and which
were corroborated by correlations in the same
direction for the other sex. Tentative interpretations
of the meaning of these variables
in relation to the experimentally evoked behavior
were offered.


CONSTRUCT VALIDITY OF THREE MASCULINITY-
FEMININITY TESTS


GORDON A. BARROWS 
Masculinity-Femininity (M-F) tests are
usually developed using items which differ-
entiate men from women. Obviously, the con-
struct validity of these tests cannot rest en-
tirely on this differentiation since much sim-
pler criteria are available for this purpose.
The construct validity of these tests must also
be based on relationships with other variables
within a sex group. Since there are several
widely used tests of M-F, they should be
highly intercorrelated if they are measuring
the same construct. These tests should cor-
relate with interest patterns which the society
defines as masculine or feminine. There is no
a priori reason to expect these tests to cor-
relate with general intelligence; however,
since men are usually found to be somewhat
superior on tests involving quantitative or
arithmetic ability, masculinity in men might
correlate with these tests.

On the basis of theory and clinical lore, one
might expect to find certain differences re-
lated to marital status variables. On the as-
sumption that a strong masculine motive im-
pels one toward the masculine role in mar-
riage and parenthood: (a) married men
might be expected to have higher masculinity
scores than single men, (b) men who marry
at a younger age might be expected to have
higher masculinity scores than men who marry
at an older age, (c) men who have more chil-
dren might be expected to have higher mas-
culinity scores than men who have fewer
children.

Psychoanalytic theory suggests that men
who marry women older than themselves are
looking for a “mother-substitute.” Such men
might be expected to be lower in masculinity

1 ... at Department of Psychology, Brooklyn
than men who marry women younger than
themselves, since the latter supposedly have 
resolved their “Oedipus Complex” through 
identification with their father. 

Jung (1939) has suggested that the femi- 
nine component in men (“anima”) becomes 
stronger as they age. This hypothesis is sup- 
ported by data of Terman and Miles (1936) 
who found a decline in masculinity scores in 
males after high school. One might expect 
somewhat similar results on the M-F tests 
used in this study. 

Most of the aforementioned hypotheses are 
highly speculative and negative evidence 
would not be crucial for either the theories 
or the validity of the tests. Positive evidence, 
however, would support the theories and test 
validity, particularly if more than one M-F 
test yields results in the expected direction. 


SUBJECTS 


The subjects (Ss) in this study were 2,296 
males employed at “white-collar” jobs in a 
large Canadian business institution who 
were administered a battery of tests in con- 
nection with a program of personnel assess- 
ment. This group was above the general Ca- 
nadian population in educational achieve- 
ment. One percent completed elementary 
school; 38% had one to three years of high 
school; 39% had four years of high school; 
19% had five years of high school; and 3% 
had one to four years of college. The age of 
the group ranged from 17 to 56 (see Table 4 
for the distribution).


TESTS 


The tests administered to the group in- 
cluded intelligence, aptitude, achievement, and 
personality tests. The analyses in this study
will concern only the following tests:

1. Masculinity-Femininity tests. (a) Guil-
ford-Zimmerman Temperament Survey (G-Z)
M-F scale, (b) Minnesota Multiphasic Per-
sonality Inventory (MMPI) M-F scale, (c)
the Strong Vocational M-F scale.

2. Intellectual Abilities. (a) Otis Quick- 
Scoring Mental Ability Test, (b) Watson- 
Glaser Critical Thinking Appraisal, (c) Thur- 
stone Mental Alertness Test, (d) Cooperative 
English Reading Comprehension Test, (e) 
Cardall Arithmetical Reasoning Test. 

3. Vocational Interest. The Kuder Prefer- 
ence Record—Vocational Form C. 


MARITAL STATUS VARIABLES 


The following marital status variables were 
analyzed in relation to M-F tests: (a) single 
vs. married status (excluding widowed and 
divorced Ss); (b) number of children; (c) 
wife’s age in relation to husband’s (older,
younger, or same); (d) age of S at marriage. 

One further variable analyzed in relation 
to M-F was the S’s age. 


RESULTS 
Correlations between Measures of M-F 


The correlations between the measures of 
M-F are contained in Table 1. In this table 
and all others giving correlations between the 
MMPI M-F test, the correlations involving 
the MMPI M-F are reversed in sign because 
this test is scored in the feminine direction 
while the G-Z and Strong M-F tests are 
scored in the masculine direction. All three 


TABLE 1

INTERCORRELATIONS AMONG GUILFORD-ZIMMERMAN,
MMPI, AND STRONG MASCULINITY-FEMININITY SCALES*
(N = 2296)

             MMPI      Strong
G-Z          .31       .34
MMPI                   [illegible]

* All three correlations are significant below the .001 level.
correlations are significant and of about the
same magnitude (.3). 

Correlations between the G-Z and Strong 
M-F scales and the other scales contained in 
these tests were available. These correlations 
are of some interest since they enable one to 
evaluate the validity of the correlation be- 
tween M-F scales using the multitrait-multi- 
method system suggested by Campbell and 
Fiske (1959). In this case, however, there is 
only one trait being evaluated, M-F. These 
authors list several requirements for validity: 

1. The validity correlation between two
variables using different techniques to meas-
ure the same trait should be significantly dif-
ferent from zero and large enough to encour-
age further examination of validity. The cor-
relation of .34 between the two M-F scales
was highly significant but its magnitude is
questionable. Considering the large N in-
volved in the correlation there is some justifi-
cation in proceeding to the next criterion.

2. The validity correlation should be higher 
than the correlations between the variables 
and other variables that have neither con- 
struct nor method in common. The .34 correla- 
tion between the M-F scales is higher than 
the correlations of the Strong M-F with 9 
other G-Z scales and higher than the correla- 
tions of the G-Z M-F with 10 other Strong 
scales. The data satisfied the second criterion. 

3. The validity correlation should be higher
than correlations between the variables and
other variables which are designed to get at
different traits but happen to use the same
method. The .34 correlation between M-F
scales is lower than ... correlations between
G-Z M-F ... G-Z M-F correlated .35 with the
Objectivity scale. The validity correlation is
lower than 2 of the 10 correlations between
Strong M-F and the other Strong scales.
Strong M-F correlated .55 with the Production
Manager scale and .43 with the Certified
Public Accountant, Senior scale. The data do
not satisfy the last criterion although ...
correlations indicating ... be noted that the
scales ... developed from the ... that
correlations between ... may be a function of
item overlap.

TABLE 2

CORRELATION BETWEEN MEASURES OF MENTAL
ABILITY AND MEASURES OF MASCULINITY
(N = 2296)

Ability Tests                      G-Z           Strong        MMPI
Otis                               .21*          [illegible]   -.14
Watson-Glaser Critical Thinking    .20           .05           -.15*
Thurstone (Ling.)                  [illegible]   -.02          -.22*
Thurstone (Quant.)                 .17*          .12           -.04
Thurstone (Total)                  .23           [illegible]   [illegible]
Coop. English Reading Comp.        [illegible]   -.03          -.26*
Cardall Arith. Reasoning           .21*          [illegible]   -.07*

* Significant at or below .001 level.


Correlations of M-F with Measures of Ability


The G-Z M-F correlates positively and
significantly with all measures of ability, ver-
bal and quantitative (see Table 2). The
Strong M-F correlates positively and signifi-
cantly (at the conservative .001 level)
with the Otis, the Thurstone Quantitative
subtest, and the Cardall Arithmetical Reason-
ing test. Unlike the previous two tests, mas-
culinity on the MMPI correlates negatively
with all ability measures and all the correla-
tions are significant except the correlation
with the Thurstone Quantitative subtest.

Correlations of M-F with Vocational Interests


Correlations of the three measures of M-F
with the scales of the Kuder interest scales
can be seen in Table 3. The general pattern
of the correlations is nearly the same on all


TABLE 3

CORRELATION BETWEEN MEASURES OF INTEREST
AND MEASURES OF MASCULINITY
(N = 2296)

Kuder Interest Scales        G-Z        Strong        MMPI

[Table values illegible in source.]

* Significant at or below .001 level.

TABLE 4

AGE AND MASCULINITY-FEMININITY

                    G-Z             Strong          MMPI
Ages      N         (Masculinity)   (Masculinity)   (Femininity)
17-26     1315      4.84            45.3            23.76
27-36     383       5.07            45.1            24.45
37-46     468       4.87            43.7            25.22
47-56     128       4.82            43.8            24.38

F                   1.60            3.99            11.17
p                   ns              .01             .001

three measures. Mechanical, Computational, 


and Scientific interests correlate positively 
with masculinity; while Artistic, Literary, 
Musical, and Clerical interests tend to cor- 
relate negatively with masculinity. Outdoor 
interests correlate positively with masculinity 
on the G-Z and Strong scales but negatively 
with masculinity on the MMPI. 


Age and M-F 

Ss were grouped in four age groups as 
listed in Table 4. The M-F scores in these 
groups were compared using single-classifica- 
tion analyses of variance. Significant F ratios 
between age groups were found on the Strong 
and MMPI M-F scales. Looking at the group 
means on the Strong, it is apparent that the 
two younger groups (ages 17-36) are higher 
in masculinity than the two older groups 

(ages 37-56). The relationship is less clear
on the MMPI where femininity scores in-
crease from the youngest to the 37-46 age
group and then decrease in the 47-56 age
group.

TABLE 5

DIFFERENCES BETWEEN SINGLE AND MARRIED
MEN ON MASCULINITY-FEMININITY SCALES

                       G-Z             Strong          MMPI
                       (Masculinity)   (Masculinity)   (Femininity)
Single (N = 1039)
  M                    4.68            44.70           23.86
  SD                   1.90            10.00           5.46
Married (N = 1242)
  M                    5.06            44.90           24.45
  SD                   1.81            8.70            4.40
CR                     4.90            .50             2.81
p                      .001            ns              .005

TABLE 6

AGE AT MARRIAGE AND MEASURES
OF MASCULINITY-FEMININITY

                         G-Z             Strong          MMPI
Age at Marriage    N     (Masculinity)   (Masculinity)   (Femininity)
Under 21           60    5.15            46.5            23.60
21-25              452   5.22            45.7            24.07
26-30              416   5.00            44.6            24.60
Over 31            313   4.91            44.1            24.99

F                        2.12            2.97            3.60
p                        ns              .05             .05


Marital Status and M-F 


The married group was significantly higher 
than the single group on masculinity as meas- 
ured by the G-Z, but they were higher on 
femininity as measured by the MMPI (see 
Table 5). 
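
The critical ratios in Table 5 can be approximated from the reported means, standard deviations, and Ns. The sketch below assumes the conventional formula for the difference between two independent means (the difference divided by the square root of the summed squared standard errors); the paper does not state which formula was used, and the function name `critical_ratio` is introduced here only for illustration.

```python
import math


def critical_ratio(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Difference between two independent means divided by its standard error.

    Assumed formula: (mean_b - mean_a) / sqrt(sd_a**2 / n_a + sd_b**2 / n_b).
    """
    standard_error = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_b - mean_a) / standard_error


# Means, SDs, and Ns as reported in Table 5 (single group first, married second).
N_SINGLE, N_MARRIED = 1039, 1242
cr_gz = critical_ratio(4.68, 1.90, N_SINGLE, 5.06, 1.81, N_MARRIED)
cr_strong = critical_ratio(44.70, 10.00, N_SINGLE, 44.90, 8.70, N_MARRIED)
cr_mmpi = critical_ratio(23.86, 5.46, N_SINGLE, 24.45, 4.40, N_MARRIED)

for scale, cr in (("G-Z", cr_gz), ("Strong", cr_strong), ("MMPI", cr_mmpi)):
    print(f"{scale}: CR = {cr:.2f}")
```

Under this assumption the computed ratios fall close to the tabled values of 4.90, .50, and 2.81; small discrepancies would be expected from rounding of the published means and standard deviations.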


Number of Children and M-F 


Married Ss were divided into five groups: 
no children, one child, two children, three 
children, and four or more children. Single 
classification analyses of variance did not 
yield significant F ratios on any of the three 
M-F scales. 


Wife’s Age in Relation to Husband’s and M-F 


A comparison of the three groups, wife 
older, wife younger, and wife same age, did 
not yield significant F ratios on any of the 
three M-F scales. 


Age-at-marriage and M-F 


The Ss were divided into four groups on 
the basis of their age at time of marriage 
(see Table 6). The F ratios between groups 
were significant (.05 level) for the Strong 
and MMPI M-F scales. On both scales, the 
relationship is similar: going from the younger
age at marriage to the older age at marriage,
one sees a decrease in mean masculinity scores
(Strong) and an increase in mean femininity
scores (MMPI).



Discussion 

The correlations between the three tests
which purport to measure the trait mascu-
linity-femininity are low, albeit highly sig-
nificant. Nance (1949), who intercorrelated
the same tests using a sample of male students
in a college of education, found two of the
correlations to be somewhat higher (G-Z and

Gordon A. Barrows and Marvin Zuckerman 


MMPI, .43; G-Z and Strong, .28; Strong and 
MMPI, .51). Considering the size of our sam- 
ple the reliability of the obtained correlations 
is high. A question does arise about possible 
national, vocational, and educational differ- 
ences between populations. Considering only 
the results obtained on the population sam- 
pled in this study, it is apparent that al-
though the three tests have some communal-
ity, the major part of their variance is not
accounted for by the common factor. This is
apparent in other aspects of the results, i.e.,
the finding that the G-Z and MMPI M-F
tests give opposite results in correlations with
ability measures and in comparisons of mar-
ried and single groups.
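
The point that correlations of this size can be highly significant yet leave most of the variance unshared can be illustrated numerically. The sketch below is an illustration under stated assumptions: it uses the two legible coefficients from Table 1 (.31 and .34) with N = 2296, takes r squared as the proportion of shared variance, and uses the standard t statistic for a product-moment correlation. The helper names are hypothetical.

```python
import math

N = 2296  # sample size reported for Table 1


def shared_variance(r):
    """Proportion of variance two measures have in common (r squared)."""
    return r * r


def t_for_correlation(r, n):
    """Standard t statistic for testing a product-moment r against zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


for pair, r in (("G-Z with MMPI", 0.31), ("G-Z with Strong", 0.34)):
    print(f"{pair}: r = {r:.2f}, "
          f"shared variance = {shared_variance(r):.3f}, "
          f"t = {t_for_correlation(r, N):.1f}")
```

With roughly a tenth of the variance in common, each pair of scales is reliably related but far from interchangeable, which is the sense of the authors' remark about the common factor.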

Application of the Campbell and Fiske cri- 
teria to the Strong and G-Z tests indicated 
weak construct validity for M-F as meas- 
ured by these tests. Considering the correla- 
tions with vocational interests it is apparent 


that all the tests relate in small but signifi-
cant degree to masculine and feminine interest
patterns. A glance at the M-F tests reveals 
that a portion of their items are stated in 
terms of vocational interests. 

Masculinity on the G-Z is positively cor-
related with all measures of ability. The re-
lationships are no more pronounced on quanti-
tative than on verbal tests. Masculinity on
the Strong seems to be more highly related to
quantitative tests of ability in conformance
with the stated hypothesis. Abilities on the
MMPI are negatively correlated with mascu-
linity. The results are less pronounced on the
two quantitative tests. The results on the
MMPI can be compared with those of Good-
stein (1954) who found that male college
students score high on femininity relative to
the general population. A number of items on
the MMPI M-F score cultural interests as
feminine. An interest in poetry or the theater
is feminine while an interest in sports or
hunting is masculine. Similarly, a wish to be
a librarian or journalist is feminine while a
wish to be a soldier or contractor is mascu-
line. While an increased interest in certain
activities and vocations stemming from edu-
cation or intelligence may indicate greater
femininity (in the stereotype of the general
population) it is doubtful whether this is a
meaningful trait in the more educated.


Construct Validity of Three M-F Tests 


It is not clear why masculinity on the G-Z is posi-
tively related to measures of ability other
than quantitative ability. The G-Z often pairs
vocations or interests of equal intellectual
level in the items, e.g., “You would rather
... mathematics and science than literature
and music, T or F.” Perhaps forcing a choice
makes the more intelligent males subordinate
general cultural interests to specific vocational
interests which require high intelligence.
The only one of the marital status vari-
ables giving consistent results on at least two
tests was age-at-marriage. Those who mar-
ried when young had higher masculinity scores
on the Strong and MMPI than those who
married when older. As part of the larger
study a “success index” was computed for
every S in terms of average annual incre-
ments in salary over the period of employ-
ment with corrections for the change in the
value of the dollar. The mean success index
values for the four age-at-marriage groups
were as follows: under 21, 24.3; 21-25, 19.7;
26-30, 13.0; over 31, 9.1. Differences between
groups were statistically significant and in-
dicate that those men who married earlier in
life had been more successful in this organiza-
tion. In view of the association of age-at-mar-
riage with both the success index and Strong
and MMPI M-F, one might expect some re-
lationship between masculinity and success.
These correlations are low, .11 and .09, but
highly significant (p < .001). Evidence from
other tests related to the success index indi-
cated that the more active, less restrained
men had greater success. The aggressive, im-
pulsive aspect of masculinity may be related
to earlier marriage in highly masculine
men. These men may find it easier to ap-
proach and establish relations with members
of the opposite sex.

The evidence on the relation between age
and masculinity is ambiguous. Although the
Strong scale indicates a sharp drop in mascu-
linity scores after 36 years of age, the MMPI
indicates a drop in masculinity up to 46 and
then an increase in the last age group (47-
56).

It seems apparent to the investigators that
masculinity-femininity is not a clearly de-
fined construct. The tests used contain a mix-
ture of interests and emotional attitudes. In-
tratest factor analyses might help clarify the
construct that the tests purport to measure.


SUMMARY 


The Guilford-Zimmerman, Strong, and 
MMPI Masculinity-Femininity tests were 
given to 2,296 employees of a Canadian busi- 
ness institution along with tests of mental 
abilities and vocational interests. The rela- 
tionships between these tests, and the rela- 
tionships between masculinity and certain 
marital status and age variables were investi- 
gated. The correlations between the three 
M-F tests were low, but highly significant. 
Masculinity as measured by the G-Z and 
Strong tended to correlate positively with 
abilities, particularly quantitative ability. 
Masculinity on the MMPI correlated nega- 
tively with the ability tests, particularly the 
nonquantitative ones. All tests correlated sig- 
nificantly with interest scales. Mechanical, 
Scientific, and Computational interests cor- 
related positively with masculinity while Ar- 
tistic, Clerical, Musical, and Literary interests 
tended to correlate negatively with mascu- 
linity. Married Ss scored higher on mascu- 
linity on the G-Z, but single Ss were higher 
on the MMPI. Number of children and the 
relation of the wife’s age to the husband’s did 
not yield significant differences. Age-at-mar- 
riage yielded significant differences between 
four groups on the Strong and MMPI scales. 
Those married at younger ages tended to have 
higher masculinity scores. S’s age yielded sig- 
nificant differences between groups on the 
Strong and MMPI M-F scales with a tend- 
ency for masculinity to drop with increasing 
age, but the trends were not completely linear. 
AGGRESSION AND THE PICTURE-FRUSTRATION STUDY¹


J. KASWAN, 
University of California, Los Angeles 


M. WASMAN, AND LAWRENCE ZELIC FREEDMAN
Yale University 


The Rosenzweig Picture-Frustration Study 
(P-F) is widely used as a clinical and re- 
search measure of verbal aggression, although 
its validity has not been clearly established. 
Most validating studies have attempted to 
correlate a specific criterion of aggression 
with the P-F scoring term “extrapunitiveness” 
(E)—defined as the percentage of responses 
in which the individual turns his aggression 
outward, against the environment. For exam- 
ple, Fry (1949), Holzberg and Hahn (1952), 
Vane (1954), and Weinberg (1952) have all 
found that juvenile delinquents or adult 
criminal offenders fail to manifest more E on 
the P-F than nonantisocial subjects (Ss), and 
have therefore concluded that the P-F is not 
valid. That such studies may represent too 
narrow an approach to validation is indicated 
by the APA Committee on Test Standards 
(1954, pp. 14-15) who note that test validity 
in the case of most clinical instruments must be 
evaluated by integrating evidence from many 
different sources and that often no single cri- 
terion measure or composite criterion can be 
identified. Thus aggression may occur in 
many different degrees, in many different 
situations, and in many different forms. Con- 
versely, similar degrees of aggression could be 
expressed in different ways. It therefore may 
not be reasonable to expect that any particu- 
lar measure of aggression which is arbitrarily 
selected as the single and specific criterion 


1 A version of this paper was presented at the
Eastern Psychological Association Meeting at Phila- 
delphia, April 12, 1958. 

This study was carried out as part of a research 
project on nonconformist behavior conducted within 
the Department of Psychiatry, Yale University, di- 
rected by Lawrence Z. Freedman, and supported by 
the State of Connecticut and Foundations Fund for 
Research in Psychiatry. The orientation of the paper 
does not necessarily reflect the theoretical frame- 
work of the over-all study. 
out of the total spectrum of aggressive be-
havior should relate to P-F performance. 

This paper represents an attempt to in- 
vestigate the validity of the P-F by relating 
the quantity and intensity of aggressive re- 
sponses obtained on it to aggression as de- 
scribed by various other techniques. We as- 
sume that if the P-F does refer to aggression 
it should relate to a substantial proportion 
of the other measures of aggression. 

A measure of the intensity of aggressive re-
action to frustration was designed in the be-
lief that mild verbal aggression, such as “I
don’t agree,” might characterize a different
reactive process to frustration than that re-
flected by responses containing references to
violent, uncontrolled aggression, such as “I’ll
break your neck.” The failure of the present
P-F scoring system to take into account such
variations in intensity has been pointed out
by Holzberg and Hahn (1952). This addi-
tional measure is expected to broaden the
range of relationships between the P-F and
other measures of aggression.

The investigation utilizes part of the data
collected in an interdisciplinary study of anti-
social offenders classified as sexual, aggres-
sive, or acquisitive. Psychiatrists, psycholo-
gists, sociologists, and to a limited degree,
lawyers, cooperated in obtaining data by the
techniques appropriate to each discipline in
such a way as to allow comparable statistical
and other analyses. A great deal of informa-
tion was thus compiled for each individual,
providing possibilities for describing and re-
lating behavior in a variety of contexts.


PROCEDURE
Subjects 


The sample tested consisted of 121 male
prison inmates selected so as to match the
prison population on eleven variables, including age,
education, and marital status. In terms of general
characteristics the sample may be described as fol-
lows: 41 sexual offenders, 40 aggressive offenders, 40
acquisitive offenders; 72% white, 28% Negro; me-
dian age 36.4 years; 55% did not have any educa-
tion beyond the 8th grade, 41% had some high
school, 4% some college; median IQ was 96.2; me-
dian time in prison to date of testing was 2.2 years.
Ss were assured that participation or nonpartici-
pation could have no possible effect on their prison
status. Cooperation was voluntary. Approximately
20% of those called refused psychological testing.
On the basis of the freedom of expression observed
during interviews there is reason to believe that
most Ss accepted the fact that the study had no
connection with the prison authorities and Ss were
not usually on “best behavior.”



Picture-Frustration Study 


The P-F was administered individually or in
groups of no more than five Ss so that the testing
performance could be readily observed. All the P-F
protocols were scored according to Rosenzweig’s Re-
vised Scoring Manual (Rosenzweig, 1950) and with-
out knowledge of the S’s identity or his performance
on other measures. The final scores yielded the per-
centage of responses falling within each type and
direction of aggression provided for by the scoring
system. Two scorers obtained 81% agreement on the
scoring of responses.

In addition to analyzing protocols in terms of the
usual P-F scoring system, responses scored as extra-
punitive were rated along a scale designed to reflect
the intensity of aggressive reaction to frustration
(IE). Each response was assigned a rating according
to the following three-point scale:

1. Mild IE: Responses which contain bland, in-
direct aggression within the limits of socially ac-
ceptable behavior, such as “I don’t think so,” or
“I wish she would remove her hat.”

2. Moderate IE: Responses which indicate sar-
casm, strong displeasure, direct blame, or insist-
ence that the other person do something about the
situation, such as “How about next year,” in re-
sponse to an apology for breaking an appoint-
ment.

3. [illegible] IE: Responses which contain direct,
verbal aggression so that social prohibitions
are weakened, as in violent expressions of emotion
directed toward the frustrating person or situa-
tion. Swearing and other abusive language, includ-
ing threats of physical harm such as, “I’ll knock
your head off,” frequently occurred in such re-
sponses.

The IE scores referred to in the analysis of the
data are median scores. The percentage of agree-
ment on the exact rating of individual responses for
... was 83.8% and the coefficient of ...
median scores was ...

...punitive responses were also rated along an in-
tensity scale but the restricted range of scores pre-
cluded the development of a measure which could
be utilized for further statistical analysis.
Aggression Measures²


In keeping with the conception of aggression as a 
heterogeneous pattern of behavior it seemed neces- 
sary to select measures designed to tap aggressive 
content in as broad a range of behavior as possible. 
Accordingly the P-F measures were related to data 
pertinent to aggressive behavior derived from the 
Rorschach, a psychiatric schedule, a specially de- 
vised Attitude Scale, and the prison record. The P-F 
was chosen because it seemed appropriate to evalu- 
ate the individual’s manner of handling mild verbal 
frustration and provided a standardized system for 
the evaluation of aggressive responses to such situa- 
tions. The psychiatric interview was intended largely 
as an exploration in the individual's personal history, 
including various aspects of his aggressive behavior 
as he saw it. The Attitude Scale was designed to 
permit the individual to express his attitudes about 
family members and his own feelings without such 
responses being filtered through an interpreter, such 
as an interviewer. The Rorschach was selected as an 
instrument presumed to minimize conscious control 
over the meaning of responses which might reflect 
aggressive feelings the S would not or could not ex- 
press in response to the other techniques. The prison 
record provided some indication for the degree of 


antisocial behavior. 


Rorschach 


All Rorschachs were scored according to Klopfer’s 
method, except for form level, where Beck’s criteria 
were used. Records were evaluated with the help of 
a manual devised to analyze the following dimen-
sions: degree and control of aggression, passive de-
pendency, degree and control of sexual disturbance,
degree of thought disturbance, principal defenses
used, and diagnostic classification. Degree was rated 
from little (1) to extreme (3); control from good 
(1) to very poor (4). Signs for each dimension con- 
tained in the manual were obtained from the text- 
books of Klopfer (1954), Phillips and Smith (1953), 
Rapaport (1945) and Schafer (1954), where two 
or more books agreed on their interpretation, as well 
as Elizur’s (1949) hostility signs. The following ex-
cerpt of the description of the procedure given to
the raters will illustrate the way in which these
signs were used:

Very high frequencies of signs on a particular
variable (e.g., aggression) will almost always indi-
cate serious problems in that area, but a similar
degree of concern may also be reflected in one or
very few responses of extreme intensity. Thus
many aggressive symbols such as knives and spears, 
or one response, in an otherwise bland record, of 

torn flesh and a lot of spattered blood, might both 
indicate a high degree of aggressivity. . . . Such 
evaluation requires much clinical experience, and 
the latter will be the principal tool of evaluation 
of the Rorschachs in this study. 


2 A more detailed description of these measures,
including scoring criteria, may be obtained from the
senior author.
It is recognized that this approach is quite intui-
tive but the authors are not aware of any other
method which can successfully deal with the Ror-
schach as a clinical instrument designed to tap
unconscious feelings.

Space does not permit the listing of the signs used
as indicators of all the dimensions, but the signs
used in the evaluation of degree of aggressivity may
serve as an example: Frequency of Elizur’s (1949)
hostility signs; blood; splattered fluids (e.g., paint);
high visceral anatomy; fire; high FM with low M
(example: rate “3” if more than 20% FM with no
or one M where content aggression is moderate);
very critical of cards; rejects (Card IX reject least
important). As a reliability check, Rorschachs were
rated according to the above method by another
judge. Exact agreement varied from 40% to 75%
on three-point scales, ... Ns in all categories ...
the variables here reported are shown in Table 2.
Two-step disagreements constitute 9.8% of all com-
parisons. Chi square ... over-all comparisons ...
suggests better than chance ... percentages do not
give ... purposes. The Rorschach ratings are never-
theless reported here because it was found that,
with very few exceptions, ... by the author
correlated with other ...


Attitude Scale 


Adopted from a Procedure suggested by Toman 
(1955), a 205-item Thurstone-type scale was de- 
vised which required agreement or disagreement with 
statements expressing attitudes toward important ob- 
jects and persons. The statements given to Ss were 
arranged in a random sequence and Ss were required 
to circle “Yes” or “No” for each statement. The 
scale was analyzed in terms of 45 five-item continua, 
ranging from maximum approval (Statement 1) to 
maximum disapproval (Statement 5) of a Person or 
attitude. The following continuum of attitudes to- 
ward the father, for which data are reported in 


Table 2, will serve as an illustration of the instru- 
ment: 


1. I loved my father best of all.

2. I liked my father.

3. I often liked my father, but sometimes I got a
little fed up with him.

4. I often can’t help having a feeling of dislike
for my father.

5. It should be different, but I can’t help hating
my father.


3 Jacob Goldstein, New School for Social Research,
New York City.


J. Kaswan, M. Wasman, and L. Z. Freedman 


Agreement with Statements 1 or 2, or 1 and 2, or
1 and 3, or 2 and 3, or 1, 2, and 3 was scored as
positive. Corresponding combinations of agreement
with Statements 3, 4, and 5 were scored as negative.
Any combination of agreements with statements on
both sides of Statement 3 was scored as ambivalent,
as was also agreement with Statement 3 alone. Nine-
teen percent of all responses were ambivalent. Re-
sponse reliability was checked by a three-item lie
scale (e.g., I have never had a headache) and by the
repetition of three statements. Seventy-six percent
of Ss disagreed with all three lie statements and
12% gave different responses on repeated statements.
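
The scoring rule for a single five-item continuum can be written out as a short function. This is a sketch of the rule as described above, not the authors' scoring procedure; the function name `score_continuum` is hypothetical, and returning `None` when no statement is endorsed is an assumption, since the text does not say how such cases were handled.

```python
def score_continuum(agreed):
    """Classify one five-statement continuum from the statements endorsed.

    `agreed` is an iterable of statement numbers (1 = maximum approval,
    5 = maximum disapproval) that the S marked "Yes".
    """
    agreed = set(agreed)
    if not agreed <= {1, 2, 3, 4, 5}:
        raise ValueError("statement numbers must lie between 1 and 5")
    if not agreed:
        return None  # assumption: the text gives no rule for this case

    approves = bool(agreed & {1, 2})      # agreement below Statement 3
    disapproves = bool(agreed & {4, 5})   # agreement above Statement 3

    if approves and disapproves:
        return "ambivalent"  # agreement on both sides of Statement 3
    if approves:
        return "positive"    # 1, 2, 1+2, 1+3, 2+3, or 1+2+3
    if disapproves:
        return "negative"    # the corresponding combinations of 3, 4, and 5
    return "ambivalent"      # Statement 3 alone


print(score_continuum([1, 3]))     # positive
print(score_continuum([3]))        # ambivalent
print(score_continuum([2, 4]))     # ambivalent
print(score_continuum([3, 4, 5]))  # negative
```

Treating Statement 3 as neutral when it accompanies statements from one side only, and as ambivalent when it stands alone, reproduces every combination listed in the text.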

Scale reliability was checked by independent sort-
ing of all items by a naive rater into graded groups
of five statements. The rater differed from a priori
scaling in 3.6% of cases.


Psychiatric Schedule 


The psychiatrist followed a prepared schedule
which required that specific information be recorded
in such a way as to be easily codable. All but 20 of
the psychiatric interviews were conducted by one
psychiatrist. All information was recorded during
the interview. Coding criteria were specific enough
to permit coding by clerical personnel. As a reli-
ability check 1,000 randomly selected items, chosen
from all psychiatric schedules, were recorded by an
inexperienced coder. She achieved 83% exact agree-
ment with the previous coder.
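
Exact agreement of the kind reported here is simply the percentage of items to which two coders assign the identical code. A minimal sketch follows; the function name and the six example codes are hypothetical and are not data from the study.

```python
def percent_exact_agreement(codes_a, codes_b):
    """Percentage of items on which two coders gave the identical code."""
    if len(codes_a) != len(codes_b):
        raise ValueError("both coders must code the same items")
    if not codes_a:
        raise ValueError("at least one item is required")
    matches = sum(a == b for a, b in zip(codes_a, codes_b))
    return 100.0 * matches / len(codes_a)


# Hypothetical codes for six schedule items (not data from the study).
first_coder = ["0", "1", "4", "X", "2", "Y"]
second_coder = ["0", "2", "4", "X", "2", "Y"]

agreement = percent_exact_agreement(first_coder, second_coder)
print(f"exact agreement: {agreement:.1f}%")
```

Exact agreement makes no allowance for agreement expected by chance, so it tends to look more favorable when a few codes account for most of the items.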

The following illustrates the criteria used in cod- 
ing answers to questions about the frequency of 
aggressive behavior between S and the younger male 
sibling (see Table 2 for data). 

0—no fights
1—infrequent fights, less than 10 instances in all
2—sometimes, but not more than once in three [...]
3—[...] as once per week o[...]
4—once per week or more, but less than once a day
5—once per day or more
X—no such individual
Y—no answer


Questions were not asked in a set way but the
interviewer attempted to [...] “How about your [...]
and then would ask more spe[...]


Prison Record 


The last three items in Table 2 were taken from
the prison record. History of serious antisocial
aggression referred to previous felony convictions
(assault). Present offense classification and the activity
for which convicted (e.g., murder) were judged from
a police description of the behavior during the
offense. Thus the inactive accomplice to murder was
not classified either “aggressive” or “murderer” even
though he may have been convicted for murder.

4 Hugh Storrow, now in the Department of Psychiatry,
University of California, Los Angeles.

Twenty-two variables comprising all the data
bearing on aggressive behavior of sufficient numerical
range for statistical analysis were abstracted from
these techniques and related to P-F performance by
chi square (see Table 2). The distributions of E and
IE scores were split into high, moderate, and low
groups. Statistical requirements of chi square
(Cochran, 1954) necessitated the collapsing of some
contingency tables and in all such cases the upper
extremes (high scores) were contrasted with the
combined moderate and low scores.
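
The collapsing step can be illustrated with a short sketch. The counts below are hypothetical and are not data from the study; the function names are likewise illustrative. The sketch computes an uncorrected Pearson chi square and omits the Yates correction that Table 2 reports for two by two tables.

```python
def chi_square(table):
    """Pearson chi square and degrees of freedom for observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    stat = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            stat += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return stat, df


def collapse_to_high_vs_rest(table):
    """Merge the moderate and low rows; rows are ordered high, moderate, low."""
    high, moderate, low = table
    return [high, [m + l for m, l in zip(moderate, low)]]


# Hypothetical counts: rows = high / moderate / low E,
# columns = three categories of some criterion variable.
counts = [[20, 10, 10], [12, 14, 14], [8, 16, 17]]
full_stat, full_df = chi_square(counts)
collapsed_stat, collapsed_df = chi_square(collapse_to_high_vs_rest(counts))
```

Collapsing raises the expected frequency in each remaining cell at the cost of degrees of freedom (here from 4 to 2), which is the trade-off the expected-frequency requirement forces.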


RESULTS 


Table 1 indicates that the mean extrapunitive,
intropunitive, and impunitive scores for
our sample closely matched the revised norms
reported by Rosenzweig (1950). The scores
for our sample, however, appear to be somewhat
more variable on all three measures
than those obtained from his normative sample.
The mean for the intensity rating of
extrapunitiveness was 1.29 with a standard
deviation of .32 for a three-point scale,
indicating that most Ss tend to express a rather
mild intensity of aggression to this technique
as measured by our ratings.

The interrelationships of P-F measures were
determined by correlational and chi square
analyses and are reported in Table 2. E was
found to be significantly correlated with the
IE measure, so that frequency and intensity
of aggression appear positively associated. E
was inversely related to intropunitiveness and
impunitiveness as a result of the dependent
scoring system. However, the association of
both high E and high IE with both high ego
defensiveness and low need persistence scores
represents a relationship between independent
scoring categories. The latter measures
are two of Rosenzweig’s three “types of
Aggressive Reaction.” According to Rosenzweig,
Ss who express intense and/or frequent outward
aggression on the P-F tended to respond
in a manner “in which the ego of the subject
predominated” rather than in a manner in
which “the solution of the frustrating problem
is emphasized” (Rosenzweig, Fleming, &
Clarke, 1947, p. 166).

Twenty-three out of an over-all total of [...]
relationships tested by chi square were
significant at the .05 level or beyond, while
[...] others approached significance at the .10
level. In relating E and IE to measures of


TABLE 1

MEANS AND STANDARD DEVIATIONS FOR PICTURE-FRUSTRATION STUDY VARIABLES
(Rosenzweig’s Norms in Parentheses)

                                   Mean          Standard Deviation
Extrapunitiveness                  47.49 (45)    15.71 (13.3)
Intropunitiveness                  26.40 (28)    10.01 ( 8.3)
Impunitiveness                     26.26 (27)     9.73 ( 9.5)
Intensity of Extrapunitiveness      1.29           .315


aggression elicited by other techniques, 12 of 
22 relationships for E and 8 of 22 for IE 
were found to reach the .10 level or beyond. 
The total number of significant findings is 
thus well beyond the number expected on the 
basis of chance alone. 
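
A rough check of the chance-expectation argument can be made with a binomial tail. This is a sketch under a strong assumption that the text does not make: that the 22 tests are independent, which is unlikely to hold exactly because all criteria were measured on the same Ss. The function name `binomial_tail` is illustrative.

```python
from math import comb


def binomial_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


n_tests, alpha = 22, 0.10

# Number of tests expected to reach the .10 level by chance alone.
expected_by_chance = n_tests * alpha

# Probability of at least 12 (E) or at least 8 (IE) such results
# if every null hypothesis were true and the tests were independent.
tail_e = binomial_tail(n_tests, 12, alpha)
tail_ie = binomial_tail(n_tests, 8, alpha)
```

Under these assumptions about two of 22 tests would reach the .10 level by chance, so 12 and 8 are both far into the upper tail; dependence among the criteria would weaken, but probably not reverse, that conclusion.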

Chi squares significant at the .05 level or 
beyond revealed that E was associated with 
the S’s report of aggression and negative feel- 
ings in object relationships, as with fathers 
and wives; the perception of peers as hostile 
and threatening; a history of antisocial ag- 
gression; and murder in the present offense. 
Moreover, the data reflect a number of trends 
which fall between the .05 and the .10 level. 
High E tends to be associated with the psy- 
chiatrist’s rating of high verbal aggressiveness 
and, as judged from the Rorschach, poor con- 
trol of aggression, the use of projection as 
a defense mechanism, and the diagnosis of 
schizophrenia. The only finding which seems
inconsistent with the general direction of results
was a trend for high E Ss to report
infrequent fighting with peers.

The IE measure was found to correlate .48 
with E—a moderate but statistically signifi- 
cant degree of correlation. It is not surprising, 
therefore, that IE related to many of the same 
measures as E. Thus, like high E, high IE 
relates to aggression and negative feelings 
in object relationships, as with siblings and 
wives; to a history of antisocial aggression; 
and to murder in the present offense. In ad- 
dition, high IE was found to be significantly 
associated with poor control of aggressive im- 
pulses and the expression of intense affect on 
the Rorschach. 

Both E and IE were unrelated to the degree
or amount of aggression manifested on


TABLE 2

THE RELATIONSHIP OF EXTRAPUNITIVENESS (E) AND INTENSITY OF EXTRAPUNITIVENESS (IE)
TO AGGRESSION VARIABLES

Variable Description                                  df   E chi sq.  E sig.   IE chi sq.  IE sig.

P-F Variables
  High extrapunitiveness*                              4                            25.53      .01
  High extrapunitiveness intensity                     4       25.53     .01
  Low intropunitiveness                                4       65.35     .01        12.59      [?]
  Low impunitiveness                                   4       31.90     .01        10.32      [?]
  Low object dominance                                 4        7.80     .10         5.48
  High ego defensiveness                               4       37.10     .01        16.54      .01
  Low need persistence                                 4       24.97     .01        10.46      .04
Rorschach
  Degree of aggression (73%) c                         2        1.87                 1.24
  Inadequate control of aggression (65%)               2        5.17     .07         6.63      .04
  High degree of intense affect (62%)                  4        2.71                10.23      .05
  Use of projection as a defense (73%)                 1        2.89     .10          [?]
  Diagnosis of schizophrenia (65%)                     1        4.46     .04          [?]
Attitude Scale
  Reaction to frustration                              2         .98                 1.59
  Reaction to criticism                                2        2.04                  .32
  Perception of peers as hostile                       2        6.89     .04         2.85
  Negative feeling toward father                       2       14.81     .01         3.86
Psychiatric Schedule
  Subject’s rating of self-control over anger          1         .87                  .49
  Psychiatrist’s rating of subject’s aggressiveness    2         .58                 1.31
  Tendency to express aggression verbally              1        3.24     .08         1.23
    (psychiatrist’s rating of manner in which
    subject expresses aggression)
  Submissive behavior toward father (developmental)    2        2.92                11.37      .01
  Frequent aggressive behavior toward younger          1        1.35                 5.47      .02
    male siblings
  Infrequent fighting with peers (developmental)       1        3.56     .07          .59
  Frequent destruction of property (developmental)     1        3.28                 3.47      .07
  Fighting with peers (current)                        1         .31                  .64
  Frequent verbal aggression toward wife               1        2.78     .10         1.02
  Frequent physical aggression toward wife             2        6.97     .03         5.42      .07
  History of serious anti-social aggression            2        8.40     .01          [?]      .08
  Present offense classified as sexual,                4        1.62                 4.11
    aggressive, or acquisitive
  Murder in present offense                            2        4.68     .10         6.83      .04

a Chi square columns: Yates correction in two by two tables.
b Significance levels: two-tailed.
c Exact agreement of two raters on three-point scale.
* r = .48, significant at the .01 level.
[?] Entry illegible in the source.


Aggression and the Picture-Frustration Study 


the Rorschach, S’s report of his reaction to
frustration and criticism, S’s rating of his
self-control over anger, interviewer’s rating of
S’s aggressivity, current fighting with peers,
and present offense classified as sexual,
aggressive, or acquisitive.

Both amount and intensity of extrapunitiveness
were compared with age, race, religion,
rural-urban background, social class,
marital status, education, IQ, and length of
time in prison. Those Ss with high E tend
to be older (χ² = 9.86, df = 2, p = .01),
widowed or divorced (χ² = 3.71, df = 1, p = .06),
and to have an education beyond the
8th grade (χ² = 7.17, df = 2, p = .04). Ss
with high IE tend to come from an urban
background (χ² = 7.56, df = 2, p = .03).
None of the other comparisons yielded chi
squares approaching significance.

The relationships of intropunitiveness and
impunitiveness to other measures discussed
above were investigated but results have not
been presented since they were generally
found to be the inverse of those relationships
reported for extrapunitiveness by virtue of
the reciprocal nature of the scoring system.


DISCUSSION


This investigation has provided evidence
that the P-F relates to a variety of manifestations
of aggressive behavior as assessed by
psychiatric interview, Rorschach evaluation,
and attitude scale. The scoring [...]
related to aggression in terms of hostile,
aggressive interpersonal feelings and behavior,
and antisocial aggression with tendencies
toward the use of projection and inadequate
impulse control. The score for IE developed
in this study and the standard scoring
dimension of extrapunitiveness (E) show
considerable overlap, the former also relating to
the adequacy of impulse control and intensity of
affect on the Rorschach. Determination of the
value of the intensity measure in [...]
range of the P-F scoring system is a problem
for further research.

The results also indicate that higher education
and IQ are positively related to high
E and high IE. An additional analysis of the
data [...] not directly correlated with [...]
bearing on aggression so that no [...] was
made to control their effects statistically. One
could speculate that the P-F technique was
most suitable for eliciting aggressive responses
from Ss in our sample who are better able to
deal with verbal stimuli or to express
aggression verbally.

The P-F failed to relate to many aspects
of aggression, such as type of criminal offense,
reported behavior in response to frustration
and criticism, and aggressive behavior toward
peers. The measures which did not relate to
the P-F seem no less reasonable as criteria
than those which had a positive relationship
to it. The problem of evaluating validity in
terms of several relationships, some of which
are significant or nearly significant while
others are not, may be viewed in at least
two ways. One is to postulate that items
which relate positively to the P-F should
characterize some particular aspect of
aggression (e.g., level). Examination of Table 2
indicates that for our results such a
conceptualization would, at best, require
considerable and perhaps tenuous speculation,
inasmuch as diverse features of aggressive
behavior relate to extrapunitiveness. This failure to
find a cluster of conceptually meaningful
relationships may be the fault of the study, but
we would suggest that expecting such a cluster
may not be reasonable, which brings us
to the alternative interpretation.

Suppose, for instance, that there are two
Ss with the same high E P-F scores for whom
extensive personality evaluations, presumed
valid, are available. One of these Ss generally
expresses his aggression only verbally.
He is sarcastic, bitter, and superior but never
attacks anyone directly and has no history of
overt physical aggressive behavior. For the
other individual the high E score is but one
manifestation of generally very aggressive
behavior expressed along the entire spectrum of
this attribute. There would probably be no
relationship, or a negative one, between the P-F
and measures of overt aggression for the first
S and a positive relationship with almost any
measure of aggression for the second S. This
illustrates how the same form and intensity
of aggression in a particular situation (e.g.,
high E on the P-F test) may reflect different
types and degrees of aggressivity for different
individuals. To be sure, aggression shown on
one technique, such as the P-F, must be
reflected in some other measures of that
attribute if the technique is valid; however,
which measures will be found to correlate
would, from this point of view, be a problem
of sampling. That is, positive correlations
can be expected to the degree to which
the personality characteristics of different Ss
overlap in the expression of aggression. Since
even elaborate matching procedures are
unlikely to yield identical samples of Ss,
different samples can be expected to show different
interrelationships between a particular measure,
such as the P-F, and other measures
designed to reflect a range of that attribute
(e.g., Rorschach, attitude scale, other
measures of aggression).
Nevertheless, while two samples would
rarely yield exactly the same interrelationships,
it is possible that given a large number
of successive samples tested with the same
battery, the test [...] be found to cor[...]
criterion measu[...] While the patter[...]
make too much [...]ple (as in the [...])
that a concept[...] test-criteria relati[...]
[...]pects of aggre[...] others. The [...]
[...]gression. For this [...] reasonable test of
[...] given a pool of criterion measures
([...] attitude scale, Rorschach, etc.), [...]
positive relations to the measure tested (P-F)
exceed chance expectations.


SUMMARY

Extrapunitiveness [...] the Picture-Frustration
Study and an intensi[...] scale, a psy[...]
case history [...] study were 121 inmates



of a state prison selected so as to match the 
entire prison population. Of a total of 44 chi 
square comparisons between the Picture- 
Frustration Study and these measures, 10 
were significant at better than the .05 level 
of confidence, and another 10 between the .05 
and .10 level. These results were interpreted 
as indicating that the P-F has some relation 
to other measures of aggression. The P-F was 
not found to tap any particular level or as- 
pect of aggression, and it was suggested that 
repeated cross-validation would be required
before possible criterion validity could be
established.


A FACTOR ANALYSIS OF GOUGH’S CALIFORNIA
PSYCHOLOGICAL INVENTORY 1


JAMES V. MITCHELL, JR. AND JOHN PIERCE-JONES


University of Texas 


The California Psychological Inventory
(CPI) developed by Gough (1957) during
the past decade is a structured verbal inventory
designed to measure such personality
characteristics as dominance, tolerance,
re[...]ely, etc. These characteristics are
thought to be more important for comprehending
normal behavior than are the traits
sampled by instruments like the MMPI,
which appear to be most useful in describing
psychopathological conditions and maladjustment
syndromes (cf. Sloan & Pierce-Jones,
[...]). The 18 scales of the CPI are designed
to yield a presumably meaningful set of
scores which can provide a profile representing
the personality pattern of an individual.
Shaffer (1959) has written that “the CPI
appears to be a major achievement.” Cronbach
(1959), while eschewing Gough’s frankly
[a]theoretical approach to personality measurement,
has applauded Gough’s apparent technical
psychometric skill. Thorndike (1959),
however, has asserted that the scales of the
CPI “[...] provide a very redundant, inefficient,
and confused picture of individual personalities.
. . . Of the 18 scales, there are
only 4 that fail to correlate at least .50 with
some other scale.”


PROBLEM 


The present investigation was motivated,
in part, by our difficulty in comprehending
individual CPI profiles, a difficulty resulting
from the conditions so clearly indicated by
Thorndike’s comments mentioned above. We
were also puzzled by Gough’s grouping of
the 18 CPI scales into several named clusters
shown in the Manual (Gough, 1957), since
there appeared no clear empirical basis for
such clusters in the correlation matrices
provided. In consequence, the factor analytic
research reported in this paper was undertaken
to obtain the kind of evidence that would
help to shed light on the empirical justification
of the scales and scale groupings offered
by Gough.

1 This investigation was conducted with support
from the Mental Health in Teacher Education Project
at the University of Texas, a project which is
supported by a grant from the National Institute of
Mental Health.


PROCEDURE 


A total of 258 cases was employed in this investi- 
gation. This sample included 213 females and 45 
males who were enrolled for study in a teacher 
training curriculum. The subjects (Ss) generally had 
major fields of academic work in university subjects 
other than Education. 

A correlation matrix 2 was constructed which
consisted of the 153 CPI scale intercorrelations which
had been computed by the product-moment method
utilizing an IBM Model 650 electronic computer.
From this matrix four factors were extracted by
means of the centroid method (Guilford, 1954).
Application of Humphrey’s rule (Fruchter, 1954) and
Burt’s empirical formula (Thomson, 1951) indicated
that factorization should be terminated after
extraction of the fourth factor. Orthogonal rotations were
then performed by means of Kaiser’s (1958)
Varimax technique, the characteristics of which have
been discussed in recent papers by Comrey (1957a,
1957b, 1957c).


RESULTS 


The rotated factor matrix is shown in
Table 1. The four factors accounted for 26%,
15%, 7%, and 12% of the total variance,
respectively. Analysis of the factor loadings
resulted in the following factor descriptions.

2 A one-page table of the CPI scale intercorrelations
used in this study has been deposited with the
American Documentation Institute. Order Document
No. 6329 from ADI Auxiliary Publications Project,
Photoduplication Service, Library of Congress;
Washington 25, D. C., remitting in advance $1.25
for microfilm or $1.25 for photocopies. Make checks
payable to: Chief, Photoduplication Service, Library
of Congress.

James V. Mitchell, Jr. and John Pierce-Jones


TABLE 1

ROTATED FACTOR MATRIX FOR THE CALIFORNIA PSYCHOLOGICAL INVENTORY

CPI Scales                             I    II   III    IV    h²

Class I Measures of Poise, Ascendancy, and Self-Assurance
 1. Dominance                         18    76    04   -09   [?]
 2. Capacity for Status               21    59    04    50   [?]
 3. Sociability                       27    78    04    15   [?]
 4. Social Presence                  -01    62   -23    51   [?]
 5. Self-Acceptance                  -17   [?]    08    05   [?]
 6. Sense of Well-Being               79    16    05    23   [?]
Class II Measures of Socialization, Maturity, and Responsibility
 7. Responsibility                    58    09    44    21    58
 8. Socialization                     43    02    57   -06    51
 9. Self-Control                      92   -19    08   -01   [?]
10. Tolerance                         67    11    15    54    78
11. Good Impression                   83    08   -08   -06    71
12. Communality                       02    05    58   -02    34
Class III Measures of Achievement Potential and Intellectual Efficiency
13. Achievement by Conformance        80    25    23    08   [?]
14. Achievement by Independence       47    02    11    67    69
15. Intellectual Efficiency           46    24    16    53    58
Class IV Measures of Intellectual and Interest Modes
16. Psychological Mindedness          47    22   -16    32    40
17. Flexibility                      -12    02   -35    56    30
18. Femininity                       -02   -02    45   -04    21
Percentage of Total Variance         26%   15%    7%   12%

Note. Decimal points omitted before all factor loadings in table.
[?] Entry illegible in the source.


Factor I, probably the most important fac- 
tor of the present analysis, had its highest 
loadings for the CPI scales named Self Con- 
trol, Good Impression, Achievement via Con- 
formance, Sense of Well Being, Tolerance, 
and Responsibility. Provisionally, it appears 
that this factor might well be named Adjust- 
ment by Social Conformity. Factor II had 
five relatively high loadings, ranging from 
.59 to .78. The CPI scales named Dominance, 
Capacity for Status, Sociability, Social Pres- 
ence, and Self-Acceptance were the important 
ones involved in this factor, suggesting that 
it should be named Social Poise or, alternatively,
Extroversion. Factor III was not as
well defined as the previous factors, but it
had loadings above .40 for the CPI scales
labeled Responsibility, Communality,
Socialization, and Femininity. Gough’s (1957)
original descriptions of characteristics associated
with these scales strongly suggest the serious,
responsible, conscientious person, so we have
[...] to name this factor Super-Ego Strength.
Fac[...] factors identified in the present
analysis, [...] higher for CPI [...]
Intellectual Efficiency, Capacity for Status,
Flexibility, Social Presence [...] factor
Capacity for Independent Thought and Action.


Factor Analysis of CPI
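
The variance figures reported for the rotated factors can be checked against the loadings in Table 1. The sketch below assumes the usual definitions for orthogonal factors: the proportion of total variance for a factor is the sum of its squared loadings divided by the number of scales, and a scale's communality is the sum of its squared loadings across factors. The variable names are illustrative; the numbers are the legible Factor I column and the Responsibility row of Table 1.

```python
# Factor I loadings for the 18 CPI scales, in table order
# (decimal points restored by dividing by 100).
factor_i = [18, 21, 27, -1, -17, 79, 58, 43, 92,
            67, 83, 2, 80, 47, 46, 47, -12, -2]
loadings = [value / 100 for value in factor_i]

# Proportion of total variance: sum of squared loadings / number of scales.
sum_of_squares = sum(a * a for a in loadings)
percent_variance_i = 100 * sum_of_squares / len(loadings)

# Communality of one scale: sum of its squared loadings on all four factors.
responsibility = [0.58, 0.09, 0.44, 0.21]
communality_responsibility = sum(a * a for a in responsibility)
```

Rounded to whole percentages, these reproduce the 26% reported for Factor I and the entry of 58 in the final column for Responsibility, which supports reading that column as communalities.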


DISCUSSION 


Gough (1957) has classified the 18 CPI 
scales into the following four groups: 


Class I: Measures of Poise, Ascendancy, and Self-Assurance
Class II: Measures of Socialization, Maturity, and Responsibility
Class III: Measures of Achievement Potential and Intellectual Efficiency
Class IV: Measures of Intellectual and Interest Modes


The factor analytic results reported in this
paper lend support to four CPI scale classes
of somewhat different composition from
Gough’s, as can be inferred from the results
and factor descriptions already presented in
Table 1. It is interesting to notice, for example,
that our Factor II, Social Poise or
Extroversion, encompassed five of the six
scales included in Gough’s Class I. If Gough’s
four classes of scales are viewed as hypotheses
concerning factor structure, his first class
appears from our analysis to be a reasonable
one, except that we would be forced
to exclude the Well-Being scale from the
class. Our Factor III, here called Super-Ego
Strength, identified three of Gough’s Class II
measures and one scale, Femininity, from
Gough’s Class IV. Factor I identified in the
present analysis located five Class II scales,
three Class III scales, and one scale from
each of Classes I and IV when loadings [...]
[...] Capacity for Independent [...] had two
substantial [...] included by Gough in his
Class I, [...] representing a Class I[...]
measure, [...] loadings for scales included in
Class [...], [...] one loading representing a
measure [...] by Gough in his Class IV. If the
results cited in the present analysis are valid,
it seems clear that Gough’s Class IV is
unacceptable as a cluster of related scales, as
its constituent scales load on three [...]
factors. Similarly, Class II as [...] does not
seem to be empirically justified. Finally,
Gough’s Class [...] appears to be a mixture
of scales which, in the present analysis, have
their loadings on Factors [...] and III. If the
CPI continues to be scored
for 18 scales, and if a defensible means for
grouping the scales into classes is desired,
classification should probably be based upon
factor analytic findings such as those
presented in this report.

It seems quite apparent from the results of 
this research that the CPI cannot be re- 
garded with real justification as measuring 
the 18 relatively independent personality di- 
mensions that it is purported to measure. It 
is also true, judging by our results, that in- 
dividual personality profiles might well be 
based on only a few selected CPI scales. Per- 
haps, for example, the Self-Control scale can 
be regarded as virtually a pure measure of 
Factor I, Adjustment by Social Conformity, 
since it loaded .92 on that factor and was 
essentially independent of the three other 
factors. The Dominance, Sociability, and Self- 
Acceptance scales seem equally good meas- 
ures of Factor II, Social Poise, and they lead 
to no appreciable inferences regarding Fac- 
tors I, III, and IV. Factor III, Super-Ego 
Strength, is well estimated by the Commu- 
nality and Femininity scales, which appear 
to be independent of the three remaining fac- 
tors. The Flexibility scale occupies a similar 
position with regard to Factor IV, Capacity 
for Independent Thought and Action. In each 
case the decision to use a scale or scales to 
represent a factor would naturally be con- 
tingent upon reliability data as well as fac- 
torial composition. 

It is interesting to note that with the ex- 
ception of the Communality and Sociability 
scales, those scales which seem best to repre- 
sent the four factors identified in the pres- 
ent analysis lead to personality descriptions 
couched in long-used psychological terms. In 
this regard, it can be noted that Cronbach
(1959) has argued for such descriptions over
others couched in terms like “social presence,”
which apparently refer to social behavior
patterns often regarded as complicated
resultants of motivational dispositions,
abilities and skills, and situation-linked
stimulus conditions.

While the present analysis may have provided
some clarification of personality measurement
using the CPI, it should be acknowledged
that one limitation of this study exists
because factor identification and description
are partially dependent upon Gough’s original
selections of items for scales and upon
the verbal descriptions assigned to the scales.
But this fact in no way negates our earlier
conclusion that the 18 CPI scales represent
a much smaller number of personality dimensions.
Nor, indeed, does the fact that our
factor interpretations made use of the original
scale descriptions lessen the significance
of these interpretations, since a careful review
of the items included in the factors
tends to support the designations which we
have assigned to each.


SUMMARY 


A centroid factor analysis was carried out
with the 18 scales of the California Psychological
Inventory. The sample employed consisted
of 258 students enrolled in an introductory
course in educational psychology at
the University of Texas. Four factors were
[...] rotated by Kaiser’s Varimax [...]
factors identified were tentatively named:
I. Adjustment by Social Conformity,
II. Social Poise or Extroversion, III.
Super-Ego Strength [...] terms as “dominance,”
“self-acceptance,” and the like, rather than
in such complex social behavioral terms as,
for example, “social presence” and “capacity
for status.”


THE DEVELOPMENT OF AN AFFECT ADJECTIVE CHECK
LIST FOR THE MEASUREMENT OF ANXIETY


MARVIN ZUCKERMAN 1


Institute of Psychiatric Research, Indianapolis 


Most personality tests are designed to measure
relatively stable traits. Testees respond to
items asking how they “generally,” “often,”
“usually,” “seldom,” etc. behave. The time
referent is vague and subjects (Ss) may interpret
the questions as referring to the last
week, month, year, or their entire lifetime.
Many of the traits in which clinicians are
interested, particularly affective traits such as
hostility or anxiety, are assumed to show
large intra-individual variation from hour to
hour and from day to day. While the concept
of a “general level of anxiety” may be useful
for gross discrimination, there are many
occasions where one would like to measure
changes in anxiety over shorter periods of
time. Experiments where an attempt is made
to induce anxiety, or experiments on the effect
of the “tranquilizer” drugs are examples
of research where temporal ambiguity of the
usual anxiety questionnaires might make them
insensitive to change. Experiments of this kind
conducted at the Institute of Psychiatric Research
led the author to the development of
a test which could be given quickly, scored
[...], and adapted for varying time
[...]. 2 The adjective check list method seemed
nicely suited for the purposes discussed
above. Gough (1955) has used a check list
[...] for developing empirical scales of various
personality traits. Nowlis (1953) reported the
use of a check list in measuring changes in
[...] by drugs given to college students.

The purpose of the present study [...]
[...] empirically a scoring key for [...]
using a pool of adjectives with various
affective connotations, and to test the
reliability and validity of this anxiety score.

1 [...] at Department of Psychology, Brooklyn
College.
2 The idea for this test emerged out of a discussion
with John I. Nurnberger on this problem.

EMPIRICAL DEVELOPMENT OF A SCORING KEY


Adjectives with affective connotations were
collected from Gough’s and Nowlis’ lists and
from a thesaurus. Adjectives which were of
low frequency in the written language were
excluded so that Ss of less than average
intelligence could understand the items. The
final list of adjectives consists of 61 items. 3

The scoring key was derived from item
analyses in two studies. The first study by
Persky, Maroc, Conrad, and den Breeijen
(1959) compared a group of psychiatric patients
rated high in anxiety with a group of
normal controls rated low in anxiety on the
basis of a psychiatric interview. A preliminary
scoring key of 30 words was used, based
on an a priori selection of words with
“anxiety” connotations. This score yielded a highly
significant difference between the groups. An
item analysis was performed by comparing
the frequencies in each group checking, or not
checking, each of the adjectives. Twenty-four
of the words yielded significant differences
(p < .05). Twelve of these were “anxiety-plus”
words (checked more frequently by the
High Anxiety group) and 12 were “anxiety-minus”
words (checked more frequently by
the Low Anxiety group). Although the two
groups in this study were not ideally matched
on nonanxiety variables it is interesting that
all but one of the anxiety-plus words had been
previously selected as anxiety words on the
basis of their connotation and were among
those used in the a priori scoring.

The second study by Levitt, den Breeijen,
and Persky (in press) measured the effects of
a hypnotically induced anxiety state in normal
[...]. [...] on the Affect Adjective Check
List (AACL) based on the item analysis in
the preceding study showed a highly significant
rise during the anxiety condition. An
item analysis yielded 38 words showing significant
changes in checking frequency during
the anxiety condition. Twenty-one were
anxiety-plus words (increase in checking frequency
during anxiety condition) and 17
were anxiety-minus words (decrease in checking
frequency during anxiety condition).
The final scoring key includes 21 words
which proved to be significantly related to
anxiety in both of the aforementioned studies.
These words are listed below.

3 A copy of the check list may be secured by
writing to the author.


Anxiety-plus: afraid, desperate, fearful, frightened,
nervous, panicky, shaky, tense, terrified, upset,
worrying.

Anxiety-minus: calm, cheerful, contented, happy,
joyful, loving, pleasant, secure, steady, thoughtful.


Anxiety-plus words are scored 1 if checked,
and anxiety-minus words are scored 1 if not
checked. The possible range of scores is 0 to
21.
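
The scoring rule can be expressed compactly in code. The word lists are those given above; the function name `aacl_score` and the lower-casing of input are illustrative choices, not part of the published key.

```python
ANXIETY_PLUS = {
    "afraid", "desperate", "fearful", "frightened", "nervous", "panicky",
    "shaky", "tense", "terrified", "upset", "worrying",
}
ANXIETY_MINUS = {
    "calm", "cheerful", "contented", "happy", "joyful", "loving",
    "pleasant", "secure", "steady", "thoughtful",
}


def aacl_score(checked_words):
    """Return the anxiety score (0 to 21) for the adjectives a subject checked."""
    checked = {word.lower() for word in checked_words}
    plus_points = len(ANXIETY_PLUS & checked)     # 1 point per plus word checked
    minus_points = len(ANXIETY_MINUS - checked)   # 1 point per minus word NOT checked
    return plus_points + minus_points


# Adjectives outside the 21-word key (e.g., "tired") do not affect the score.
example = aacl_score(["tense", "nervous", "calm", "tired"])
```

One consequence of the key is that a subject who checks nothing receives 10 points, because every unchecked anxiety-minus word scores 1.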


RELIABILITY STUDY: COLLEGE GROUP I


Two versions of the AACL and the Taylor 
(1953) Manifest Anxiety Scale (MAS) were 
given to 50 students in two sections of an ele- 
mentary psychology class at the Purdue Uni- 
versity Extension School. The group consisted 
of 43 males and 7 females. The average age 
was 22.3 (SD = 4.6). The items in the two 
versions of the AACL were the same but one 
test asked the Ss to check words which de- 
scribed how they “generally feel” while the 
other test asked them to check items describ- 
ing how they felt “today.” On the latter test 
they were told that “today” was defined as 
beginning from the time they awoke that 
morning. One section received the General 
test first and the Today test second while the 
other section took them in the reversed order.
One week later the Ss took the two AACLs 
again and each section received them in an 
order reversed from the first session.

Tests for the effect of order on the General 
and Today tests were performed by compar-
ing each S’s scores on the occasions when he
took the tests first and when he took them 
second. The 50 Ss scored an average of .68 
point higher on the Today test when it was
taken before and .32 point higher on the Gen-
eral test when it followed the Today test, but
these differences were not significant (t’s =
1.01, .95). One can conclude that order had
no significant effect on the two AACL versions.

Two kinds of reliability were examined for 
the General and Today tests: internal con-
sistency on the first testing, and retest reli-
ability from the first to the second test. The
General test was expected to show reasonably 
high internal reliability and retest reliability 
since it was assumed that the General time 
set would cause Ss to describe a stable trait. 
The Today test was expected to show high 
internal reliability on a single testing but low 
reliability on retest. Since the Today test is
designed to measure day to day fluctuations
it is unlikely that it could be sensitive to these
fluctuations and still remain stable from week
to week.

The two measures of reliability for each of
the tests are given in Table 1. Internal reli-
ability is calculated using the Kuder-Richard-
son Formula 20. It can be seen that the ex-
pectations of the experimenter (E) are borne
out. The General test is reliable internally and
in retest, while the Today test is internally
reliable on a single testing but low in reli-
ability on a retest. This contrast in reliabili-
ties bears out the E’s assumptions about the
different nature of tests attempting to meas- 
ure stable and fluctuating traits. The actual 
correlation between the two versions of the 
AACL on the first testing was .43, indicating 
a significant, but only moderate, relationship 
between the two tests. 
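
For readers who want to compute the internal reliability themselves, Kuder-Richardson Formula 20 can be sketched as below. This is a minimal illustration, not the author's procedure: the function name `kr20`, the row-per-subject input layout, and the use of the population variance (dividing by N) for total scores are all assumptions.

```python
def kr20(item_matrix):
    """Kuder-Richardson Formula 20 for dichotomous (0/1) items.

    item_matrix: list of rows, one per subject, each a list of 0/1 item
    scores (hypothetical layout). Uses the population variance of totals.
    """
    n_subjects = len(item_matrix)
    n_items = len(item_matrix[0])

    totals = [sum(row) for row in item_matrix]
    mean_total = sum(totals) / n_subjects
    var_total = sum((t - mean_total) ** 2 for t in totals) / n_subjects
    if var_total == 0:
        raise ValueError("total scores have zero variance; KR-20 is undefined")

    # Sum of p*q over items, where p is the proportion scoring 1 on the item.
    sum_pq = 0.0
    for j in range(n_items):
        p = sum(row[j] for row in item_matrix) / n_subjects
        sum_pq += p * (1.0 - p)

    return (n_items / (n_items - 1)) * (1.0 - sum_pq / var_total)
```

For the AACL each row would hold the 21 scored items, with anxiety-minus items already reversed so that 1 always points toward anxiety.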


TABLE 1

RELIABILITIES OF GENERAL AND TODAY
VERSIONS OF AACL

             K-R 20a    Retest
General       .72        .63**
Today         .85**      .31*

a Significance of K-R 20 was established by the formula
... −1/N −1 × No. items −1.
* Significant below .05 level.
** Significant below .001 level.


VALIDITY STUDY: COLLEGE GROUP II


The purpose of this study was to see if the
Today AACL anxiety score would show an in-
crease when this test is given on the day of
an examination. “Examination Anxiety” is not 
usually as intense as the anxiety seen in clini- 
cal patients and it varies considerably be- 
tween individuals. However, if the AACL 
proved sensitive to Examination Anxiety it
would probably be sensitive to more intense 
forms of anxiety. 


Method 


The Ss were 35 college students in a section of ele-
mentary psychology. Twelve were females and 23
were males. The average age was 19.5 (SD = 2.7).
One student dropped the course and his data were
used only in comparisons involving the initial tests.
The students were given the General and Today
versions of the AACL, the MAS, and the Barron
Ego Strength scale (Barron, 1953) on the second
day of the class. They were given the Today AACL
at the beginning of every other class meeting except
those following test day meetings. The E was inter-
ested in comparing the nonexam day AACL scores
with the exam day AACL scores. The AACL was
given regularly at the same time so that the students
would not guess the connection between the AACL
and their examinations. At the end of the course
they were asked if any of them had guessed the con-
nection and none claimed to have seen the purpose
of the experiment. The AACL was not given at the
meetings after examinations because at these
times the students were waiting to hear their grades
and the E was not sure whether these occasions
should also be classified as “anxiety” situations.

There were a total of 10 nonexam days and 3 exam
days. All 34 students were present on the exam days.
Absences varied on the other days from one to three
students. No student was absent more than once on
the 13 days the AACL was given.


Results


Fig. 1. Changes in AACL anxiety scores during 13
class meetings. (Vertical axis: AACL anxiety score.)


TABLE 2

MEAN DIFFERENCES BETWEEN EXAM AND
NONEXAM DAY ANXIETY SCORES

                          MD (exam −
                          nonexam a)    SMD     t      p b
Exam I                      +2.91       .54    5.39   <.0005
Exam II                     +2.13       .76    2.80   <.005
Exam III                    +2.49       .69    3.61   <.001
M all exam days minus
  M all nonexam days        +2.41       .53    4.55   <.0005

a Mean on the three or four nonexam days subsequent to the
last exam and preceding the current exam, excluding days when
the results of exams are reported.
b One-tailed test.


Figure 1 shows the change in AACL anx-
iety scores during the course of the 13 class
meetings. In the case of absent Ss, the mean
score on the other 9 nonexam days was sub-
stituted for the missing entries in order to
calculate comparable group means for the dif- 
ferent sessions. The means of the group on 
nonexam days ranged from 6 to 8. The SD of 
the 10 nonexam day means was .77. On the
three exam days the AACL scores rose to 9 
or 10. Table 2 gives the mean differences be- 
tween: (a) AACL scores on each of the exam 
days and the means of these scores on the 
three or four nonexam days immediately pre- 
ceding each exam, (b) the mean anxiety score 
on all three exam days and the mean score on 
all 10 nonexam days. The AACL increases are 
significant for each of the exam days and the 
mean for all exam days is significantly higher 
than the mean for all nonexam days. The hy- 
pothesis that pre-examination anxiety will re- 
sult in increased anxiety scores on the AACL 
is supported by these results. 
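
The t values for these exam versus nonexam comparisons follow from dividing each mean difference by its standard error. The sketch below is a hedged illustration: the function names are hypothetical, and reading the second numeric column of Table 2 as the standard error of the mean difference is an assumption about a column header that is garbled in the scan. Under that reading, 2.91/.54, 2.13/.76, 2.49/.69, and 2.41/.53 reproduce the reported t values of 5.39, 2.80, 3.61, and 4.55.

```python
import math


def paired_t(differences):
    """Mean difference, its standard error, and t for paired scores.

    differences: one exam-minus-nonexam difference per subject
    (hypothetical input; the article reports only summary values).
    """
    n = len(differences)
    mean_d = sum(differences) / n
    var_d = sum((d - mean_d) ** 2 for d in differences) / (n - 1)  # sample variance
    se = math.sqrt(var_d / n)
    return mean_d, se, mean_d / se


def t_from_summary(mean_difference, standard_error):
    """t computed directly from a reported mean difference and standard error."""
    return mean_difference / standard_error
```

With df = N − 1, the resulting t is referred to a one-tailed table, as the table footnote indicates.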

After these results had been obtained, an- 
other hypothesis occurred to the E. It is pos- 
sible that the amount of anxiety elicited by 
an examination might be related to how well 
one expects to do on the examination. Al- 
though no measures had been made of the 
Ss’ anticipations prior to the examination,
their actual performance on the examination 
might be some measure of their preparation 
and consequent anticipation. The class was 
divided into thirds on the basis of their grades 
on each of the examinations. The mean 
changes on the AACL anxiety score of the 
High and Low Grade groups were then com- 
pared. The results of these tests are contained 
in Table 3. It can be seen that on all three 
examinations the Ss who made high grades 
TABLE 3

COMPARISONS OF MEAN DIFFERENCES BETWEEN EXAM
AND NONEXAM DAY ANXIETY SCORES IN HIGH
AND LOW GRADE GROUPS

                          High Grade      Low Grade
                          N a    MD       N     MD       t      p b
Exam I                    11    +1.57    11    +4.55    2.32   <.025
Exam II                   12    +0.10    11    +3.59    2.20   <.025
Exam III                  11    +1.45    11    +3.76    1.47   ns
M all exam days minus
  M all nonexam days      11    +0.75    12    +3.21    2.02   <.05

a Differences in N are because of ties in grade scores at the
cut-off point.
b One-tailed test.


showed less of an increase in anxiety scores 
than the Ss who made low grades. The dif- 
ferences are significant for the first two ex- 
aminations but not significant for the third 
examination. The class was then divided into 
thirds on the basis of their total percentage 
score for the entire term. The High and Low 
Grade students were compared on the differ- 
ences between the mean for all exam days 
and the mean for all nonexam days. This re- 
sult is also contained in Table 3. The differ- 
ence is significant at the .05 level. The cor- 
relation between the total percentage grade 
for the term and the mean exam minus non- 
exam day differences for the total group of 34 
was —.34, which was significant below the .05 
level. The results generally support the hy- 
pothesis that the students doing well on ex- 
aminations would show less increase in anx-
iety scores than the students doing poorly on
examinations. However, these results do not 
definitively support the idea that the student’s 
anticipation of success or failure is responsible 
for this difference. An alternate hypothesis 
might be that the pre-examination anxiety 
actually caused the poorer performance on 
the examination. In the next study of this 
question, the E hopes to obtain anticipations
before the examination, on the hypothesis 
that these will be more strongly related to the 


anxiety scores than the actual grades on the 
examinations. 
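
The significance of the correlation of −.34 between term grade and anxiety rise (N = 34) can be checked with the usual t test for a product-moment correlation. The sketch below is illustrative; the function name is an assumption and the article does not state which test was used.

```python
import math


def t_for_correlation(r, n):
    """t statistic (df = n - 2) for testing whether a Pearson r differs from zero."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


# Values reported in the text: r = -.34 for the total group of 34 students.
t_grade_anxiety = t_for_correlation(-0.34, 34)
```

The result is about −2.05 with 32 degrees of freedom. Its absolute value exceeds the one-tailed .05 critical value of roughly 1.69, which agrees with the statement that the correlation was significant below the .05 level.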


VALIDITY: RELATIONSHIPS WITH OTHER 
TESTS—GROUPS I AND II, AND
PREGNANT WOMEN GROUP 


Since it has been hypothesized that most 
questionnaires give the S a general time set it 
was expected that the MAS and other tests
would correlate significantly with the General 
version of the AACL but not with the Today 
version given on one particular day. However, 
it was expected that the MAS would correlate 
significantly with the mean of a number of 
AACL anxiety measures since this would pre- 
sumably be a better measure of the general 
anxiety level than a score on a single day. 
The MAS should correlate significantly with 
the Month version of the AACL given to the 
pregnant women since a month is a long 
enough period to consider a general time set. 


Method 


In each of the preceding college groups the MAS
was given to the Ss. In College Group II the Ss also
took the Barron Ego-Strength scale. A third group
of Ss was collected in another study on pregnancy.*
This study involved monthly interviews and tests
with a group of 51 pregnant women. The women
were mainly from a lower-class population. The av-
erage age was 21.1 (SD = 5.2). The average educa-
tion in years was 11.3 (SD = 21.1). The group con-
sisted of 41 Negro and 11 white Ss. The AACL used
in this study asked the Ss how they had been feeling
during the past month. At the time of the first inter-
view, the Ss were given the Month AACL, MAS, the
MMPI M-F scale, and three scales from the Paren-
tal Attitude Research Instrument (Schaefer & Bell,
1958), the latter constituting a factor called “Hos-
tility-Rejection” (Zuckerman, Ribback, Monashkin,
& Norton, 1958). At each subsequent interview the
Ss were again given the AACL and the MAS. The
number of interviews ranged from 1 to 6. A sub-
group of 10 Ss had taken the AACL and the MAS
during the sixth, seventh, eighth, and ninth months
of pregnancy.


Results 


The correlations between the MAS and the 
various versions of the AACL given to the 
three groups of Ss are presented in Table 4.
In the College Group I the MAS correlated
significantly with the Today AACL (r = .29)
and did not correlate significantly with the
General AACL (r = .18). The direction of
the difference in this group was contrary to
expectation but the difference between these
correlations was not significant (t = .78). In
the College Group II the MAS correlated sig-
nificantly with the General (r = .58) and did
not correlate significantly with the Today
AACL (r = .32), according to expectation.


* This study is still in the data analysis stage.
The correlation between the MAS and Gen-
eral AACL was significantly higher than the
correlation between the MAS and Today
AACL (t = 2.07, p < .05). The differences
in results between these two similar samples
are difficult to explain. The MAS correlated
significantly with the mean of 10 Today
AACL scores (r = .52), as expected. It did
not correlate significantly with the mean of
the AACL scores on the three examination
days (r = .28). The difference between these
latter two correlations was not significant
(t = .63). However, the correlation between
MAS and AACL on the first examination day
was .40 which was significant below the .05
level. The correlation with AACL on the sec-
ond exam day dropped to .09. The correla-
tion for the third exam day was .19. Both of
the latter correlations were not significant.
The differences between the correlation on
the first exam day and the correlations on the
second and third exam days were not signifi-
cant (t’s = 1.75, 1.26).
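
Comparisons of this kind involve two correlations that share one variable (the MAS) and come from the same Ss. The article does not say which test was used; one common choice is Hotelling's t for dependent correlations, sketched below with hypothetical names. With the College Group I values (.29, .18, an inter-form correlation of .43, and N = 50) this formula gives about .74, close to but not identical with the reported .78, so it should not be read as a reconstruction of the author's computation.

```python
import math


def hotelling_t(r12, r13, r23, n):
    """Hotelling's t (df = n - 3) for two dependent correlations.

    r12, r13: correlations of the shared variable with each of the other two.
    r23: correlation between the two non-shared variables.
    """
    # Determinant of the 3 x 3 correlation matrix.
    det = 1.0 - r12 ** 2 - r13 ** 2 - r23 ** 2 + 2.0 * r12 * r13 * r23
    return (r12 - r13) * math.sqrt((n - 3) * (1.0 + r23) / (2.0 * det))
```

The test needs the correlation between the two AACL forms as well as each form's correlation with the MAS, which is why the inter-form value enters the formula.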
The correlations of MAS with AACL for
the group of pregnant women are also con-
tained in Table 4. The correlation between
the tests on the first occasion when they were
both given (N = 51) was .65, significant be-
low the .001 level. The more General time set
of a Month resulted in a significant correla-
tion with MAS, as expected. The relationship


TABLE 4

CORRELATIONS BETWEEN AACL ANXIETY
SCORES AND MAS

                                         Time Set
Group                          N     Today   General   Month
College I                      50    .29*    .18
College II: first testing      32a   .32     .58**
  M nonexam days                     .52**
  M exam days                        .28b
Pregnant: first testing        51                      .65**
Pregnant: Months 6-9           10
  Month 6                                              ...
  Month 7                                              ...
  Month 8                                              ...
  Month 9                                              .38

a Two Ss did not take the MAS.
b Exam I r = .40*, Exam II r = .09, Exam III r = .19.
* Significant at or below .05.
** Significant at or below .01.
with MAS as a function of the particular
month of pregnancy was measured in the 10 
Ss who had taken both tests during the sixth 
to ninth months. The average MAS scores 
showed a significant drop during this period 
while the AACL anxiety scores showed no 
trend. It can be seen that the correlations be- 
tween the two tests are very high during the 
sixth, seventh, and eighth months but drop to 
a nonsignificant .38 during the ninth month. 
The difference between the correlation of the 
two tests in the eighth month and in the ninth 
month was not significant (t = 1.77).

In the College Group II, the Barron Ego- 
Strength scale correlated significantly with the 
General AACL score (r = — .37), and non- 
significantly with the mean for exam days 
(—.28) and the mean for nonexam days 
(—.34). The mean AACL scores in the Preg- 
nant Group did not correlate significantly 
with the parental attitude scales (Marital
Conflict, r = .14; Irritability, r = .02; Re-
jection of the Homemaking Role, r = .21),
and they did not correlate significantly with the
M-F scale (—.14), or with a score based on 
physical complaints in the interview (.20). 
The failure of the AACL to correlate with 
these measures is not crucial for the validity 
of the test, for these tests are not intended as 
direct measures of anxiety. 


AACL AND SEX, AGE, AND EDUCATION 


The total college group used in the first 
two studies consisted of 65 males and 19 fe- 
males. The differences between the means for 
males and females on the General and Today 
versions of the AACL were compared and 
found to be small and nonsignificant (General,
t = .09; Today, t = .49). In the same group
of 84 Ss, age was not found to correlate with 
the General AACL (r = .08) or with the To-
day AACL (r = −.01). In the group of 51
pregnant women, the Month AACL average
did not correlate significantly with age (r =
−.12) or with education as measured by
years of school (r = −.07).


Norms 


Although the samples used are not large 
enough to warrant establishing standard 
scores, the reader may be interested in com-
paring their samples with the ones used in
this study. The means and standard devia-
tions for the combined college group of 84
Ss were: Today, M = 7.38, SD = 3.98; Gen- 
eral, M = 5.15, SD = 3.06. For the group of 
51 pregnant women given the Month version 
of the AACL for the first time, M = 7.14, 
SD = 3.74. In using these normative figures 
one should consider the characteristics of the 
samples as described in the article. 


SUMMARY AND CONCLUSIONS 


A list of affectively toned adjectives was 
used in the development of an anxiety scor- 
ing key. Adjectives which differentiated High 
and Low Anxiety groups (selected by psy- 
chiatric interview), and showed significant 
changes in checking frequency during a hyp- 
notically suggested anxiety state, were used in 
the key. Two forms of the test with different 
time sets were used in a reliability study. 
Both the “General” and “Today” forms had 
adequate internal reliability on a single oc- 
casion, but only the General form demon- 
strated marked retest reliability. These re- 
sults were expected because of the differences 
in the time set. The Today version of the 
AACL was given repeatedly to a second 
group of college students at the beginning of 
a class period. Anxiety scores on examination 
days were compared with scores on nonex- 
amination days. The AACL score was found 
to rise significantly on examination days. On 
two of the three examinations, the students 
who made good grades on the examinations 
showed a significantly smaller rise in anx- 
iety scores than the students who made poor 
grades on the examinations. The General 
AACL correlated significantly with the MAS 
in one of the two college samples. The mean 
of 10 nonexamination day AACLs correlated 
significantly with the MAS. The mean of the 
3 examination day AACLs did not correlate
significantly with the MAS, although the score 
on the first examination day did correlate sig- 
nificantly with the MAS. AACL and MAS 


were highly correlated in a group of pregnant
women.
The General version of the AACL is sug-
gested as a quick measure of general anxiety
level. The Today version of the test is sug- 
gested as a technique in studies where re- 
peated assessments of anxiety must be made 
within a relatively limited time interval, i.e.,
weeks or months. The time set of the AACL 
can be changed by a simple adjustment of the 
instructions. In the study of pregnant women
a “month” set was used. In an experiment 
taking place over an interval of a few hours 
a “now” or “this minute” set could be used. 
It is assumed that the anxiety scoring key 
will be valid regardless of the time set. The 
studies reported give some evidence to sup- 
port this assumption although only future 
work can establish the fact. Other experi-
menters may desire to develop new keys for
this test. “Depression” and “Hostility” keys 
are two obvious possibilities. 
The present study was designed to add
strands to the nomological network (Cron-
bach & Meehl, 1955) which comprise the
Ego Strength (Es) scale (Barron, 1953). It
was reasoned that if the Es scale can dif-
ferentiate between psychiatric and nonpsy-
chiatric adults (Gottesman, 1959), and if it
really measures ego strength, then adjusted
college students should get significantly higher
scores on this scale than do maladjusted col-
lege students. Furthermore, if one of the
things which may be said about the Es scale
is that it reflects a defensive test taking atti-
tude, then it would seem that our sample of
adjusted students’ defenses would be higher
than those of the maladjusted students.

A random sample of 50 college students
(27 males and 23 females) was drawn from
a group of 300 sophomore and junior Teach-
ers College candidates at the University of
Nebraska. MMPIs had been administered to
these students as a routine screening device,
and it was generally known that the test re-
sults had some bearing on their futures in the
Teachers College. The criterion of adjustment
was met if these students indicated on the
Cornell Medical Index that they had never
been under psychiatric treatment and if they
had never contacted the Mental Health Di-
vision at the University of Nebraska. The
maladjusted group of students consisted of
33 individuals (24 males and 9 females) who
had voluntarily sought psychiatric consulta-
tion at the Mental Health Division and who
had remained in treatment for five or more
interviews. It is possible of course, with such
criteria of adjustment and maladjustment,
that a number of really well adjusted indi-
viduals might be included in the maladjusted
group and that conversely a number of mal-
adjusted individuals might be included in the
adjusted group. If there is an error here,
however, the error is in the direction of mak-
ing it more difficult for the Es scale to dis-
criminate between the two groups.

The MMPI records of the two groups were 
rescored for Es and K and it was found that
both scales do tend to broadly discriminate 
between adjusted and maladjusted college 
students. These results appear in Table 1. 

The mean differences between the two 
groups on each of the two scales were sig- 
nificant beyond the .001 level of confidence. 
The mean Es scale score for the adjusted 
students was much higher than that of the 
maladjusted students; and the mean K scale 
score of the adjusted students was also much 
higher than that of the maladjusted. Correla- 
tions of .32 (high Es with high K) and .48 
(low Es with low K) between Es and K for 


TABLE 1

Es AND K SCORES FOR ADJUSTED
AND MALADJUSTED COLLEGE STUDENTS

Group          N     Es      SD     CR     K       SD     CR
Adjusted       50    50.10   4.7    ...    19.12   3.96   ...
Maladjusted    33    45.21   3.35          15.33   5.53

* Significant beyond the .001 level of confidence.
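
A critical ratio (CR) for the difference between two independent means can be sketched as follows. The function name is hypothetical, and the means and SDs used in the example are read from a badly scanned table, so they should be treated as illustrative; the CR values actually printed in the table are not legible in this copy.

```python
import math


def critical_ratio(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """Difference between two independent means divided by its standard error."""
    standard_error = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_a - mean_b) / standard_error


# Illustrative readings of the table (adjusted vs. maladjusted students).
cr_es = critical_ratio(50.10, 4.7, 50, 45.21, 3.35, 33)
cr_k = critical_ratio(19.12, 3.96, 50, 15.33, 5.53, 33)
```

Under these readings both ratios exceed 3.29, the two-tailed normal deviate for the .001 level, which is consistent with the significance statement in the text.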
PSYCHOLOGICAL TEST REVIEWS


GARDNER, E. F., & THompson, G. G. Syracuse Scales 
of Social Relations. Elementary Level, Grades 5 
and 6, Junior and Senior high school level. Test 
booklets, pkg. of 35 ($4.90) includes scoring guide, 
report folder, tally sheets, class record, and manual, 
24 pp. Yonkers-on-Hudson, N. Y.: World Book 
Company, 1959. 


This sociometric device makes use of two hypo- 
thetical situations as a basis for ratings by each stu- 
dent of his classmates. Approximately parallel forms 
are developed for use with elementary, junior, and 
senior high school pupils. One of the situations, at 
all three levels, involves rating others’ ability to offer 
support, comfort, and sympathy, and is intended to 
reflect need for succorance; the other is specific to 
the level. At the elementary level achievement-recog-
nition is tapped; at junior high, deference; at senior
high, playmirth. The student’s frame of reference for
his ratings is established in a forced distribution, 
using individuals selected from all the persons he 
knows as reference points. Since every pupil is evalu- 
ated by every other one, information becomes avail- 
able on: (a) how each pupil views his classmates as 
being able to satisfy two of his important psycho-
logical needs; (b) how each pupil is evaluated by
his classmates as being able to satisfy their needs. 
Large samples, unspecified with regard to such rele- 
vant factors as intelligence and socioeconomic status, 

provide a not fully satisfactory normative basis for 


interpreting the average ratings given and received 
for each need. 


An instrument like this one departs from the pat-
tern of the simpler pencil and paper tests. Evalua-
tion of its usefulness becomes an extremely complex
process. The authors rely on five years of research,
mostly with college students. Test-retest measures of
stability over a one-to-two-week period suggest that
these measures have only moderate stability (from
.62 to .94). The manual is deficient in not warning
potential users that such factors as the passage of
time or the period in the school year might have
considerable effect on a given score. This uncertainty
would tend to impair its usefulness for routine diag-
nostic use. It will not be possible to involve a whole
class in the time consuming procedures required
every time the need to understand and help a par-
ticular pupil is discerned. Its validity is, of course,
not susceptible to simple summary. Brief reference
is made to studies whose results are suggestive, and
the reader is referred to the author’s 1956 book and
to a number of unpublished doctoral dissertations.
There is lacking any comprehensive or coherent 
framework for interpreting the results. True, it might 
be argued that providing such a framework requires 
more space than a manual can provide. Then the po- 
tential user ought to be warned that such a gap 
exists and referred to the proper sources. 

This reviewer is more impressed with possible use- 
fulness of these scales for research than for everyday 
clinical or educational work.—E. S. B.


Dicer, R. E. Survey Test of Algebraic Aptitude.
8th Grade. Timed, 40 minutes. Test booklets, IBM
answer sheets, IBM electronic score punch cards,
scoring key; manual, 18 pp. Los Angeles, Calif.:
California Test Bureau, 1959.

This 60-item test of basic arithmetical and sim- 
ple mathematical operations is designed to be used 
as a basis for guidance and placement for ninth 
grade mathematics. It seems to have undergone ade- 
quate standardization and some degree of prelimi- 
nary item analysis. Its reported predictive power 
against achievement tests scores or grades earned in 
ninth grade mathematics is satisfactory. The esti-
mated reliability is probably inflated through the in-
appropriate application of Kuder-Richardson Formula 
21 to a speeded test. Considerable correlations with
similar tests (Lee, Iowa, and Orleans tests) are re-
ported, but no comparisons with their respective pre-
dictive validities are offered. This test may prove as
dependable as the others.—E. S. B.


Jastak, J. F. Jastak Test. (Short form) Junior high
school, 1 form, 36 minutes. Test booklet ($3.85
per package), set scoring key ($.75), manual, 6
pp. ($1.50). Philadelphia: Educational Test Bu-
reau, 1959.

This briefer form of the 1958 edition (see J. con-
sult. Psychol., 1959, 23, 283) is reported to correlate
.97 with its parent. This correlation and the ex-
tremely high reliability coefficients reported are fal-
laciously high estimates because they were computed
on the same sample that provided the data for item
analysis on the basis of which the items to be in-
cluded in this form were selected. The manual is
essentially unchanged. I must repeat my conclusion
regarding the long form: without further validating
data, the Jastak Test cannot be accepted as a way
of obtaining inferences other than general level of
ability.—E. S. B.
CLIENT ACCEPTANCE OF RESPONSIBILITY AND
DIFFICULTY OF THERAPY 


PEARL SCHROEDER 


Student Counseling Service, University of Illinois


This study was designed to test the propo-
sition that there is a meaningful relationship
between the degree of responsibility demon-
strated by clients and the difficulty they pose
for therapeutic work. Difficulty is here limited
to its more extreme forms and refers either to
the instance in which, although a person re-
quests help, he does not really “get into” ther-
apy, or to the instance in which many more
hours than usual must be spent to achieve the
therapeutic goal.

Clinical observation suggested that one
common dimension of difference among cli-
ents of varying complaints and symptomatol-
ogy is the way in which they view themselves
as contributors to their experience. On the
one hand there is the client who presents him-
self as the helpless recipient of the impact of
other people and of chance events; on the
other, the client who claims himself as the
cause of his difficulties. Such difference ap-
pears not only to operate in initial presenta-
tion but also to permeate general speech con-
tent and to be reflected in the client’s role ex-
pectations of himself and the clinician in the
therapeutic work. Abstracted from the vary-
ing contexts in which it was noted, this differ-
ence can be identified as a variation in the
extent to which a client views his own atti-
tudes and behaviors as being consequential,
that is, as playing a part in subsequent
events. Stated dictionary-wise, it is a differ-
ence in the extent to which a client holds him-
self answerable as an agent for these events,
as “responsible.”

Theoretical discussions of personality and
diagnosis associate the rejection of respon-
sibility with repression and psychological
distress, and the assumption of re-
sponsibility with normal development
and psychological well-being. To illustrate:
Thompson (1958), speaking of the develop-
ment of the self, notes that “The ‘good me’
tends to deny responsibility for the ‘bad me’s’
activities” (p. 9) and that trouble ensues;
and Horney (1950), describing the neurotic,
states that “he always tends to deny laws op-
erating within himself, refuses to see the in-
evitability of cause and effect operating in
psychic matters . . .” (p. 36). Rogers (1951)
speaks of the ill effects of a failure to “own”
experiences and then links responsibility to
the normal development which moves “in the
direction of greater independence or self-re-
sponsibility” (p. 488). Fingarette (1955), in
an examination of Freud’s position anent
moral guilt and responsibility, after linking
repression and the rejection of responsibility,
concludes that “accepting responsibility is a
necessary condition of maturity” (p. 32).
Fingarette’s discussion is specific to “moral”
responsibility. In general use, “responsibility”
does often carry the additional meaning of
some form of moral judgment such as guilt or
blame. This additional meaning may be a
necessary consequent in terms of personal and
social values. However, the claiming of blame
or the making of the moral judgment is sepa-
rable as a process from the claiming of re-
sponsibility defined above as viewing one’s
own behavior as consequential. Fingarette
points out that guilt which seems dispropor-
tionate to the assigned reasons is but guilt
dissociated from behaviors (wishes) which are
not avowed. Thus there is a claiming of the
moral judgment but a disavowal of the cause
for the effect. In the statement, “Registration
happens to me tomorrow,” no moral judgment
is involved, yet the speaker implies he simply
experiences what some outside force does to
him. He rejects viewing registration as a re-
quirement whose fulfillment depends upon his
[...].

Inasmuch as rejection of responsibility is as-
sociated with rejection of reality (Horney,
1950; Thompson, 1958) and assumption of 
responsibility with development and maturity 
(Fingarette, 1955; Rogers, 1951), a simple 
linear relationship between client acceptance 
of responsibility and difficulty of therapy 
would appear to be the most logical one. 
However, clinical observation suggested that 
the client who attributes his difficulties al- 
most totally to himself, later in therapy has 
as much difficulty in accepting and dealing 
with reality relationships as the client who 
attributes his difficulties almost totally to out- 
side forces. Logically, too, it can be argued 
that the person who rejects the effects of out- 
side influences and claims total responsibility 
for the events of his life is in this respect di-
vorced from reality and will therefore be diffi-
cult to work with in therapy. These last con-
siderations led to the prediction of a curvi-
linear rather than a linear relationship and
the final form of the hypothesis to be tested
was: clients showing extremes of responsibil- 
ity pose more difficulty for therapy than those 
showing moderate responsibility. 




METHOD 
Measures 


Two measures were required: one of responsibility,
the other of difficulty. In both instances a more ob-
jective method than global clinical judgment was de-
sired. Since the observations leading to the hypothe-
sis were of what the client said, a content analysis
of verbal production was chosen for the measure of
responsibility. Personal characteristics can be dem-
onstrated by very brief verbal material; however,
identification of such characteristics for later recog- 
nition requires material of some length. Therefore a 
fairly lengthy protocol was necessary. 

A three-years’ collection of data for clients who 
had participated in an extensive research project in 
therapeutic counseling sponsored by the Student 
Counseling Service and the Department of Psychol- 
ogy of the University of Illinois had just been com- 
pleted (Hunt, Ewing, LaForge, & Gilbert, 1959). The 
data included protocols of therapy interviews, Ror-
schachs, TATs, and Gilbert Self-Interview Test. 

Because the situation in the Gilbert Self-Interview 
Test is simpler than that of the interview with its 
dyadic interactions, and because the subject (S) of 
necessity must focus more directly on himself and his 
life events than in the Rorschach or TAT, the Gil-
bert protocols were chosen for the content analysis. 

The Gilbert Self-Interview Test (Ewing, 1957, pp.
19-20) is administered by written instructions. The 
S is alone in a room and speaks his responses into a 
microphone which records them on tape. These re- 
sponses are then transcribed and the tapes filed for 
listening. The test asks S to spend about three min- 
utes in describing his “real attitudes and feelings 
both good and bad” and his “problems, if any” about 
each of the following 10 topics: educational and vo-
cational goals; family; how he gets along with peo- 
ple of both sexes; relationships with the opposite 
sex; financial situation; ethical, moral, and religious 
views; abilities; unfavorable and undesirable aspects 
of his personality; favorable and desirable aspects of 
his personality; any important aspect of his life not 
covered. The resultant protocol is generally of some 
length.

Instructions for the analysis of these protocols do 
not presuppose any psychological training, nor do 
they require any clinical interpretation. They are 
but briefly summarized here.1 The unit of analysis
is the statement, a group of words expressing a sepa- 
rate idea. All statements of the following kinds are
identified and marked as responsibility statements: 
(a) reasons given for an event or condition, (b) 
statements indicating the influence or effect of an 
agent upon a recipient, (c) statements of cause and 
consequence, (d) identification of means used for 
accomplishing ends, and (e) statements of need or
compulsion arising from without or from within.
Those responsibility statements in which S presents
himself as the active agent are then scored plus;
those in which he presents himself as the recipient
of an agent’s activity, minus; and those not classifi-
able as either of the above, check.

Reliabilities of scoring were established for two
judges on 10 cases. Since the scores of both judges
are measures of the same thing and therefore means
and standard deviations should be equivalent, the
intraclass correlation measure described by Haggard
(1958) was used to estimate reliability. R values for
the raw scores were: plus scores, .915; minus scores,
6845; check scores, .778; sum of all responsibility
statements, .936. The probabilities associated with
these correlation values are in every instance less
than .01.


1 A complete copy of the instructions for the use
of the termination form and for assessing difficulty
has been deposited with the American Documen-
tation Institute. Order Document No. 6406 from
ADI Auxiliary Publications Project, Photoduplica-
tion Service, Library of Congress; Washington 25,
D. C., remitting in advance $1.75 for microfilm or
$2.50 for photocopies. Make checks payable to:
Chief, Photoduplication Service, Library of Congress.


To measure responsibility as the extent to which
S views his behavior as consequential, it was neces-
sary to take into account those instances in which
he disclaims his own agency and attributes the source
of events to something outside himself. Therefore the
responsibility score had to represent both plus and
minus scores. Three such compound scores were pos-
sible: two ratio scores and one difference score. The
two ratios were: number of plus responsibility state-
ments/sum of all responsibility statements, and num-
ber of plus responsibility statements/sum of plus and
minus responsibility statements. The difference score
was the number of plus statements less the number
of minus statements corrected for sign by the addi-
tion of a constant. The intraclass correlation for the
two judges was computed for each of these scores
with the following results: ratio score of plus state-
ments/sum of all responsibility statements, R = .671,
p < [...]; ratio score of plus statements/sum of plus
and minus statements, R = .677, p < .01; difference
score (number of plus statements less the number of
minus statements), R = .591, p < .05.
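
The intraclass reliabilities reported for the two judges can be illustrated with the familiar one-way analysis-of-variance form of the intraclass correlation. The sketch below is only an assumption about the computation: Haggard's (1958) exact formula may differ in detail, and the judge scores shown are hypothetical, not data from the study.

```python
from statistics import mean

def intraclass_r(ratings):
    """One-way ANOVA intraclass correlation for k judges scoring n cases.

    ratings: one tuple per case, holding each judge's score for that case.
    Assumed formula: R = (MSb - MSw) / (MSb + (k - 1) * MSw).
    """
    n = len(ratings)
    k = len(ratings[0])
    grand = mean(score for case in ratings for score in case)
    case_means = [mean(case) for case in ratings]
    ss_between = k * sum((m - grand) ** 2 for m in case_means)
    ss_within = sum(
        (score - m) ** 2 for case, m in zip(ratings, case_means) for score in case
    )
    ms_between = ss_between / (n - 1)
    ms_within = ss_within / (n * (k - 1))
    return (ms_between - ms_within) / (ms_between + (k - 1) * ms_within)

# Hypothetical plus-statement counts from two judges on 10 cases.
judge_scores = [
    (12, 14), (30, 27), (8, 9), (21, 25), (15, 15),
    (40, 36), (5, 8), (18, 17), (26, 29), (11, 10),
]
r = intraclass_r(judge_scores)
print(f"intraclass R = {r:.3f}")
```

Because the statistic penalizes differences in judges' means and variances as well as in rank order, it suits the paper's stated premise that the two judges' scores should be equivalent.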
Arbitrarily, the ratio of sum of plus/sum of
plus and minus statements was chosen as the meas-
ure for the test of the hypothesis. Extreme and mod-
erate responsibility was determined by quartile divi-
sions of the distribution of these scores. Not as a
part of the test but as a part of exploratory work,
it was planned to subject the other two compound
scores to the same statistical analysis as the plus/
plus and minus ratio score.
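
The three compound scores and the quartile division can be sketched as follows. The function names, the example counts, and the choice of quartile method are assumptions for illustration; the paper does not state the value of the constant added to the difference score or how quartile cut points were computed.

```python
from statistics import quantiles

def compound_scores(plus, minus, check, constant=0):
    """Return the three compound responsibility scores for one protocol.

    `constant` stands in for the unspecified sign-correcting constant.
    """
    total = plus + minus + check  # sum of all responsibility statements
    return {
        "plus_over_all": plus / total,
        "plus_over_plus_minus": plus / (plus + minus),
        "difference": plus - minus + constant,
    }

def label_by_quartile(scores):
    """Label scores in the outer quartiles 'extreme', the rest 'moderate'."""
    q1, _, q3 = quantiles(scores, n=4)
    return ["extreme" if s <= q1 or s >= q3 else "moderate" for s in scores]

# Hypothetical protocol: 30 plus, 10 minus, and 10 check statements.
example = compound_scores(30, 10, 10)
# Hypothetical plus/(plus + minus) ratios for eight subjects.
labels = label_by_quartile([0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8])
```

Under the curvilinear hypothesis, both tails of the distribution are pooled as "extreme", which is why the two outer quartiles receive the same label.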


Difficulty


[...] was established. With reference to [...]
either does not [...] a “long [...] Difficulty was
based upon the number of interviews and upon data
supplied by the termination form (see Footnote 1)
which, filled out by the therapist at the end of
therapy, describes the conditions of termination.

The category “difficult” included all cases with
more than 30 interviews (roughly two semesters
on a once-a-week basis) and all those terminated
after fewer than 10 interviews, with termination
form data which indicated that, in the therapist’s
opinion, termination was dictated by a [...] lack
of motivation rather than completion of therapeu-
tic work. “Nondifficult” included all those cases
not specified above.
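
The categorization rule can be written out as a small function. This is a sketch under one reading of the text, namely that the termination-form qualifier applies only to cases ending before 10 interviews; the function and parameter names are hypothetical.

```python
def classify_difficulty(n_interviews, ended_for_lack_of_motivation):
    """Classify a case as 'difficult' or 'nondifficult'.

    n_interviews: number of therapy interviews held.
    ended_for_lack_of_motivation: True when the termination form indicates
        that, in the therapist's opinion, therapy ended for motivational
        reasons rather than because the work was completed.
    """
    if n_interviews > 30:
        return "difficult"  # excessive length of therapy
    if n_interviews < 10 and ended_for_lack_of_motivation:
        return "difficult"  # early termination with rejection of therapy
    return "nondifficult"

print(classify_difficulty(118, False))
print(classify_difficulty(4, True))
print(classify_difficulty(21, False))
```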


Design


Using the measures described above, the em-
pirical prediction for testing the hypothesis becomes:
a significantly greater proportion of those cases with
extreme responsibility scores (sum plus statements/
sum plus and minus statements) are associated with
difficult therapy than with nondifficult.

The statistical test was a two-by-two chi square.


Subjects


The 48 Ss who constituted the sample for this
study were all clients of the Student Counseling
Service who had participated in the therapy [...]
project conducted by the Counseling Service and
the Department of Psychology of the University of
Illinois.2 This service is available to students,
both undergraduate and graduate, and, to a limited 
degree, to faculty and to nonacademic personnel. 
The service has no hospital facilities and the client 
population is of necessity limited, therapy-wise, to 
those who can manage on an outpatient basis. 

The sample was selected on a number of bases. 
Since termination data were necessary for the cate- 
gorization of difficulty, only those clients who ap- 
plied for service and had at least one therapy inter- 
view could be used. Ten such cases, which were also 
used for the establishment of reliability, were pulled 
at random from the file of typed transcripts. Eleven 
were chosen as part of an initial effort to limit the 
number of different therapists involved (11 differ- 
ent therapists contributed to the termination data). 
Eleven cases were selected to maximize the overlap 
of Ss between this study and two others and so to 
provide interstudy information. Sixteen were selected
because posttherapy Gilbert Self-Interview Test pro- 
tocols were available for them and a pre-post study 
had been tentatively planned.

Although the sample was neither systematically se- 
lected on a single set of variables nor completely 
randomized, the method of selection of cases does 
not appear to foster any bias except an increase in 
the number of Ss with posttherapy test data over 
what it might have been with complete randomiza- 
tion. (Thirty-one of the sample repeated the Gilbert 
test after termination of therapy.) 

The age range of the sample was 17 to 32 with a 
median of 24. Number of interviews per client ranged 
from 2 to 118 with a median of 21. The 48 Ss di- 
vided evenly between the difficult and nondifficult 
categories, 17 men and 7 women falling into each
category.


RESULTS


A two-by-two chi square test of the distri- 
bution of Ss with extreme and with moder- 
ate responsibility scores (plus statements/sum 
plus and minus statements) between the diffi- 
cult and nondifficult categories showed no sig- 
nificant relationships (χ² < 1.00). The same
test applied to the other two compound scores 
also showed no significant curvilinear relation- 
ships between difficulty and responsibility. 

A check of the rejected linear relationship
in which difficulty of therapy would have been 
associated with low responsibility, showed 


2 The program of research which supplied the raw 
data upon which this study is based was initiated 
under a grant to J. McV. Hunt, T. N. Ewing, and
W. M. Gilbert from the Carnegie Corporation. It was
also supported by the Research Board of the Uni-
versity of Illinois, and the major share of support
has come from grants from the National Institute of
Mental Health (M-1041 and M-1388) and a con- 
tract with the Office of Naval Research (Nonr-1834). 
TABLE 1


CHI SQUARE TESTS OF ASSOCIATION: RESPONSIBILITY,
DIFFICULTY OF THERAPY, AND MOVEMENT


         D                     M

R        χ² = 6.761**          χ² = 4.090*
         High R with D         High R with High M

D                              χ² = 5.333*
                               D with High M


Note. All chi square values based on 2 × 2 tables.

R: Responsibility as measured by number of plus
statements less the number of minus statements.
Median splits used to determine high and low.


D: Difficulty of therapy as determined by length and
conditions of termination. Cases divided into difficult
and nondifficult.

M: Movement judgments made by therapists at
termination of therapy. Median splits used to deter-
mine high and low.


* p < .05.
** p < .01.


that statistical significance depended upon 
the particular compound score used but that 
the direction of relationship was the opposite 
of what would have been predicted from the 
literature. The results of these analyses, using 
two-by-two chi squares with median splits 
for the responsibility scores, were: ratio score 
of plus statements/sum of all responsibility 
statements, χ² < 1.00; ratio score of plus
statements/sum of plus and minus state-
ments, χ² = 3.00, p < .10; difference score
(number of plus statements less the number
of minus statements), χ² = 6.761, p < .01.
The last two show difficulty of therapy as- 
sociated with high, not low, responsibility 
scores. 

A different criterion variable, movement
judgment, had been used in the large therapy
[...] by the Student [...] judgment, based on
the Hunt-Kogan Movement Scale (1950), was
made by the therapist [...] movement, were
analyzed. The results are given in Table 1.
They show that high responsibility is associated
with difficult therapy, that high responsibility is
associated with high movement, and that high
movement is associated with difficult therapy.
Seventeen cases showed concomitance of 
high responsibility and difficult therapy; of 
these 17, 11 also showed high movement. Ten 
of these 11 were classified as difficult because 
of excessive length of therapy. (Of the 24 
difficult cases, 19 were so classified because 
of excessive length, and only 5 because of 
early termination with rejection of therapy.) 
Sixteen cases showed concomitance of low 
responsibility and nondifficult therapy. Of 
these 16, 13 also showed low movement. 
The relationships of Table 1 are thus 
largely determined by two kinds of cases: 
those with high responsibility scores who pose 
the difficulty of excessively long therapy but 
show high movement; and those with low re-
sponsibility scores who neither flee therapy
early nor take up much time but do not show
much movement.
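
The chi square for responsibility and difficulty can be rechecked from counts given in the text: 17 cases with high responsibility and difficult therapy, 16 with low responsibility and nondifficult therapy, and 24 cases in each difficulty category, which fixes the remaining two cells at 8 and 7. The sketch below uses the uncorrected Pearson formula for a 2 × 2 table; that choice is an assumption about the original computation, although it does reproduce the reported value.

```python
def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

# Rows: high R, low R. Columns: difficult, nondifficult.
high_r_difficult = 17
low_r_nondifficult = 16
high_r_nondifficult = 24 - low_r_nondifficult  # 8 of the 24 nondifficult cases
low_r_difficult = 24 - high_r_difficult        # 7 of the 24 difficult cases

chi2 = chi_square_2x2(
    high_r_difficult, high_r_nondifficult, low_r_difficult, low_r_nondifficult
)
print(f"chi square = {chi2:.3f}")  # about 6.76, in line with the reported 6.761
```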


DISCUSSION


Although the data do not support the hy- 
pothesized curvilinear relationship nor yet the 
obvious inference from the literature of a 
linear relationship with low responsibility as- 
sociated with difficult therapy, they do dem- 
onstrate an association of these two variables 
which cannot be ignored.

Exploration of the meaning of these results
by means of an analysis of the relationships
of these two variables to a second criterion,
movement, showed that high responsibility
scores, difficult therapy, and high movement
were all positively associated with each other.
As pointed out in the presentation of results,
the high part of this three-way relationship
depends largely upon those clients who pose
the difficulty of excessively long therapy; the
low upon those who neither flee nor take a
long time, but who do not show much change.
Therefore it is suggested: that clients with
high responsibility scores, if they do not [...]
ask more of therapy, are willing to invest
more in it, and, as a result, both take more
time and show more change; and that clients
with low responsibility scores have a different,
perhaps a less self-focused goal, limit what
they will invest, and therefore take less ther-
apy time and also show less change.
One implication of the present results for
replication and further investigation concerns
the measure of responsibility. The fact that
the two scores which are most narrow in scope
are also the most discriminative with respect
to the criterion suggests it would be worth-
while to try to simplify and sharpen the in-
structions for content analysis. Another im-
plication concerns the whole problem of pre-
dicting therapy but can be best articulated
here by focusing upon cases with high re-
sponsibility scores: what differentiates those
cases with high responsibility scores who per-
sist and gain from those who do not? It is
suggested that motivation is such a variable
and should be studied in conjunction with
responsibility for predicting the course of
therapy.

SUMMARY 


It was hypothesized that clients with both
very high and very low acceptance of re-
sponsibility would present difficulty for ther-
apy when such difficulty was defined as either
never really “getting into” therapy or taking
a greater than usual number of interview
hours. To test this hypothesis, a method of
measuring responsibility by means of content
analysis of verbal protocols was developed.
The data did not support the hypothesized
curvilinear relationship. They showed in-
stead an association of high responsibility
with difficult therapy and low with nondiffi-
cult. They also demonstrated a relationship
of high movement with high responsibility,
and high movement with difficult therapy.

It was suggested for further work that the 
measure of responsibility be sharpened, and 
that motivation be investigated in conjunc- 
tion with responsibility for predicting therapy 
outcomes. 
SOCIAL VALUE-NEED PATTERNS IN MENTAL
RETARDATES 1


THOMAS E. BOLDUC 2 


Laurelton State Village, Pennsylvania 


One of the important factors that charac- 
terizes the behavior of socially adjusted indi- 
viduals is the awareness of and ability to con- 
form to social values. As individuals mature, 
society expects them to acquire an increas- 
ingly conscious and rational code of behavior. 
Certain minimal standards of socially desir- 
able behavior must be acquired to achieve a 
satisfactory social adjustment. In a sense it is 
the ability to cope with everyday life experi- 
ences without opposing society’s values that 
constitutes an acceptable social adjustment. 

The increasing complexity of the process of 
living in modern cultures augments the handi- 
cap for the person of limited intelligence. His 
ability, for example, to acquire norms of be- 
havior for a wide variety of situations, to ap- 
prehend the character of his experiences, and 
to form comparisons between them is more 
limited and less flexible than that of a per- 
son of normal intelligence. Since he must form 
ideas of situations he has never actually ex- 
perienced and conceive of events and how to 
meet them before they ever occur, it is little 
wonder that the mental retardate becomes a 
potential delinquent. The inability to acquire 
and apply socially desirable behavior may be 
one of the most important reasons for this 
failure in community adjustment. 

The purpose of this investigation was to
isolate and identify common patterns of solu-
tions to social value-need conflict situations in
mental retardates. To achieve this goal an in-
strument was constructed to measure the re-
sponses to social value-need conflicts in situa-
tions similar to those which might occur in
everyday experiences of mental retardates.
Through an inverse factor analysis of correla-
tions between persons on this measure, pat-
terns representing groups who favor particu-
lar types of solutions to these conflicts were
sought.


1 The research was supported by a USPHS research
grant (M2121) from the National Institute of Men-
tal Health, United States Public Health Service. Ap-
preciation is expressed to Alfred Butler, project di-
rector, for his editorial suggestions and valuable
critique which were very helpful in the preparation
of this paper.

2 Now at Psychiatric Services, Wales, Wisconsin.


THE RESEARCH DESIGN


Factor analysis was considered to be the 
most effective method of identifying the multi- 
dimensional variables associated with such 
general concepts of values and needs. Al- 
though the study was exploratory, the explo- 
rations were based on definite principles with 
a specific structure designed to define the 
limits of the area of investigation. 

A review of the literature failed to uncover
a standardized scale that would adequately
measure the variables of interest in this study.
Instruments (Allport, Vernon, & Lindzey,
1951; Anderson & Anderson, 1954; Bills,
Vance, & McLean, 1951; Grace & Grace,
1952; Johnson & Stanley, 1955; McDonald,
1956; Morris, 1956; Woodruff, 1942) that
have been developed to measure value struc-
ture would have required modifications and
extensive editing to adapt them for use with
a population of mental retardates. It was
therefore necessary to design a measure based
on a classification structure which would re-
flect individual differences in the mental re-
tardate’s responses to social value-need con-
flict situations.


Selection of the Variables 


The first problem was to select appropriate
variables for the two major dimensions,
value and need, and then to devise a classifi-
cation system that would facilitate the con- 
struction of the items and provide structure 
for the scale. 

In keeping with the goal of identifying pat- 
terns of social value-social need conflicts in 
familiar situations, it was necessary that both 
the value and the need dimensions include so- 
cial rather than personal areas. Even with this 
limitation imposed on the structure a wide 
variety of possible classifications existed. It 
must be recognized that the concepts of value 
and need are not independent psychological 
entities but complex multidimensional vari- 
ables that overlap other personality variables. 
Therefore, the specific values and needs se- 
lected for this study are “social” only in terms 
of emphasis. It is conceivable, for example, 
that an individual could be honest or loyal to 
his self-system, but it is more common to con- 
sider him honest or loyal to others. Similarly, 
the needs selected are primarily social rather 
than personal in that their gratification de- 
pends primarily on interactions or relation-
ships with others.

The classification system of values em- 
ployed in Havighurst and Taba’s (1949) 
study of character and personality met the 
above criteria and provided a comprehensive 
coverage of the value dimension required for
the present investigation. The categories of
values used were honesty, loyalty, moral cour-
age, and responsibility. Other values might
have been employed, but after reviewing the
literature in this area it was apparent that
many of the value descriptions were related
to or were modifications of the values used in
the Havighurst study.


Honesty as employed in this study refers
to the [...] property and truthfulness. Be-
ing truthful, especially with authorities such
as parents, [...] and employers, and [...]ing
respect for the property of others are ex-
amples of honesty. Loyalty involves being
faithful to school functions and staff, to
friends who are in trouble, and to one’s
family. Moral courage involves the [...]
one’s own rights and those of [...]. De-
fense of the self and others, including mi-
nority groups, against harmful gossip, ex-
pression of personal opinions and [...], par-
ticularly when resistance is encountered, [...]
“preachy behavior” are examples of moral
courage. Responsibility involves the honoring 
of home, school, and employment duties. The 
completion of assigned tasks, family responsi- 
bilities, and the protection of people or prop- 
erty under one’s care reflect responsibility. 

In a similar manner, needs were chosen that 
met the criteria and yet were comprehensive 
enough to cover a wide range of familiar, 
everyday experiences. Five needs, based on 
Murray's (1938) theoretical framework, were 
employed in the present study. They are nur- 
turance, succorance, dominance, affiliation, 
and deference. All of these imply that a rela- 
tionship with another person facilitates grati- 
fication. Considerable research in many areas 
of study has been reported, utilizing these 
needs. Edwards (1953), for example, includes 
these five needs in his study of personal pref- 
erences. 

Nurturance is defined as the need to give 
affection, sympathy, help, or pity to another 
person. Succorance is the need to be nursed, 
supported, protected, loved, and consoled by 
another person. Dominance is the need to 
control, influence, or direct the behavior of 
others either by suggestion, persuasion, or 
command. Affiliation involves the need to 
form or maintain a relationship with another 
person. Deference involves the need to yield, 
submit, or refuse to compete with superiors 
because of physical size, status, or age. 


The Research Instrument 


A pool of 141 items was constructed according to 
the criteria described above. Twenty pairings were 
possible so that there were items describing a con- 
flict between each combination of the four values 
and the five needs. Each item consisted of a brief 
description of a familiar social conflict situation and 
a forced-choice decision between two alternate re- 
sponses—one depicting a value and the other a need. 
The forced-choice technique reduces the effect of 
such biasing variables as response set. The items 
were typed on 3 × 5 cards with the order of alter-
nates randomized to reduce the possibility of a po-
sition set occurring. An attempt was made to in-
clude a comprehensive range of social experience fa-
miliar to mental retardates. The task of the subject
(S) was to select one of the alternate solutions as
the response to the conflict situation. In order to
minimize the anxiety associated with the administra-
tion procedures as well as to gain insight into the
social acceptability status of the items it was de-
cided that the Ss adopt a “should” rather than a
“[...]” set. It was hoped that this would reduce
[...] test behavior associated with [...]
self-statements. Examples of the items are presented
[...]

#[...]. Leona was in a gang. One day the gang broke
some windows at school. Leona was caught and the
other girls got away. The teacher asked her who
the other girls were. What should Leona do? (b)
Tell the teacher because teachers are the boss. (a)
[Not tell] the teacher because she does not want to
[get the other] girls into trouble. (Structure: Loyalty
vs. Deference)

#56. Sue had a job checking spelling papers after 
school. One day a boy came by and asked Sue to 
go to the movies with him. What should Sue do? 
(a) Keep on with her job. (b) Go with the boy be- 
cause Sue needed someone to like her. (Structure: 
Responsibility vs. Succorance) 

#60. Blanche was a lonely girl. Blanche’s report 
card had some bad marks on it and she was afraid 
to take it home. What should Blanche do? (b) 
Change the marks and make them higher so that 
her father would be pleased with her. (a) Take it 
home and show it to her father. (Structure: Honesty 
vs. Succorance) 

#86. Joan was one of the best players on the
club’s kickball team. One day the girls from another
team told Joan that if she would play for them they
would make her captain. What should Joan do? (b)
Go with the other girls and be the captain. (a) Stay
with her own team because they needed her. (Struc-
ture: Loyalty vs. Dominance)
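
Each example item carries a structure label pairing one value with one need. The twenty possible pairings of the four values and five needs can be enumerated as in the sketch below; the variable names are illustrative and the label format simply mirrors the "Structure:" tags attached to the sample items.

```python
from itertools import product

VALUES = ["Honesty", "Loyalty", "Moral Courage", "Responsibility"]
NEEDS = ["Nurturance", "Succorance", "Dominance", "Affiliation", "Deference"]

# One structure label for every value-need conflict the item pool covers.
pairings = [f"{value} vs. {need}" for value, need in product(VALUES, NEEDS)]

for label in pairings:
    print(label)
```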

After the items were constructed and refined they 
were submitted independently to four psychologists 
who were instructed to identify each item according 
to the criterion definitions of the classification struc- 
ture. A total of 21 items failed to meet the 75% 
interjudge agreement criterion and were eliminated 
from the scale. It should be mentioned briefly that 
this method measures only the degree to which the 
items are descriptive of the criteria definitions. The 
future use, therefore, of the terms “value” and “need” 
will refer only to the judged face validity as em-
pirically determined.

Following this refinement, the remaining 120 items 
were administered on two occasions, seven days 
apart, to a random sample of 20 institutionalized fe- 
males, with an age range of 14 to 18 and an IQ 
range of 57 to 78, none of whom had been institu- 
tionalized more than two years.

The test-retest reliability coefficients (tetrachoric)
for each S were transformed to z scores and aver-
aged, resulting in a mean test reliability of .$7. Simi-
larly the reliability coefficients of each of the 120
items were averaged, resulting in a mean item reli-
ability of .93. Eleven items were eliminated due to
low or negative reliability coefficients.
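
Averaging correlation coefficients through z scores is usually done with Fisher's r-to-z transformation, and the sketch below assumes that is what was meant here. The coefficients are hypothetical, since the individual reliabilities are not reported.

```python
import math

def mean_correlation(coefficients):
    """Average correlations by way of Fisher's z and convert back to r."""
    z_values = [math.atanh(r) for r in coefficients]  # r -> z
    mean_z = sum(z_values) / len(z_values)
    return math.tanh(mean_z)                          # z -> r

# Hypothetical test-retest coefficients for three subjects.
hypothetical = [0.80, 0.90, 0.95]
averaged = mean_correlation(hypothetical)
print(f"mean reliability = {averaged:.3f}")
```

The transformation matters most for high coefficients, where a plain arithmetic mean of positive correlations gives a lower value than the z-based average.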
A criterion of 10% or [...]

As might be expected, the item distribution was
skewed slightly toward the value dimension in that
the Ss tended to select the value alternate more fre-
quently than its need counterpart. For this reason
some of the biased value alternatives were de-em-
phasized, that is, made less socially desirable. The
above panel of judges agreed that this moderate re-
wording of some of the items should not significantly
affect the reliability and should increase the vari-
ability of the item.

The final distribution of the items according to the
classification structure seemed adequately balanced.
No single category had less than three replications
or more than seven, with a mean of five. The Ss
were able to comprehend the items and meaningfully
select alternates. It was felt that the Ss were able to
maintain the “should” set and that the defensiveness
associated with examinations was decreased.


3 This scale has been deposited with the American
Documentation Institute; see Footnote 5.


The Subjects


The Ss in this study were 50 mental defectives,
consisting of two groups of 25 each, between the
ages of 14 and 18 and with an IQ range of 55 to 80.
One group was institutionalized and the other was
not institutionalized. All the Ss were without gross
organic dysfunction and only white females were
used, to eliminate race and sex differences in response.

The institutional sample of 25 retardates was drawn
randomly from the total population of Ss meeting
the age and intelligence criteria at the Laurelton
State Village. This is a state training school for fe-
male mental retardates of child-bearing age, the ma-
jority of whom are court committed because of a
failure to achieve a satisfactory social adjustment in
the community.

The noninstitutional sample [...] from the [...]
sample was drawn ap[...] the institutional population
in rural-urban distribution. None of the Ss selected
had been involved in court action for delinquent
behavior.

Although the chief objective of the study was not
to measure response differences between groups of
Ss, it was decided that if two random groups of Ss
could be obtained which were similar in all respects
except institutionalization, it would be feasible to
make inferences about differences that might point
the way for future research. If only institutionalized
Ss, for example, were used the findings of the study
would be restricted to institutionalized retardates. It
was decided that greater variability in social value-
need behavior would be obtained if noninstitution-
alized Ss were included in the sample. For these
reasons random samples were selected and tests of
significance of differences were computed for those
variables that might confound the data. No signifi-
cant differences were found in age, intelligence, or
reading ability.


Testing Procedure


The scale was administered individually to each of 
the 50 Ss. Each item was presented verbally to the 
S who was requested to identify the better one of 
the two alternative solutions to the conflict situa- 
tion depicted by the item. None of the Ss refused to 
take the test and all Ss tested were contained within 
the originally selected sample. 


Statistical Analysis


The major hypothesis of this study was that psy-
chologically meaningful dimensions would be re-
vealed if a matrix of intercorrelations of Ss’ re-
sponses to pairs of alternate solutions was factor
analyzed and the arbitrary orthogonal axes rotated
to simple structure. The principal axis method of
factor extraction (Hotelling, 1933) and the orthogo-
nal rotation method were chosen as the methods
of analysis. This method provides a more rigorous
means of extracting factors from a correlation ma-
trix and it has the advantage of placing more of the
variance in the first factors. The highest correlation
coefficient in the column was used as the initial esti-
mate of communality. Subsequent estimates were
based on the previously obtained communality.

Each S’s responses were correlated with those of
every other S, resulting in a 50 × 50 matrix of inter-
correlations. Since the data were dichotomous and
the underlying dimensions were assumed to be con-
tinuous, the tetrachoric coefficient was considered most
appropriate. An approximation of the tetrachoric
was made possible by programing PENNSTAC,4 uti-
lizing the first five terms of the tetrachoric equation.

Extraction of factors was discontinued after six fac-
tors. Residual correlations ranged from −.2938 to
.2713, with a mean of .0006 and a standard devia-
tion of .0877. Rotation of the arbitrary orthogonal
reference axes was performed by the varimax solu-
tion (Kaiser, 1956). This solution provides an ob-
jective method of approximating simple structure. A
comparison of the varimax and the quartimax rota-
tion methods by Wrigley, Saunders, and Neuhaus
(1958) indicated that the former has the advantage of
providing a somewhat more stable structure when
the number of factors is small and the number of
tests (persons) is large, as is the case in this prob-
lem. Biserial correlation coefficients were employed
to measure the relation of each item to each final
factor loading.


RESULTS 


The rotated factor matrix of the Ss' responses
to the social value-need scale is presented
in Table 1.5 This table provides the
rotated factor loadings on each of the six factors
for the 50 Ss. In the first 25 rows are the
factor loadings of the noninstitutional sample;
the last 25 rows represent the Laurelton
sample. The distribution of the total variance
accounted for by the six factors is given in
the last row of the rotated factor matrix.

4 Medium-speed digital computer at the Pennsylvania
State University. Special thanks are due ...
for his cooperation in developing the program.

The six factors represent six possible pat- 
terns of preferences of the 50 Ss obtained 
through their responses to the forced-choice 
items of the scale. Therefore, the character- 
istics of the items which differentiate the Ss 
on the six factors could serve as a basis for 
factor interpretation. Accordingly, each of the 
95 items of the research instrument was cor- 
related with the factor loadings of each of the 
six rotated factors. The 570 biserial correla- 
tion coefficients obtained were employed to 
measure the contribution of each item to the 
rotated factors. The highest item-factor cor- 
relations provided the basis for the psycho- 
logical interpretation of the factors. Items 
that correlated highly with only one factor 
were given the greatest weight and provided 
the identification of the specific nature of the 
factors. Where items correlated significantly 
with more than one factor, such items were 
given less emphasis and were used primarily 
to define the general nature of the factors. 
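The item-factor correlations described above (95 items × 6 factors = 570 coefficients) can be illustrated with the conventional biserial formula, r_b = ((M1 - M0) / s) (pq / y), where y is the normal ordinate at the point dividing the proportions p and q. This is a sketch under assumptions: the data are invented, the function name is hypothetical, and the text does not state which computing formula the authors used.

```python
from statistics import NormalDist, mean, pstdev


def biserial(item, scores):
    """Biserial correlation between a 0/1 item and a continuous score.

    item   -- list of 0/1 choices (e.g., 1 = value alternative chosen)
    scores -- list of continuous values (e.g., Ss' loadings on a factor)
    """
    ones = [s for i, s in zip(item, scores) if i == 1]
    zeros = [s for i, s in zip(item, scores) if i == 0]
    p = len(ones) / len(item)
    q = 1 - p
    nd = NormalDist()
    y = nd.pdf(nd.inv_cdf(p))  # ordinate at the point dividing p and q
    return (mean(ones) - mean(zeros)) / pstdev(scores) * (p * q / y)


# Hypothetical data: eight Ss' choices on one item and their factor loadings.
item = [1, 0, 1, 0, 1, 0, 1, 0]
loadings = [0.5, 0.4, 0.6, -0.1, 0.0, 0.2, 0.3, -0.2]
r_b = biserial(item, loadings)
```

Unlike a product-moment coefficient, the biserial coefficient is not bounded by 1 in small or markedly non-normal samples, so values should be read with that in mind.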

Since the items were designed so that a 
forced-choice response between two alterna- 
tives was required, it follows that when an S 
selected the value alternative she was at the 
same time rejecting the need alternative. 
Therefore the interpretation of the dimension
associated with the preference is inferred not
only from the chosen alternative but also
from the other alternative in the situation.
In some instances the preference for one se- 
lection may represent an avoidance response 
—an avoidance of the other alternative. This 


5 The social value-need scale, the correlation matrix,
the original factor matrix, the transformation
matrix, the item-factor correlations, and the table of
items defining the factors have been deposited with
the American Documentation Institute. Order Document
No. 6404 from ADI Auxiliary Publications
Project, Photoduplication Service, Library of Congress,
Washington 25, D. C., remitting in advance
$1.75 for microfilm or $2.50 for photocopies. Make
checks payable to: Chief, Photoduplication Service,
Library of Congress.
correct another person’s misbehavior, it is 
better to feel sorry and accept it. For exam- 
ple, Item 87 indicates that it is better to for- 
give a friend for stealing than to report her 
to the authorities. Thus, there is an element 
of in-group cohesiveness similar, perhaps, to 
honor among thieves, where there is a sort 
of mutual protection against authority figures 
and their rules and regulations. 

As might be expected, Ss with the highest 
loadings on this factor are found principally 
in the Laurelton sample. This is substantiated 
by the fact that of the five Ss with loadings 
of .40 or greater, four are in the institutional 
sample. The large number of Ss with negative 
factor loadings in the noninstitutional sample 
further supports this opinion. This is under- 
standable when one considers that it is the 
rebellious and negative attitude toward au- 
thority in general that has been postulated 
as a major factor in delinquency. This factor, 
then, would seem to differentiate individuals 
who allow their own needs for pleasure and 
group enjoyment to interfere with the adher- 
ence to adult requests, from those who respond 
diligently and appropriately to such situations.
The presence of negativism and rebelliousness
toward adults is perhaps the most
prominent characteristic of this factor.


Discussion 


The chief purpose of the study was to isolate
and identify common patterns of response
to social value-need conflict situations.
This was achieved through a factor analytic 
procedure and the extracted factors were in- 
terpreted. 

The factors, however, were extracted from 
a matrix of intercorrelations between indi- 
viduals from two combined samples. Thus, 
the factor loadings of the two samples are not 
independent, since all the Ss were intercor- 
related. The samples were tested, however, 
for any differences in variables that might 
confound the data, such as age, intelligence, 
achievement, and socioeconomic status. The 
items were originally designed and judged for 
the absence of inherent differences in the 
items themselves which might favor one 
group. If these precautions were adequate, 
then any differences observed in factor structure
between the two groups would be influenced
by the presence or absence of the variable
of institutionalization.

Although the design of the study does not
permit a statistical analysis of group differences
in factor loadings, the observed differences,
nevertheless, may be useful in a tentative
evaluation of the function of the fact of
institutionalization.

Factor I, the factor that extracted the
greatest amount of variance, and Factor III
seem to be weighted by individuals found in
the noninstitutional sample. That is, Ss who
have yet to come into conflict with authority
to the extent that institutionalization would
be warranted possess the highest factor loadings
on these factors. It would seem then that
the interpretation of these factors would be
important in the process of considering
applicants for commitment.


Factors II and IV do not appear to discriminate
between the samples and are probably
not directly related to the variable of
institutionalization.

Factors V and VI are weighted by individuals
in the Laurelton sample. That is, Ss
who have been committed to an institution
possess the highest factor loadings on these
factors. The majority of Laurelton students
are committed primarily because they have
become a social problem in their community.
Therefore, a test heavily weighted with Factors
V and VI may be useful in discriminating
institutional and noninstitutional samples.

The preceding analysis suggests the feasibility
of developing these measures to discriminate
subsamples of retardates. For example,
the selection of parole students from
institutionalized populations might be facilitated.
Similarly, a screening device for retardates
attending special education classes
in the public schools could be developed.


SUMMARY 

The purpose of this investigation was to
determine the nature of common patterns of
solutions to social value-social need conflict
situations in mental retardates. To isolate and
identify the common patterns of conflict solutions,
a scale consisting of 95 items, devised
to measure social value-social need conflict,
was administered to 50 white, female retardates
between the ages of 14 and 18 with
an IQ range of 55 to 80. Half of these Ss
were drawn from an institutionalized population
in a state training school for female retardates.
The other 25 were drawn randomly
from a population of retardates of similar age,
intelligence, race, and socioeconomic status
attending special education classes in 12 public
schools. Each item was constructed so that
the S was required to make a forced-choice
response, on the basis of what was believed
to be the best choice, to a familiar, social
conflict situation.

One choice of response was based on a social
value and the opposite choice was based
on a social need. The social value dimension
was arbitrarily divided into honesty, loyalty,
moral courage, and responsibility. The social
need dimension consisted of nurturance,
succorance, dominance, affiliation, and deference.
Each value was paired with each need,
resulting in 20 possible combinations of situations.
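The count of 20 follows from crossing the four values with the five needs (4 × 5). A brief sketch, in which the variable names are illustrative only:

```python
from itertools import product

VALUES = ["honesty", "loyalty", "moral courage", "responsibility"]
NEEDS = ["nurturance", "succorance", "dominance", "affiliation", "deference"]

# Every value paired with every need.
pairings = list(product(VALUES, NEEDS))
print(len(pairings))  # 20
```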

The intercorrelations of the Ss on their responses
to the items were factor analyzed and
orthogonal rotations revealed six definable
factors.

Factor I stressed a passive, compliant approach
to authority figures with responsibility
being the most dominant value orientation.
Factor II emphasized loyalty and the protection
of others with loyalty being the most
dominant value orientation. Factor III was
tentatively defined as a preference for submission
in situations involving authority figures
but with more assertive behavior in
relationships with peer groups. Factor IV
stressed honesty and moral courage, suggesting
an assertive, dominant individual who
prefers to participate in the correction of the
misbehavior of others. Factor V emphasized
the need to be dominant in relationships with
peer groups. Values are recognized, but are
frequently distorted and employed as means
with which to gain dominance. Factor VI
stressed a negative, rebellious approach to social
situations involving authority figures with
an underlying need for affection and pleasurable
activities dominating behavior.

Some implications of the factors were dis- 
cussed. The feasibility of employing this or 
a similar instrument to differentiate institu- 
tional and noninstitutional retardates was ex- 
plored. 
SOCIAL DESIRABILITY AND THE FORCED CHOICE METHOD


MARVIN J. FELDMAN AND NORMAN L. CORAH1
University of Buffalo 


When subjects (Ss) respond to personality
questionnaires, it cannot be assumed that the
test instructions serve as the only or even the
major set followed by the Ss. Other self-imposed
sets may bias test responses. Meehl
and Hathaway (1946) and Cronbach (1946)
have both cogently described how test taking
attitudes intrude upon personality tests. More
recently, Edwards (1957) has stressed the
importance of social desirability (SD) stereotypes
as a biasing factor. He seems to suggest
that this particular response set is so
pervasive that it impairs the validity of most
standard personality inventories. For Edwards,
the forced choice technique affords a
possibility of minimizing the effects of SD.

His own scale, the Edwards Personal Preference
Schedule (EPPS), is constructed accordingly.
The forced choice format calls for
responses to pairs of statements; the pairs
presumably have been equated for SD on the
basis of prior ratings of the separate components
of the pair.


In an earlier study (Corah, Feldman,
Cohen, Gruen, Meadow, & Ringwall, 1958)
of part of the EPPS, it was found that for
the majority of pairs Ss could readily choose
one member of the pair as being more desirable
... was a high correlation
... of the pairs and degree
of endorsement of the items within the
pairs. A replication of this study on the entire
EPPS (Edwards, Wright, & Lunneborg,
...) ... somewhat lower degree of
... SD ratings and item endorsement
... that single statements may acquire contextual
meaning when paired; hence their SD values
may be somewhat altered. An alternative

1 Now at Washington University Medical School.
possibility is that item pairs on the EPPS 
have not been matched adequately. Edwards 
(1957) noted the difficulty of securing exact 
matches, for although he tried to match pairs 
with a difference no greater than .50 of a 
scale unit (scale units being determined by 
the method of successive intervals) about 
25% of the pairs on the EPPS differed by 
more than this amount. It is possible that 
more stringent criteria for matching pairs 
might indeed minimize the influence of SD. 
On the other hand, it is also possible that the 
forced choice method itself produces inherent 
difficulties. The present study was conducted 
to elucidate these two possibilities.


PROCEDURE 


Items pertaining to achievement, hostility, and sex
were culled from three sources: (a) the EPPS, (b)
the Minnesota Multiphasic Personality Inventory
(MMPI), and (c) the Buss and Durkee (1957)
hostility inventory. The writers also devised additional
items. This preliminary harvest yielded 132 items
written as first person statements.

To insure the face validity of the items, three
experienced clinical psychologists independently rated
each item for its appropriate need category. A
doubtful category was included for items which did not
fall readily into the three categories. Items were
retained for further use only if all three judges agreed
on the need category being represented or if two
agreed and the third placed the item in the doubtful
category. A pool of 126 items was retained including
30 achievement, 66 hostility, and 30 sexual items.

The 126 items were randomly mixed in the usual
personality questionnaire form. The questionnaire
was then administered to 100 male students in
introductory psychology at the University of Buffalo.
They were instructed to rate each separate
item on a nine-interval scale ranging from extreme
social undesirability to extreme social desirability.
To facilitate the next step, that is, matching items
for social desirability, the items were scaled by the
method of successive intervals (Edwards, 1952).
Procedure to this point was essentially the same as
the method used by Edwards in constructing the
EPPS.


On the basis of matching scale values, 36 item
pairs were formed.2 Because our aim was to match
as closely as possible, a stringent criterion of .10 of
a scale unit was adopted for matching which
contrasts with Edwards' use of .50 of a scale unit.
Actually it was possible to comprise 30 of the item
pairs with a discrepancy of only .05 or less of a
scale unit. It may be of interest to note that the
achievement items had the widest range of SD
ratings while few hostility items were rated extremely
favorable and few sex items extremely unfavorable.
Nevertheless, in matching pairs, an attempt was
made to cover as wide a range of social desirability
as this particular pool of items permitted. The 36
item pairs were composed as follows:


6 ach-ach 6 ach-host 
6 host-host 6 ach-sex 
6 sex-sex 6 host-sex 


In each set of six pairs combining two needs, one
need was presented in the first position in three
pairs and in the second position in the other three
pairs.
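The matching criterion described above (.10 of a scale unit here, against .50 for Edwards) can be sketched with a simple greedy rule: sort the items by scale value and pair neighbors whose values differ by no more than the criterion. The authors' actual matching procedure, which also balanced need categories, is not described in detail, so the rule, the function name, and the item data below are all hypothetical.

```python
def match_pairs(items, max_gap=0.10):
    """Greedily pair items whose SD scale values differ by <= max_gap.

    items -- list of (label, need, sd_scale_value) tuples
    Returns a list of (item, item) pairs; unmatched items are dropped.
    """
    ordered = sorted(items, key=lambda it: it[2])
    pairs = []
    i = 0
    while i < len(ordered) - 1:
        first, second = ordered[i], ordered[i + 1]
        # Small tolerance guards against floating-point rounding.
        if second[2] - first[2] <= max_gap + 1e-12:
            pairs.append((first, second))
            i += 2
        else:
            i += 1
    return pairs


# Invented scale values, for illustration only.
items = [("a", "ach", 1.20), ("b", "host", 1.25), ("c", "sex", 2.00),
         ("d", "ach", 2.45), ("e", "sex", 2.50)]
strict = match_pairs(items, max_gap=0.10)  # criterion used in this study
loose = match_pairs(items, max_gap=0.50)   # criterion attributed to Edwards
```

Under the stricter criterion item "c" finds no partner, which illustrates why a tight tolerance reduces the number of usable pairs from a fixed pool.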


The 36 item pairs, arranged in random order, were
administered to a new group of 98 male students.
The test instructions were as follows:

Here are 36 pairs of statements. Select the
statement in each pair which you think is more
socially desirable, that is, the statement that would
make a person look better to other people if it
were said of him.


RESULTS 


If SD was controlled by the matching, then
within the limits of sampling errors, each
member of a pair would be chosen with equal
frequency (p = .50) as being more desirable.
The binomial expansion was used as a basis
for testing this proposition. Before considering
the main results, we tested for the presence
of a different kind of response set, that
is, a systematic position preference to select
the first or second member of the pair.
The findings were negative; hence we conclude
that position response sets were
negligible in determining responses to
item pairs.
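The test of the p = .50 proposition for a single pair can be sketched with an exact binomial computation over the 98 judges. The doubling of the smaller tail is one common two-sided convention (exact for a symmetric null such as p = .50); the text does not say whether the authors used exact terms or an approximation, so this is an assumption, as are the function and variable names.

```python
from math import comb


def two_sided_binomial_p(k, n, p=0.5):
    """Two-sided binomial p value: twice the smaller tail, capped at 1."""
    pmf = [comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(n + 1)]
    lower = sum(pmf[: k + 1])  # P(X <= k)
    upper = sum(pmf[k:])       # P(X >= k)
    return min(1.0, 2 * min(lower, upper))


n = 98  # number of judges in the second group
# Smallest majority that departs significantly from an even split.
critical = next(k for k in range(n // 2, n + 1)
                if two_sided_binomial_p(k, n) < 0.05)
```

A pair would be counted as showing a significant SD difference when the number of judges preferring one member reaches `critical` or more.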
2 Copies of either the 126-item questionnaire or
the paired items may be obtained ...

Turning to the main results, Table 1 shows
that about half the pairs (19 of 36) had
significant differences in SD ratings favoring
one member of the pair. This number of
significant differences is far beyond chance ex-
TABLE 1

NUMBER OF PAIRS SHOWING SIGNIFICANT DIFFERENCES
IN SOCIAL DESIRABILITY RATINGS

                  No. of significant
                  differences within
                  each set of six pairs   A more favorable   B more favorable
Paired needs      (.05 level)             than B             than A

A.ach-B.ach               3
A.host-B.host             1
A.sex-B.sex               3
A.ach-B.host              5                      3                  2
A.ach-B.sex               4                      4                  0
A.host-B.sex              3                      2                  1


pectations. It is evident that our stringent 
matching procedures failed to equate items 
for SD. This is further illustrated by the fact 
that the percentage of Ss choosing the A 
member of each pair as being more desirable 
ranged from 12 to 85%. Even items paired 
within the same need category, e.g., ach-ach 
pairs, were evaluated differentially. On the 
basis of the results of our previous study 
(Corah et al., 1958), we had anticipated a 
tendency for achievement items to be rated 
more favorably when paired with the other 
needs, but as can be seen this only held for 
the ach-sex pairings. 
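The chance expectation referred to above can be made concrete. With 36 pairs each tested at the .05 level, about 1.8 significant differences would be expected by chance alone, against the 19 observed. The tail probability below additionally assumes that the 36 tests are independent, which is a simplification, since the same judges rated every pair.

```python
from math import comb


def binom_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i)
               for i in range(k, n + 1))


n_pairs, alpha, observed = 36, 0.05, 19
expected_by_chance = n_pairs * alpha          # 1.8 pairs
tail = binom_tail(observed, n_pairs, alpha)   # vanishingly small
```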

The 36 pairs were grouped into the 12 pairs
with the lowest (most undesirable), 12 with
the middle, and 12 with the highest SD ratings.
We then simply counted the number of
significant pair differences occurring within
each zone which turned out to be 9, 2, and
8 for low, middle, and high, respectively. Thus
it is apparent that the pairs were equated
relatively better for SD in the middle range
than at either extreme.


Discussion 


The present findings confirm the results of
our previous study in suggesting that social
desirability is not readily minimized by the
use of the forced choice format even with
extremely careful matching. Perhaps the method
itself bears closer scrutiny. It seems possible
that the paired format invites a comparison
between the components of the pair which
leads to finer evaluative distinctions than
would be elicited in responding to single
statements. In this respect, the forced choice
method may actually heighten the problem
of control for social desirability.

It would seem to follow that more atten- 
tion must be paid to item matchings if the 
forced choice method is to realize the objec- 
tive of controlling for SD. If the test con- 
structor bases his matchings on SD ratings 
of single statements, then our results suggest 
the greater likelihood of equating pairs drawn 
from the middle range of SD. Such a restric- 
tion on item selection, however, poses serious 
limitations for many purposes when extreme
statements might be more appropriate. We
had suggested that:

The method of successive intervals to scale items
achieved only a first approximation of equal pairings,
and that additional judgments of the item
pairs themselves are necessary along with revisions
in pairing before the variable of social desirability
can be eliminated from the EPPS (Corah et al.,
1958).


This method of successive approximation
was used by Gross (1959) in constructing a
forced choice scale measuring inner-other
direction. Statistical analysis revealed that the
resulting item pairs were equated for SD. The
method is tedious but Gross’ work demonstrates
that greater control of SD can be
achieved.

Our research is certainly consistent with
Edwards’ findings (1957) that SD is a pervasive
factor. It is not surprising that verbal
tests are markedly affected by such test taking
attitudes. Along related lines, Osgood’s
work (1952) points up the fact that the
connotative meaning of words derives in good
measure from an evaluation factor, that is,
people can hardly use words without assigning
an evaluative meaning to them. We would
argue that the forced choice format offers
possibilities of controlling SD, but this form
of inventory may heighten awareness of the
comparative desirability of the component
members of a pair; hence, control of SD
requires special care in matching pairs.


SUMMARY 


The present study was a careful attempt to
explore the adequacy of forced choice item
pairs in eliminating the factor of social
desirability. Items related to the needs of
achievement, sex, and hostility were rated
separately on a nine-interval SD scale by 100
men. Items were then paired both within and
between need areas on the basis of their scale
values. Stringent criteria for matching were
used. The 36 pairs obtained were given to another
group of 98 men to see if they could
distinguish between the members of each pair
on the basis of SD. About one-half of the
pairs yielded significant differences in these
judgments. The items of pairs with low and
high SD values appeared to be more easily
distinguished than those with middle values.
It was concluded that carefully matched
items in a forced choice format do not readily
minimize social desirability and that the
forced choice method may actually heighten
an S’s ability to make discriminations on the
basis of this factor.
THE CONNOTATIVE MEANING OF SEXUAL SYMBOLS
According to Freud (1953a, p. 354), symbols
derive their meaning from their similarity in
form to the male or female genitalia; pointed
or elongated objects are therefore male symbols,
while round, containing objects are female
symbols. The meaning, according to
Freud, is inherent in the structure of the object
itself, and therefore, at least by implication,
transcends cultural differences.

Although Freud’s theory forms the basis of
assumptions frequently made in the interpretation
of dreams, projective test responses,
and artistic productions, recent studies present
conflicting evidence concerning its validity.
The findings of Starer (1955), Jones
(1956), and Stennett and Thurlow (1958)
support the Freudian hypothesis, while those
of Levy (1954) and Barker (1957) do not.

In previous studies, the experimental procedures
involved some variation of matching
elongated or rounded objects with a masculine
or feminine designation. In general, the methods
used may be grouped into several levels
of directness. The matching of objects with
male or female names (Levy, 1954; Starer,
1955; Stennett & Thurlow, 1958) is the most
direct and, as Levy recognized, the most open
to conscious control or to defensive behavior.
Jones (1956) and Barker (1957) used
similar approaches, asking subjects (Ss) to
identify the object as male or female in personality,
or to make it a masculine or feminine
character in a children’s book. While, in
essence, this method is still fairly direct, Barker’s
introduction of a reaction time measure
served as a check on blocking or avoidance
responses. Levy also utilized a paired-associates
learning task, perhaps the most indirect
of the methods.

Nevertheless, the basic task remains the
same, one of “naming” or classifying, and

there is considerable doubt that this denota- 
tive function is the most appropriate for the 
testing of the Freudian hypothesis. One of the 
most important elements of symbolic mean- 
ing remains untouched, namely the connota- 
tive, or “feeling tone,” aspect of the symbol. 
In clinical use, it is this aspect which is the 
most significant. The meaningfulness of a 
symbol, sexual or otherwise, lies in the emo- 
tional atmosphere it creates. It is therefore 
the purpose of this study to test the validity 
of the Freudian theory of symbols in terms of 
connotative rather than denotative meaning. 

Barker’s research (1957), upon which the 
present study is based, introduced the cul- 
tural meaning of symbols as a major variable. 
She showed her Ss drawings of objects which 
could be classified as male or female according
to Freudian theory or to cultural use. The
stimuli were classified into four categories: 
(a) those with Freudian sexual designations, 
but low in cultural sexual meaning (i.e., snail, 
fish); (b) those for which Freudian designa- 
tions and cultural associations were in agree- 
ment (i.e., purse, gun); (c) those for which 
Freudian and cultural sexual designations 
were in conflict (i.e., needle, drum); and (d) 
those which had neither Freudian nor cultural 
sexual meaning (i.e., book, table). Barker
found that, where culture was not a vari- 
able, sexual designations did not differ from 
chance; where culture was a variable, sexual 
designation was in terms of cultural meaning, 
whether it agreed with or conflicted with the 
Freudian prediction. 

The hypothesis to be tested in the present
research is as follows: Regardless of the degree
to which they conform to cultural associations,
pointed or elongated objects are
designated as male, while rounded, containing
objects are designated as female.
In addition, since the two studies with
negative findings (Barker, 1957; Levy, 1954)
used children as Ss, while the other three used 
young adults, Stennett and Thurlow (1958) 
have suggested that age may be the signifi- 
cant variable. The present research is de- 
signed to cast light on this issue. 


METHOD 


The Ss of this research were 49 undergraduate stu- 
dents enrolled in an introductory psychology course 
at Long Island University. There were 17 males and 
32 females. 

Both denotative and connotative designations of 
the sexual meaning of each symbol were obtained. 
The denotative designation was obtained by asking 
Ss to indicate whether each of the 24 symbols would 
be most appropriate as a male figure or a female
figure if it were used in a children’s storybook. The 
connotative designations were obtained by asking Ss 
to describe the 24 symbols plus a male and a female 
figure on 12 scales of the semantic differential. 

To determine whether an S described an object on
the semantic differential as male or female, difference
scores were computed between the S’s ratings of each
symbol and his ratings of the male and female figures.
If the difference in ratings over the 12 scales
between the symbol and the male figure was smaller
than the difference between the symbol and the female
figure, that symbol was designated as male.
Similarly, if the difference between the symbol and
the female figure was smaller than the difference
between the symbol and the male figure, that symbol
was designated as female.
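The difference-score rule can be sketched as follows. The text does not say how the differences over the 12 scales were combined, so summing absolute differences is an assumption (a root sum of squares would be another possibility); the handling of ties, the function names, and the ratings are likewise hypothetical.

```python
def total_difference(ratings_a, ratings_b):
    """Sum of absolute rating differences over the scales (assumed metric)."""
    return sum(abs(x - y) for x, y in zip(ratings_a, ratings_b))


def connotative_designation(symbol, male_figure, female_figure):
    """Designate a symbol by whichever figure its ratings lie closer to."""
    d_male = total_difference(symbol, male_figure)
    d_female = total_difference(symbol, female_figure)
    if d_male < d_female:
        return "male"
    if d_female < d_male:
        return "female"
    return "tie"  # how ties were handled is not stated in the text


# Invented ratings by one S on 12 seven-point scales.
symbol = [6, 5, 6, 2, 3, 6, 5, 2, 6, 5, 3, 6]
male_figure = [6, 6, 5, 2, 2, 6, 6, 3, 5, 5, 2, 6]
female_figure = [2, 3, 2, 6, 5, 3, 2, 6, 3, 2, 6, 2]
designation = connotative_designation(symbol, male_figure, female_figure)
```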

In all cases, line drawings of the symbols were
presented to the Ss, and they were asked either to
designate the symbol drawn as male or female or to
describe it using the 12 rating scales. All Ss first rated
the symbols on the semantic differential and then 
made the denotative designations. 

Of the 24 symbols used in the present study, 23
were obtained from Barker’s study (1957), using the
designations based on psychologists’ judgments of
Freudian and cultural meanings. One symbol, tobacco
jar, was added for the present study and was
designated as female according to Freudian theory and
male according to cultural theories.

The 24 symbols were divided into four groups, six
symbols in each group. Group 1 consisted of six
symbols with a clear Freudian meaning but without
an agreed upon cultural designation. These were,
according to Freudian theory, three female symbols
(snail, suitcase, apple) and three male symbols (fish,
ice cream cone, 3). Group 2 consisted of six symbols
about which the Freudian and cultural theories
agreed. These were three male symbols (hammer,
airplane, gun) and three female symbols (stove,
vase, hatbox). Group 3 consisted of three symbols
whose symbolic sexual meaning according to Freudian
theory was male, but whose cultural associations
were female. These were: needle, broom, lipstick.
The other three symbols in Group 3 were
those whose symbolic sexual meaning according to
Freudian theory was female, but whose cultural
associations were male. These were: tobacco jar,
basketball net, drum. Thus, Group 3 comprised symbols
for which the Freudian and cultural theories
were opposed in designation of symbolic sexual
meanings. Group 4 consisted of six symbols about
which there was neither a clear-cut Freudian nor
cultural meaning. These were: leaf, table, 9, chair,
book, glasses.


RESULTS 


Two sets of data were obtained: the denotative
designations of either male or female
for each symbol and the connotative responses
inferred from the semantic differential ratings.
For each symbol, the binomial probability was
computed using the normal curve approximation.
Using a two-tailed test, at least 31.4 of
the 49 choices must be either male or female to
achieve significance at the .05 level. The data
and results of the analysis are summarized in
Tables 1 and 2.
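The criterion of 31.4 can be reproduced from the normal approximation to the binomial with N = 49 and p = .50: the mean is 24.5, the standard deviation is 3.5, and the two-tailed .05 cutoff lies 1.96 standard deviations above the mean. The sketch below applies no continuity correction, which is an assumption that happens to reproduce the reported figure.

```python
from math import sqrt
from statistics import NormalDist

n, p = 49, 0.5
mean_choices = n * p                   # 24.5
sd_choices = sqrt(n * p * (1 - p))     # 3.5
z = NormalDist().inv_cdf(0.975)        # about 1.96 for a two-tailed .05 test
critical = mean_choices + z * sd_choices
print(round(critical, 1))  # 31.4
```

In whole Ss this means that a split of 32 to 17 or more extreme reaches significance, while a split of 31 to 18 does not.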


Denotative Designation 


Table 1 summarizes the results concerning
the denotative designations of the 24 symbols.
Group 1 contained those symbols for which
there was a Freudian designation but no clear-cut
cultural designation of sexual meaning.
As indicated in Table 1, no consistent pattern
of results emerged from these data. For
the three symbols designated in Freudian
theory as feminine, snail and suitcase were
chosen as masculine, and apple was chosen as
feminine. For the three symbols designated as
masculine in Freudian theory, the distribution
for 3 did not differ significantly from chance,
ice cream cone was feminine, and fish masculine.

For Group 2, which comprised those symbols
for which Freudian and cultural theories
were in agreement, the results were in accord
with theoretical expectation. Hammer, airplane,
and gun were designated as masculine
and stove, vase, and hatbox were feminine.
In Group 3, which contained the symbols
about which Freudian and cultural theories
were in opposition, the results clearly supported
the cultural theories. Needle, broom,
and lipstick were designated as feminine, and
tobacco jar, basketball net, and drum were

TABLE 1

DENOTATIVE DESIGNATIONS
Number of Subjects

                    Male Ss        Female Ss         Total
Symbol            Masc.  Fem.    Masc.  Fem.    Masc.  Fem.     p*

Group 1
  Snail            10      7      25      7      35     14     .05
  Suitcase         11      6      20     12      31     18     ns
  Apple            10      7       9     23      19     30     ns
  3                12      5      18     14      30     19     ns
  Ice cream cone   12      5       1     31      13     36     .05
  Fish             13      4      27      5      40      9     .05
Group 2
  Hammer           17      0      32      0      49      0     .05
  Airplane         17      0      30      2      47      2     .05
  Gun              17      0      32      0      49      0     .05
  Stove             0     17       2     30       2     47     .05
  Vase              7     10      10     22      17     32     .05
  Hatbox            0     17       0     32       0     49     .05
Group 3
  Needle            2     15       5     27       7     42     .05
  Broom            ...    ...     ...    ...     ...    ...    ...
  Lipstick          3     14       1     31       4     45     .05
  Tobacco jar      ...    ...     ...    ...     ...    ...    ...
  Basketball net   16      1      32      0      48      1     .05
  Drum             17      0      32      0      49      0     .05
Group 4
  Leaf              8      9      15     17      23     26     ns
  Table            ...    ...     ...    ...     ...    ...    ...
  9                ...    ...     ...    ...     30     19     ns
  Chair            10      7      18     14      28     21     ns
  Book             ...    ...     ...    ...     ...    ...    ...
  Glasses          ...    ...     ...    ...     22     27     ns

* Probabilities computed on basis of total distribution.


masculine. These designations, of course, are
in accord with the cultural associations of the
symbols rather than the form of the object.
In Group 4, for which there was neither a
Freudian nor cultural designation, the distribution
of masculine and feminine designations
did not differ from chance for any of
the symbols.

These data are in complete agreement with
the results reported by Barker (1957). In the
denotative designation of the sexual meaning
of symbols, the designation clearly is determined
by the cultural association of the object
rather than by its form.
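
The source does not say which test produced the p column of Table 1. As a minimal sketch, assuming a two-sided exact binomial test against a chance split of .50, the masculine/feminine totals for a symbol can be checked as follows. The function name is hypothetical, and the two example rows use the total counts that are legible in Table 1.

```python
from math import comb


def two_sided_binomial_p(masc: int, fem: int) -> float:
    """Two-sided exact binomial p value for a masc/fem split against chance (p = .5).

    Assumption: the paper's test is not named; this is one plausible choice.
    """
    n = masc + fem
    k = min(masc, fem)
    # Probability of a split at least as lopsided as the observed one, in one tail.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    # Double the tail for a two-sided test; cap at 1 for even splits.
    return min(1.0, 2 * tail)


# Total designations (masculine, feminine) as read from Table 1.
for symbol, masc, fem in [("Fish", 40, 9), ("Chair", 28, 21)]:
    p = two_sided_binomial_p(masc, fem)
    print(symbol, "significant at .05" if p < 0.05 else "ns")
```

Under this assumed test, a 40 to 9 split falls below the .05 level while a 28 to 21 split does not, which matches the direction of the entries reported for those rows.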


Connotative Designation 
Table 2 summarizes the data obtained from
the inferred designations based on the semantic
differential descriptions. Group 1 contained
those symbols for which there was a
Freudian but no culturally relevant sexual
meaning. The distribution for only one symbol
attained significance, ice cream cone as
feminine, which contradicted Freudian theory.

In Group 2, the three symbols, hammer, 
airplane, and gun, which both Freudian and 
cultural theories designate as masculine were 
indeed described as masculine by the Ss in 
this study. However, the symbols theoretically 
designated as feminine revealed contradictory 
results: stove was described as masculine, hat- 
box as feminine, and the distribution for vase 
did not differ significantly from chance. 

In Group 3, the distributions for needle and 
broom did not differ significantly from chance, 
while lipstick was described as feminine, in 
contradiction to Freudian theory. Tobacco jar 
was described as feminine, which supports the 
Freudian position, but both basketball net 
and drum were described as masculine, which 
supports cultural theories. Thus, for those 
symbols about which Freudian and cultural 
theories are in opposition, no clear-cut pat- 
tern emerged. 

As would be expected on the basis of both 
Freudian and cultural points of view, the 
symbols in Group 4, which had no clear theo- 
retical designation, showed no consistent pat- 
tern of results. Leaf was described as femi- 
nine, and table and chair as masculine, but 
the distributions for 9, book, and glasses did 
not differ significantly from chance. 

In summary, the connotative designations 
obtained on the basis of semantic differential 
descriptions did not reveal as clear a pattern 
of results as did the direct designations. How- 
ever, the data tend to contradict rather than 
support the Freudian position. Of the six 
symbols about which there was theoretical op- 
position, the results for three symbols sup- 
ported the cultural point of view, while re- 
sults for only one symbol were in the direc- 
tion predicted by the Freudian position. When 
both Freudian and cultural theories were in 
agreement that the symbols were masculine, 
as in Group 2, the results were in substantial 
agreement with both theories. But when there 
was a Freudian designation with no particu- 
lar cultural designation, the only distribution 
which differed significantly from chance was 
in the direction opposite to that predicted by 


Freudian theory. 
TABLE 2
CONNOTATIVE DESIGNATIONS 
Number of Subjects 
Male Ss Female Ss Total Ss 
Symbol Masc. Fem. Masc. Fem. Masc. Fem. p 
Group 1 
i 10,5» 6.5 17 15 27.5 21.5 ns 
Se 12 5 18.5 13.5 30.5 18.5 us 
a le 5.5 11.5 17.5 14,5 23 26 ns 
3 E 15 9.5 12 20 19.5 29.5 w 
LS 15.5 10 22 11.5 375 
= eq 7 10 12 20 19 30 ns 
Group 2 p 
13 4 28.5 3.5 41.5 75 05 
eeu 15 2 26.5 5.5 41.5 7S 05 
Gun 15 2 20 12 35 14 05 
Stove 15 2 23 9 38 11 05 
Vase 9.5 7.5 15 17 24.5 24.5 ns 
Hatbox 3.5 13.5 12 20 15.5 33.5 05 
Group 3 
Needl 4 13 15 17 19 30 ns 
coi 9 8 11.5 20.5 20.5 28.5 ns 
Lipstick 6 11 10 22 16 33 05 
Tobacco jar 5 12 10 22 15 34 05 
Basketball net 15 2 18 2 14 33 16 05 
Drum 9 8 22.5 9.5 31.5 17.5 05 
Group 4 
ees 2 15 11.5 20.5 13.5 35.5 05 
Table 12 15 20.5 11.5 32 05 
5 m 2.5 16.5 
9 8.5 18.5 22.5 9.5 31 18 wa 
Chair 11 1S 21 11 | 33 16 05 
Book 11.5 15.5 13 19 24.5 24.5 ns 
Glasses 7 10 16 16 23 A Aa 


a Probabilities computed on basis of totals.
b In cases of ties, 0.5 was credited to each designation.


DISCUSSION


The results of this study do not support the 
Freudian theory of the meaning of symbols.
Where culture was not a factor, there was no 
clear-cut definition of sexual meaning; where 
culture was a factor, it was, in general, cul- 
ture rather than form which determined the 
sexual meaning of the object. This was true 
for both the denotative and connotative desig- 
nations of sexual meaning. 

Freud’s (1953b) own clinical use of symbols
was characterized by interpretation based
upon a combination of the patient’s individual
associations to objects in dreams and the
application of the theory tested here. It is clear
that, if cultural factors determine sexual meaning,
universality of meaning can exist only to
the extent that cultural elements are similar
or identical for large numbers of people. To
understand sexual symbols for a given patient,
therefore, individual associations may
be made more meaningful by an understanding
of the relationship of the symbol to sex
role expectations in his particular cultural
background. This view is in essential agreement
with Fromm’s (1951) statements
concerning dream interpretation.
The conformity of sexual meaning to cultural
expectation was clearer on the direct, or
denotative, designations than on the inferred,
or connotative, designations. This may be due,
in part, to the relatively lower reliability of
scores derived by this method, or it may be
due to the complexity of factors contributing
to connotative responses. Moreover, the cultural
sexual designation presumably was based
on associations related to sex role; to the degree
that the sex-role associations may have
varied in clarity for the stimulus objects, variability
in designation might be expected. In
this regard, it is interesting that culturally female
objects are described as such somewhat
less consistently and strongly than culturally
male symbols are designated as male.
The clearest inferred designation of sexual
meaning occurred when form and culture coincided
on masculine objects. This is related,
in part, to the fact that masculine items were
identified more consistently than feminine
items. But, unless this is a chance result, this
is the only indication in our findings that
form plays any role in sexual meaning. It is
possible that a complex function operates,
wherein the Freudian interpretation, as a
part of modern culture, may operate to an
unknown degree under certain unspecifiable
conditions. Where form was the only determinant,
sexual meaning was indeterminate.
In the studies which have supported the
Freudian view, the stimuli were abstract
forms rather than familiar objects, thus having
no specific cultural meaning. It seems possible
that, under these conditions, Ss utilized
whatever associations were available to them,
and that either similarity to sex organs
or the knowledge that such a theory
exists determined the responses, at least under
direct testing. This is of interest, of
course, as is the question of what other conditions
might lead to similar determinations
of meaning; on the other hand, when
dreams or responses to projective tests are
being interpreted, the symbols involved are
almost always objects rather than geometric
forms.

Inspection of our results reveals that men 
and women did not differ in their connotative 
reactions to the objects. Moreover, the repeti- 
tion of Barker’s method (1957) with young 
adults resulted in a replication of the results 
she obtained with children. It is, therefore, 
not likely that age is an important factor. 


SUMMARY 


The Freudian hypothesis that pointed or 
elongated objects are male symbols, while 
round, containing objects are female symbols 
was tested using both denotative and conno- 
tative responses. For both types of designa- 
tions, the results did not support the Freud- 
ian theory. In general, culture rather than 
form determined the sexual meaning of the 
objects. These results were discussed in regard
to clinical interpretation of the meaning
of symbols and in relation to previous research.
COMMUNICATED ANXIETY IN A TWO-PERSON
SITUATION 1


PATRIK O. MATTSSON 
The interpersonal basis of anxiety has long
been recognized in psychological theory. 
Among others, Sullivan (1953), Fromm 
(1941), and Horney (1937) have proposed 
theories based upon interpersonal relations. 

Sullivan (1953) described the process un- 
derlying the interpersonal communication of 
anxiety in terms of “empathy” (p. 41). Sul- 
livan’s (1947) writing indicates that “eu- 
phoria” or the relief of anxiety can similarly 
be conveyed through empathy. While he con- 
sidered empathy to be most applicable to de- 
scribe the arousal of anxiety in the infant by 
the mother, he also assumed that this mode 
of emotional communication can last through- 
out life. Although he pointed out that inter- 
personal communication of anxiety is gener- 
ally recognized in psychological practice, Sul- 
livan did not offer further interpretation of 
the nature of empathy. 

While previous experimental studies have
not directly investigated interpersonal communication
of anxiety between two physically
present persons, variables relevant to anxiety
in an interpersonal situation have been
considered. Reiser, Reeves, and Armington
(1955) demonstrated that release of emotional
tension through verbalization depended
on the social relationship of the persons interacting.
Seidman, Bensen, Miller, and Meeland
(1957) found that subjects (Ss) had
greater tolerance for self-administered electric
shock when together with another person
who they thought shared the shock. Ruesch
and Prestwood (1949) found that listening
to voice and speech recordings of anxious patients
in psychotherapy induced anxiety in
the listeners.

1 The present paper is part of a dissertation presented
in partial fulfillment of the requirement for
the degree of Doctor of Philosophy at Teachers College,
Columbia University, 1959. The writer wishes
to express his thanks to Joel R. Davitz (doctoral
committee chairman), Kenneth F. Herrold, Rosalea
A. Schonbar, and Laurance F. Shaffer for their invaluable
advice and criticism.

A number of studies have approached the
problem of communicated emotions through
animal experimentation. Masserman (1943),
Liddell (1950), Davitz and Mason (1955),
and Lirzman (1955) demonstrated that the
presence of a nonfearful animal had a calming
effect on a fearful one.


The purpose of this study was an experimental
investigation of increase and reduction
of anxiety in an interpersonal setting.
The following hypotheses were tested:

1. A person who interacts with a more anxious
person will himself become more anxious.

2. A person who interacts with a less anxious
person will himself become less anxious.


DESIGN


The experimental procedure involved the interaction
of pairs of Ss in a cooperative task situation.
The setting in which interaction took place was
designed to promote maximum opportunity for communication,
both verbal and nonverbal. Anxiety was
experimentally induced by means of electric shock.
Changes in Ss’ anxiety were measured through
physiological and psychological manifestations.


Subjects 


The Ss were 80 undergraduate and graduate
students at Columbia University. Median age was
20, and the interquartile range was 19 to 2[?]. The
Ss who worked together did not know each other
previously. In order to obtain at least a rough estimate
of the S’s chronic level of anxiety, a short form
of the Taylor Manifest Anxiety Scale (MAS) was
administered (Bendig, 1956). The groups of Ss compared
for the purpose of testing the hypotheses were
not significantly different from each other (p > .30)
in terms of anxiety as measured by this scale.

The Ss were randomly assigned to one of the following
four kinds of experimental situations:
nonshocked S working with shocked S, NS(wS);
shocked S working with nonshocked S, S(wNS);
nonshocked S working with another nonshocked S,
NS(wNS); shocked S working with another
shocked S, S(wS).


Experimental Induction of Anxiety


Electric shock is a harmless but rather
effective means of raising the anxiety level of an individual
(Ericksen & Wechsler, 1955). In this study
the shock was generated by a transformer
supplied by a source of 3-, 6-, or 12-volt
current. The intensity of the output (electric
shock) could be varied continuously from 50 to
350 volts. Electrodes, to be attached to Ss’ wrists,
were designed so that one person alone or two persons
simultaneously could get shock. Stimulations
were set off by a silent switch, operated by the experimenter
(E), and were given in a random but
predetermined and fixed order for each S. The intensity
of shock was set according to each S’s tolerance
for such shock.

The establishment of the tolerance level for shock
was done by giving the shocked Ss electric stimulation
of low voltage, and then gradually increasing
the voltage to a point where the Ss, in their own
opinion, considered the shock to be “uncomfortable.”
The level of voltage used during the actual
experiment was twice that which the Ss had responded
to as uncomfortable.


Measures of Anxiety


The study evaluated change in anxiety through
concomitant tension changes. Physiological tension
was measured by changes in systolic blood pressure
(BP) and by changes in finger perspiration. Psychological
tension was measured by changes in response
to a self-rating scale.

The BP was measured with a sphygmomanometer with
a column graduated in millimeters. Increased
pressure was seen as an indication of
increased tension (anxiety). Each of the measures of
blood pressure used in the analysis of data was based
upon the mean of two estimates obtained on each
occasion of measurement. Based upon 25 of such
pairs of estimates, selected at random, a reliability coefficient
(Spearman rank correlation coefficient) of .99 was
obtained.

Finger perspiration was measured by a colorimetric
method similar to the one described by Mowrer,
Light, Luria, and Zeleny (1953). At each measurement
the palmar surface of one finger on each hand
was painted with a solution of ferric chloride
and allowed to dry. The treated fingers
were pressed for three minutes upon drawing paper
which earlier had been impregnated with tannic acid.
The chemicals on the finger and the paper, together
with moisture, produce a
dark fingerprint. The more perspiration was present
the darker was the print. The darkness of the print
was measured by a photoelectric device (densitometer).
The measure thus yielded was the Finger Perspiration
Index (FPI), which could take a value
from 0 to 40. A decreased score meant increased
finger perspiration, i.e., increased tension. The mean
value of two fingerprints obtained at each measurement
was taken as the FPI for the S at the time of
the measurement. Based upon 25 of these paired
estimates, a reliability coefficient (Spearman) of .90
was obtained.

Psychological manifestations of anxiety were meas- 
ured by a self-rating scale, specifically designed to 
reflect situational tension changes during the experi- 
mental procedure. The scale consisted of six sub- 
scales, each containing common verbal expressions 
of emotional tension, arranged on a continuum from 
one extreme to the other along a 150-millimeter long 
six-point scale. The six scales included: uneasy-at 
ease, calm-restless, undisturbed-disturbed, comfort- 
able-uncomfortable, upset-peaceful, and relaxed- 
tense. 

The scales were presented to the Ss in the form of 
a booklet with one scale per sheet. At repeated ad- 
ministrations the same scales were presented to the 
Ss again, but arranged in a different order. When 
scoring the questionnaire, the distance in millimeters 
from the “low tension” end of each scale to the place 
where the S had marked the scale with a checkmark 
was measured. An S’s rating scale score (R) was the 
mean distance from the low tension end to the check- 
mark for the six scales. The R could yield a value 
from 0 to 150. An increase in score was seen as an 
indication of increased tension (anxiety), The com- 
posite score for three of the scales was correlated 
with the score for the three remaining scales, yield- 
ing a measure of internal consistency. Based upon a 
random sample of 25 rating scales, a reliability co- 
efficient (Spearman) of .88 was obtained. 
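
The scoring rule described above can be sketched in a few lines. This is an illustration only, assuming each checkmark is first measured in millimeters from the left end of its line and then converted to a distance from the low-tension end. The function name, the tuple layout, and the judgment of which end of each scale is the low-tension end are assumptions, not details given in the source.

```python
SCALE_LENGTH_MM = 150

# (left label, right label, True if the LEFT end is the low-tension end).
# The orientation flags are an assumption based on the wording of each label pair.
SCALES = [
    ("uneasy", "at ease", False),
    ("calm", "restless", True),
    ("undisturbed", "disturbed", True),
    ("comfortable", "uncomfortable", True),
    ("upset", "peaceful", False),
    ("relaxed", "tense", True),
]


def rating_scale_score(marks_from_left_mm):
    """Return R: the mean distance (mm) from the low-tension end across the six scales."""
    if len(marks_from_left_mm) != len(SCALES):
        raise ValueError("expected one mark per scale")
    distances = []
    for mark, (_, _, left_is_low) in zip(marks_from_left_mm, SCALES):
        if not 0 <= mark <= SCALE_LENGTH_MM:
            raise ValueError("mark must lie on the 150-mm line")
        # Flip scales whose left end is the high-tension label.
        distances.append(mark if left_is_low else SCALE_LENGTH_MM - mark)
    return sum(distances) / len(distances)


# Hypothetical marks for one S at the Before and Middle measurements.
before = rating_scale_score([120, 30, 25, 40, 110, 35])
middle = rating_scale_score([60, 90, 80, 95, 70, 100])
print(middle - before)  # a positive difference indicates increased tension
```

Flipping the reversed scales before averaging keeps R on the 0 to 150 range stated in the text, with higher values always meaning more tension.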


Task 

As a means of facilitating, as well as controlling, 
the interaction of the Ss within each pair, a simple 
work task was given. The task material was two dif- 
ferent picture puzzles, each containing 27 pieces, hav- 
ing a finished size of 10X 12 inches (Task I and 
Task II). 

The rules for putting the puzzles together were 
formulated so that each member of the cooperating 
pair was dependent on the other member for any 
and all progress in the task. The purpose was to 
make the partners “significant” in each others’ eyes, 
at least as far as working with the task was con- 
cerned. The rules also contained specifications that 
required the members to look at each other, talk 
and listen to each other, maximizing the opportunity 
for verbal and nonverbal communication. 

For each pair, as a unit, a measure of work was
obtained by counting how many pieces had been
placed correctly before the task was discontinued. A
correlation (Spearman) between scores for Task I
and Task II, based upon 30 pairs, selected at random,
was .34 (p < .05).


Procedure 


At the enlistment for the experiment, each S was 
told that he was to participate in a study on inter- 
personal relations. During the session the Ss were 
seated at a table facing each other. The E was seated 
at one end of the same table. 

Because the E was present with the Ss, it may ap- 
pear that the experiment was dealing with a three- 
person relationship rather than with a two-person 
interaction. However, the factor of the E was con- 
stant, and even though a shocked S might see the E 
in a different role than a nonshocked S, the experi- 
mental design and measurements were such that the 
E’s presence would only influence the results minimally.

The pairs of Ss were divided into three groups, 
for which the procedure was somewhat different: 
NS(wS), S(wS), and NS(wNS). The procedure for 
the pairs of Ss in which one member received shock 
but the other member did not—S(wNS)—was as 
follows: 


Step 1. Measurement of finger perspiration and
blood pressure. Administration of self-rating scale
(“Before” measurement).

Step 2. Electrodes attached to shocked S. Shock
tolerance determined.

Step 3. Cooperative work with puzzle (Task I).
Electric shock given to shocked S. Time: 5 minutes.

Step 4. Measurements repeated (“Middle” measurement).

Step 5. Ss work with Task II, the other puzzle.
Electric shock given to shocked S. Time: 5 minutes.

Step 6. Measurements repeated (“After” measurement).

Step 7. Electrodes removed. Additional rating
scales regarding the Ss’ evaluation of each other and
the short form of the MAS given.


generally 
as evaluated dur- 
the experiment, 

where each member 
the above procedure, 
h electric shock were 
S where neither § re- 
to electric shock were 


In summary: The three occasions during the experiment
when the three measures of tension were
administered are referred to as the Before, Middle,
and After measurements. The time elapsed between
the Before and Middle measurements was approximately
16 minutes for pairs where no shock was administered,
and 20 minutes where both Ss got shock.
The time between the Before and After measurements
was twice as long. The measures of tension
were: BP, FPI, and response to a rating scale. The
measure of chronic anxiety level was a short form
of the MAS. Additional measures, in the form of
two graphic rating scales, expressed the Ss’ evaluation
of each other during the experiment.


Treatment of Data 


The analyses of the results were based upon the
difference between the scores at the Before and Middle
measures as well as the difference between scores
at the Before and After measures. The difference
scores represented the Ss’ changes in tension during
the time interval between the occasions of measurement.

Each group of Ss provided two distributions of
difference scores for each measure of anxiety. The
hypotheses were tested by comparing the distribution
of difference scores for appropriate groups of Ss.
The .05 level of significance was used to determine
the acceptance or rejection of the hypotheses tested.
Because no assumptions were made regarding the
shapes of the populations from which the samples
were drawn, nonparametric tests were used throughout
the study. The hypotheses were tested by the
Mann-Whitney U test. One-tailed tests of significance
were used to test the hypotheses, while two-tailed
tests were used for additional analyses or testing
relationships where the directions of change, if
any, were not anticipated.
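
The analysis described above, difference scores followed by a Mann-Whitney U comparison of two groups, can be sketched as follows. This is a minimal illustration rather than the author's computation: the function names and the sample values are hypothetical, and the critical values 138 (.05) and 114 (.01) are simply those quoted in the table footnotes, so they apply only to the group sizes used in the study.

```python
def difference_scores(before, later):
    """Change in each S's score from the Before measurement to a later one."""
    return [b_later - b_before for b_before, b_later in zip(before, later)]


def mann_whitney_u(xs, ys):
    """Mann-Whitney U by direct pair counting; tied pairs count one half.

    Returns the smaller of the two U values, which is the one compared
    with a tabled critical value. The direction of the difference must
    still be checked separately for a one-tailed hypothesis.
    """
    u_x = 0.0
    for x in xs:
        for y in ys:
            if x > y:
                u_x += 1.0
            elif x == y:
                u_x += 0.5
    u_y = len(xs) * len(ys) - u_x
    return min(u_x, u_y)


# Critical values quoted in the table footnotes (valid only for the study's group sizes).
U_CRITICAL_05 = 138
U_CRITICAL_01 = 114

# Hypothetical rating-scale scores for a handful of Ss, for illustration only.
ns_with_s = difference_scores([20, 35, 18, 40], [55, 60, 50, 71])
ns_with_ns = difference_scores([22, 30, 25, 38], [25, 28, 31, 40])
print(mann_whitney_u(ns_with_s, ns_with_ns))
```

A computed U at or below the tabled critical value is taken as significant, which is why the tables report U alongside the 138 and 114 cutoffs.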


TABLE 1 


DIFFERENCE SCORES AND VALUES or U FOR 
THE FINGER PERSPIRATION INDEX 


Groups Compared 
a EE Caen 


P Nonshocked Nonshocked 
Time (with (with Value 
Period shocked) nonshocked) ot 
Before to 
Midde Q,=-~s5. Q, =—1.7 
M@=-12 Mi= 02 194.5 
Q= 23 Q= 24 
Before to 
After =— 3.1 
a= 03 965 
Q= 3.4 
Note. Q1, Md, and Q3 refer to the First Quartile, Median, and
Third Quartile of the distribution of difference scores. A
negative difference score for FPI indicates increased
finger sweat.
* Significance at the .05 level = 138, and at the .01 level = 114.
TABLE 2


DIFFERENCE Scores AND VALUES OF U FOR 
THE BLOOD PRESSURE 
TABLE 4


DIFFERENCE SCORES AND VALUES OF U FOR 
THE FINGER PERSPIRATION INDEX 


Groups Compared 


Groups Compared 


A Nonshocked Nonshocked Shocked Shocked 
Time (with (with Value Time (with (with Value 
Period shocked) | nonshocked) of U* Period shocked) nonshocked) of U* 
Before to Before to 
Middle Q=-8 Middle Qı =— 5.2 1 =— 5.2 
Md= 3 135.5 Md=—-13 Md=— 1.8 182 
Q= 6 @= 35 Q= 06 ` 
Before to Before to 
After Q,=-9 After Q=-49 Q=-9.5 
Md=-2 131 Md= 0 Md=—5.5 152.5 
Q= 4 Q= 21 Q= 0 
* Significance at the .05 level = 138, and at the .01 level = 114.


RESULTS 

Hypothesis 1 

Rephrased in operational terms, the first
hypothesis stated that in a cooperative task
situation nonshocked Ss who work with
shocked Ss show greater anxiety than nonshocked
Ss who work with other nonshocked
Ss. Tables 1, 2, and 3 show the comparison
of the two groups of nonshocked Ss: NS(wS)
and NS(wNS).

The hypothesis was tested twice with each
measure of anxiety. As indicated in Table 1,
the hypothesis was supported for the Before
to After FPIs. Table 2 indicates support of
the hypothesis for both the Before to Middle
and Before to After BPs. Table 3, dealing
with the result of the rating scale, indicates
support for the hypothesis for the Before to
Middle measure.


TABLE 3


DIFFERENCE SCORES AND VALUES OF U FOR
THE RATING SCALE SCORE


* Significance at the .05 level = 138, and at the .01 level = 114.


Hypothesis 2 

Rephrased in operational terms, the second 
hypothesis stated that in a cooperative task 
situation shocked Ss who work with other 
shocked Ss show greater anxiety than shocked 
Ss who work with nonshocked Ss. 

To test the second hypothesis, the two 
groups of shocked Ss were compared: S(wS) 


TABLE 5 


DIFFERENCE SCORES AND VALUES OF U FOR 
THE BLOOD PRESSURE 


Groups Compared 


Groups Compared 


ai 


2 Shocked Shocked 
a, See Sal Value Time (with (with Value 
Period fa nonshocked) of U* Period shocked) | nonshocked) of U’ 
shocke 
B Before to 
Sty Sa Mide Q==7 Q ==2 
£ e == 5 a == Ts Mi= 0 Mi= 3 183 
a = 8 Q= gas Oe y 
a= a 
Before to Before to 
After Qi=-6 
Afte eo = m 
E oa et 1 =n 
Md =— 1 Ss 
Q: = 1 Q: = 8 
* Significance at the .05 level = 138, and at the .01 level = 114.




TABLE 6


DIFFERENCE SCORES AND VALUES OF U FOR
THE RATING SCALE SCORE


                          Groups Compared
Time               Shocked           Shocked             Value
Period             (with shocked)    (with nonshocked)   of U*
Before to Middle   Q1 = 9            Q1 = 14
                   Md = 41           Md = 30             180
                   Q3 = 51           Q3 = 43
Before to After    Q1 = 21           Q1 = 23
                   Md = 45           Md = 39             191.5
                   Q3 = 62           Q3 = 64


* Significance at the .05 level = 138, and at the .01 level = 114.


and S(wNS). Tables 4, 5, and 6 present sum- 
maries of the analysis of data relevant to this 
hypothesis. As indicated in these tables, the 
result did not support the second hypothesis. 


Additional Analyses 


The additional analyses concerned interre- 
lationships among the three measures of ten- 
sion, the relationship of shock to task per- 
formance, the relationship of shock tolerance 


to measures of tension, and Ss’ evaluations of
their partners.


Interrelationships among measures of tension.
Table 7 contains the intercorrelations
(Spearman’s rho) of the three measures of
tension (anxiety).

As indicated in Table 7, there was no substantial
correlation between the several expressions
of anxiety. It is conceivable that the
physiological responses changed in opposite
directions for different Ss. Similarly, some Ss
may have responded psychologically to anxiety
by denying it, i.e., rated themselves as
less anxious on the questionnaire. An analysis
indicated that the NS(wS) group had a significantly
larger variability for the BP scores
than the NS(wNS), less anxious, group.
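
The intercorrelations in Table 7 and the reliability coefficients reported earlier are Spearman rank correlations. As a minimal sketch, assuming the usual definition (the Pearson correlation of the two sets of ranks, with tied values given their average rank), rho could be computed as below. The function names and the sample difference scores are hypothetical.

```python
def midranks(values):
    """Rank values from 1 upward, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Spearman rank correlation: Pearson correlation computed on midranks."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equally long samples of at least two values")
    rx, ry = midranks(xs), midranks(ys)
    mean_x = sum(rx) / len(rx)
    mean_y = sum(ry) / len(ry)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one sample is constant")
    return cov / (var_x * var_y) ** 0.5


# Hypothetical Before-to-Middle difference scores for five Ss.
fpi_change = [-5.2, -1.3, 0.4, 2.1, -3.0]
bp_change = [8, -2, 3, -6, 5]
print(round(spearman_rho(fpi_change, bp_change), 2))
```

Because a negative FPI difference and a positive BP difference both indicate increased tension, agreement between these two measures would appear as a negative rho.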

Electric shock and task performance. There
was no significant difference between the task
performances of pairs where shock was administered,
(SwS), and of pairs where no
shock was given, (NSwNS).

Shock tolerance and anxiety measures. Sta- 
tistical analysis revealed that shock tolerance 
was independent of all measures of anxiety 
used in this study. 

Subjects’ evaluations of each other. The
groups of Ss, compared for the purpose of
testing the hypotheses, were not significantly
different in their evaluation of their likes or
dislikes for their partners. However, nonshocked
Ss felt that nonshocked partners cooperated
better than did shocked partners.


TABLE 7 
INTERCORRELATION AMONG TENSION MEASURES 
S — o i ee 
7 Non- 
Bia i shocked Shocked 
o shockec (with Shock with 
et a , (with Non- Gi ee 
orrelate: ime Period Shocked shocked Shocked shocked 
Finger Perspiration 
and Blood Pressure Before to Middle 1 
14 
Before to After —.23 Pl S a 
: —.16 ‘ 
Finger Perspiration 
and Rating Scale Before to Middle 36 30* 37 
Before to After 26 A ea 35 
Blood Pressure 
and Rating Scale Before to Middle 5 4 
Before to After a = 9 + 


* Significance at the .05 level =.38, and at the 01 ives. 
DISCUSSION


The present study demonstrated that anxiety
in one person in an interpersonal relationship
is communicated and results in increased
anxiety in another person in the situation.
However, the results failed to support
the hypothesis that the presence of a person
with relatively lower anxiety reduces the level
of anxiety in the more anxious person.


Communication of Anxiety


It may be assumed that in the past various
peripheral cues have been associated with
tension or anxiety. Cardiovascular
concomitants of anxiety are manifested by
blanching, blushing, fainting, and throbbing
of vessels close to the skin surface. Sweating
is manifested by moisture and a characteristic
odor. Motor disruptions related to anxiety
may be cued off by tremors, rapid shallow
breathing, forced motions, restlessness, and
speech disturbances. Psychological concomitants
of anxiety may be evidenced by inappropriate
behavior or direct verbalization of feelings
of discomfort. As these cues are associated
with feelings of anxiety or tension, they
become conditioned stimuli for the arousal of
anxiety. While the association of signs and
feelings of anxiety primarily involves a person’s
perception of himself, such association
may be generalized so that perception of these
cues in other persons also elicits anxiety.
Thus, perception of another anxious person
increases anxiety in the observer by virtue of
previous conditioning of observable peripheral
cues to anxiety responses.


In the present experiment, the shocked Ss
displayed peripheral cues of anxiety which
were observed by the nonshocked Ss. An account
of the increased tension of the nonshocked
Ss, in terms of conditioning, may
provide one basis for explaining the phenomenon
of empathy.


Communication of Comfort


A consideration of the nature of comfort cues
and the possible responses to such cues for
the shocked Ss may provide an explanation
of the lack of support for the second hypothesis.
Decreased tension or increased feelings of
comfort are associated with such cues as
diminished blanching or blushing, drying up
of sweat, increased muscular control, calmer
speech, and more normal breathing pattern.
In the present study, the nonshocked Ss were 
to provide the cues of comfort for the shocked 
Ss. As indicated by the results supporting the 
first hypothesis, while the nonshocked Ss work- 
ing with the shocked Ss may initially have dis- 
played less anxiety, they also became more 
anxious during the experimental sessions. But 
the fact that nonshocked Ss who worked with 
shocked Ss became more anxious would in it- 
self not have precluded the possibility of sup- 
porting the second hypothesis. While these Ss 
did become more tense, they did not, in gen- 
eral, become as tense as the shocked Ss. There- 
fore, there was at least a relative difference in 
the anxiety levels of these two kinds of Ss. In 
spite of this, the nonshocked Ss had no “ten- 
sion reducing” effects on their shocked part- 
ners. It is, of course, possible that the rela- 
tively lesser anxiety of the nonshocked Ss was 
less important than the fact that they ex- 
hibited some cues of anxiety or anxiety in- 
crease. The magnitude of an anxiety cue may 
be of less significance than the mere presence 
or absence of such a cue, however minute. 
Certain factors tended to maintain the ten- 
sion of the shocked Ss, in spite of available 
cues of comfort. The experimental procedure 
of anxiety induction through electric shock 
was continued throughout the interaction of 
the Ss. Many of the shocked Ss expressed the 
conviction that the discomfort of the shock 
became increasingly difficult to endure. While 
a shocked S thus may have clearly perceived 
the cues of comfort from his nonshocked 
partner, increased situational stress may have 
prevented him from feeling any anxiety relief. 
Response to stress was most often verbalized
by the shocked Ss as extreme discomfort
or tension. Some of the Ss did, however, express
anger or resentment towards their nonshocked
partners as well as towards the E.
Because there is a good deal of overlap in the
physiological and psychological concomitants
of emotions, such anger or resentment would
most likely be registered by the employed
measures as increased or maintained anxiety.
The hypothesized anxiety decrease of the shocked
Ss in the (SwNS) group was evaluated against
the assumed high level of tension maintained
by the Ss in the (SwS) group. Such an assumption
may, in fact, have been erroneous, contributing
to the lack of support for the second
hypothesis. The finding of Seidman et al. (1957)
seems to indicate alleviation of tension as a
function of a shared situation, which, applied
to the present study, would suggest lowered
tension in the (SwS) group.

In the light of the above discussion, it 
seems that the experimental design was not 
well suited to test the second hypothesis. Fur- 
thermore, the stronger the first hypothesis was 
supported, the less possibility there would be 
of supporting the second. 


Correlation of Anxiety Measures 


Although the three measures of tension
tended to support the first hypothesis, the in-
tercorrelations among the measures were low
(Table 7). There is a substantial body of
data in the literature which indicates that
people differ from one another with regard
to their autonomic response pattern (Gell-
horn, 1953). The present finding, suggesting
that people, in general, respond to anxiety in
many ways, while the individual person may
use only one or a few characteristic anxiety re-
sponses, is in accord with the literature.
Lack of correlation among the physiologi-
cal measures may also have been a reflection
of possible differences in effectiveness of the
methods of recording the physiological reac-
tions. For example, spot checking of the blood
pressure, as done in this study, may not have
guaranteed that the readings represented what
the blood pressure was during the work pe-
riod. The finger perspiration, by contrast, is
(within time limits) more of a cumulative
and persistent indicator of autonomic change.
Two studies, which related social back- 
ground to specific pattern of cardiovascular 
reaction to emotional stress, would demon- 
strate that characteristic autonomic functions 
can be learned by an individual (King & 
Funkenstein, 1957; King & Henry, 1955). 
However, even if learning of physiological
patterns of responses seems possible, the con-
scious change of such responses, once learned,
is not likely to take place. While physiologi-
cal measures of tension are thus less un-
der conscious control, psychological measures,
such as the employed rating scales, are much
more easily influenced by intentional varia-
tions. Lack of correlation between the physio-
logical and psychological measures of anxiety
may thus be a function of changes in such
factors as apprehension, unfamiliarity, etc.,
affecting the different measuring instruments
differently.

Another possible way of accounting for the
lack of correlation of anxiety measures, or at
least the absence of a significant correlation
between physiological and psychological meas-
ures, is in terms of anxious persons’ impaired
perception of their own anxiety reactions.
Verbal reports of anxiety level would thus not
correspond with physiological measurements.
Support for this suggestion may be found in
Table 7. The NS(wNS) group, where least
anxiety was present, yielded two of the three
significant correlations between the anxiety
measures.


SUMMARY 


This study was designed to produce an in-
terpersonal situation where communication of
anxiety as well as relief of anxiety (comfort)
could be demonstrated. It was hypothesized
that (a) a person who interacts with a more
anxious person will himself become more anx-
ious, and that (b) a person who interacts with
a less anxious person will himself become less
anxious.

Ss, who were male college students, were
randomly paired and assigned to an experi-
mental session or to either of two control ses-
sions. All of the pairs were asked to perform
a cooperative work task, a puzzle, the pur-
pose of which was to provide an appropriate
means of bringing the partners together in in-
terpersonal interaction. In the experimental
session, one S received electric shock while
his partner received no shock. The purpose of
the shock was to introduce anxiety. The ex-
tent to which this anxiety was communicated
was measured in three ways: changes in
blood pressure, finger sweat, and scores on a
rating scale.

The first hypothesis was tested by com-
paring the anxiety change of the nonshocked
Ss in the experimental session with the anxiety
change of nonshocked Ss in a control session
where no S was shocked. The data supported
the hypothesis for all three measures of
anxiety. The second hypothesis was tested by
comparing the shocked Ss of the experimen-
tal group with shocked Ss in a control group
where both members received shock. The find-
ings here did not support the hypothesis for
any anxiety measure used.
It appears that the experimental design had
less power to show communication of com-
fort than communication of anxiety. This lack
of power could have been due to several con-
ditions: (a) the cues of comfort, that
were to be communicated to the anxious S,
were not strong enough; (b) the level of anx-
iety in the control Ss, against which the com-
municated comfort was compared, was not
high enough; (c) the shocked Ss may not
have attended to the cues of comfort dis-
played by their partners; and (d) feelings of
comfort cannot be communicated as easily as
feelings of anxiety.

An additional finding was that the meas-
ures of anxiety did not correlate with each
other for the groups of anxious Ss. Where
minimal anxiety was present, i.e., in the con-
trol group where no shock was given, signifi-
cant correlations were, however, evident.

PSYCHOLOGICAL TEST COVARIATES OF CONCEPTUAL
DEFICIT IN SCHIZOPHRENIA


WILLIAM W. LOTHROP 1 


Veterans Administration Hospital, Northampton, Massachusetts 


The conceptual ability of the schizophrenic 
has been the object of a great deal of clinical 
interest and research in recent years. Many 
studies, using a number of different tech- 
niques, have demonstrated that schizophrenics 
usually perform less well than normals or 
neurotics on conceptual tasks. As yet, how- 
ever, there have been very few systematic at- 
tempts to investigate factors which might be 
related to this deficit. Different studies have 
described some of these factors, such as in- 
appropriate and/or flattened affect, disturbed 
intellectual functioning, poor prognosis, chro- 
nicity, and hebephrenic diagnosis (Kasanin 
& Hanfmann, 1938; Meadow & Funkenstein, 
1952; Scherer, 1951; Stotsky & Lawrence, 
1955), but replication of these findings is 
generally lacking, and many other factors 
must remain to be described. The present 
study represents a preliminary attempt to in- 
vestigate some of these factors, with the spe- 
cific purpose of determining whether concep-
tual deficit in schizophrenia covaries consist-
ently with performance on psychological tests
and with certain clinical variables, in the hope
of generating hypotheses for future research.


METHOD 
The subjects (Ss) used in this study were all male
veterans from the VA Hospital in [...]
diagnosis of schizophrenic [...]
[...]al or neurological involvement were excluded.
A total of 64 patients who had been given the fol-
lowing tests was selected alphabetically from the
Clinical Psychology files: Wechsler-Bellevue I,
Rorschach, Object Sorting (active part), Sacks
Sentence Completion (Form, a), Word Association,
Bender-Gestalt, Human Figure Drawings, Babcock
Memory Paragraph A, Benton Visual Retention Test
(1946 form), the Associate Learning test from the
Wechsler Memory Scale, and the Paired-Associates
from the Hunt-Minnesota Test for Organic Brain
Damage. The scoring of all tests was rechecked by
the experimenter (E). All patients had been tested
within 6 weeks of their admission, and approximately
90% were first admissions to this hospital. Thus the
chronic, deteriorated type of schizophrenic frequently
associated with conceptual deficit (Benjamin, 1944;
Rabin, King, & Ehrmann, 1955; Rapaport, Gill, &
Schafer, 1945) is generally missing from this sample,
and the results must be interpreted accordingly.

1 [...] Isidore W. Scherer, Arnold Trehub, and Henry
Oppenheim of the hospital Psychology Service for their
critical evaluations of this manuscript, and to Jo-
seph M. Cuddy and David J. Moriarity, Clinical
Psychology trainees, for their help in analyzing part
of the data.
The active part of Rapaport’s Object Sorting Test
(1945), with slight modifications in administration,
was used as the measure of conceptual performance.
To obtain one group of Ss showing relatively little
conceptual deficit and another group with marked
deficit, an elaborate set of criteria was initially de-
veloped. It was then found that the same groups
of Ss could be obtained using only the total number
of concrete responses and failures. For this report,
therefore, the criterion for determining amount of
conceptual deficit was the number of concrete re-
sponses and failures [...] extremes of the distribution
[...]. Two or fewer concrete responses
were taken to indicate little or no conceptual deficit,
based on Rapaport’s control group data. Twenty-
three (36%) of the patients in this sample gave two
or fewer such responses. Severe conceptual deficit
was arbitrarily designated as five or more concrete
responses out of the seven items. Eighteen Ss [...]
fell in this group. Two groups of 18 were obtained
by taking the first 18 unimpaired Ss alphabetically
and all of the severely impaired Ss. The vari-
ables were then tested against these two groups to
determine if significant discrimination could be ob-
tained.

RESULTS
Clinical Variables 


In this sample severe conceptual deficit was 
not related to age at admission; number of 
school grades completed; occupational level;
religion; diagnostic category; whether the ill-
ness was rated mild, moderate, or severe by
the staff; or whether it was classified as
chronic or acute. Three findings significant
between the .10 and .05 levels of confidence
by Fisher’s exact test may deserve replica-
tion, however: the severely impaired Ss were
more often married, had more often been com-
mitted to the hospital, and had spent a greater
amount of time in this hospital over all ad-
missions.


Psychological Tests 


No relationship could be demonstrated be-
tween severe conceptual deficit and Verbal,
Performance, or Full Scale IQ on the Wechs-
ler-Bellevue I. Each subtest was also analyzed
by the chi square method, but no test reached
statistical significance, nor was there any sig-
nificant difference in intertest scatter between
the two groups.
On the Associate Learning test from the
Wechsler Memory Scale the unimpaired Ss
learned a significantly greater number of as-
sociations, p = .025 by the exact test. This
difference appeared to be primarily a func-
tion of the hard associations, there being no
difference between the groups in the number
of easy associations learned. Results in the
same direction were obtained with the hard
pairs from the Hunt-Minnesota test, but these
results did not attain statistical significance.
None of the three memory tests used (Digit
Span, Babcock Memory Paragraph A, and the
Benton test) discriminated between the two
groups at significant levels of confidence, al-
though in each case the impaired group re-
ceived lower scores. Actually, both groups
showed a deficit on these tests relative to
their IQs, reducing the discriminatory power
of the tests.
For the analysis of the Bender-Gestalt, Ss
from the two groups were matched roughly
for IQ. Their tests were then presented in
pairs to two independent judges, grad-
uate students in psychology, who were
asked to decide which of each pair was the
more deviant, without knowing the identity of
the records. The test did not discriminate be- 
tween the two groups. The same procedure 
was followed with the human figure drawings, 
again without significant results. Similarly, 
neither the sex of the first-drawn figure nor 
the size relation between male and female fig- 
ures discriminated between the two groups. 

On the Word Association Test, a modifica- 
tion of Rapaport’s list of 60 words, there were 
no significant differences between the two 
groups in number of popular responses or in 
the number of associations correctly recalled 
during immediate readministration of the list, 
regardless of whether the words were “trau- 
matic” or “nontraumatic.” 

The unimpaired group gave a significantly 
greater number of responses to the Rorschach, 
p = .02 by the exact test. In view of the pos- 
sible qualitative and quantitative differences 
which this might produce in other scoring 
categories (Cronbach, 1949) a matched-pair 
technique was then used. Eleven pairs of Ss 
were obtained by matching to within plus or 
minus one response, and each of the remain- 
ing scoring categories was analyzed by the 
Wilcoxon matched-pair rank test. Three of 
the 17 variables tested yielded significance at 
the .05 level of confidence: M, FC’, and FC, 
with the unimpaired group giving the greater 
number of responses in each instance. The un- 
impaired group also gave a preponderance of 
M over ΣC which was significant at the .05
level. In addition, there was a tendency for
the unimpaired group to give more human and
human detail responses (p = .10), and fewer
animal or animal detail responses (p = .12).
The unimpaired Ss also tended to receive 
lower scores on the Buhler-Lefever Diagnostic 
Sign List (Buhler, Buhler, & Lefever, 1948), 
indicating less pathology (p = .10). Lastly, 
genetic level scores were computed for each 
S (Becker, 1956). In 8 of the 11 pairs the 
unimpaired S received the higher score, indi- 
cating a higher level of perceptual develop- 
ment, but one of the reversals was quite large 
and significance could not be obtained by the 
Wilcoxon test. 
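
The 8-of-11 result for the genetic level scores can be checked with a simple sign test, which looks only at the direction of each pair's difference. This is a cruder, swapped-in procedure, not the Wilcoxon matched-pair rank test the study reports; the sketch assumes no tied pairs and uses only the counts given in the text, and the function name is illustrative.

```python
from math import comb

def sign_test_upper_tail(successes: int, n_pairs: int) -> float:
    """One-tailed probability of at least `successes` positive pairs
    out of `n_pairs` when each direction is equally likely (p = 0.5)."""
    favorable = sum(comb(n_pairs, k) for k in range(successes, n_pairs + 1))
    return favorable / 2 ** n_pairs

# Counts from the text: the unimpaired S scored higher in 8 of 11 pairs.
p_one_tailed = sign_test_upper_tail(8, 11)
print(round(p_one_tailed, 3))  # 0.113
```

Judged by direction alone, 8 of 11 pairs falls short of the .05 level, which agrees with the nonsignificant Wilcoxon result reported above.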

The Sentence Completion test failed to dis-
criminate between the two groups in the num-
ber of words used to complete the first 20
items, or in the number of items left unan-
swered. The content of the individual re-
sponses was naturally quite diverse, except
for two items. In completing the phrase “To 
me the future looks . . . ,” the impaired Ss 
with but one exception described the future as 
“good” or “bright,” whereas 10 of the unim- 
paired group described the future as “black,” 
“hopeless,” “dull,” etc. (p = .01 by the exact 
test). On the item “Compared with others 
I... ,” 13 of 16 in the unimpaired group 
could be classified as considering themselves 
different from other people, usually in a nega- 
tive sense, while 7 of 11 in the impaired 
group considered themselves no different from 
other people (p = .05). The responses to the 
remaining items were too varied to yield sig- 
nificant consistency. It was subjectively felt, 
however, that the unimpaired group was more 
frank in describing environmental difficulties. 
On the four items having to do with mother,
for example, the unimpaired Ss made twice as
many negative or critical statements as the
impaired Ss. The same also held true for state-
ments about the father. Lastly, it was felt that
the impaired group tended to deny illness, in
that they described fewer fears, fewer inter-
personal difficulties, etc. In the absence of a
more objective analysis, however, these im-
pressions must be regarded as speculative.
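
The exact test reported for the "Compared with others I . . ." item can be sketched as a hypergeometric tail sum. The 2 x 2 table used here is one reading of the counts in the text (13 of 16 unimpaired Ss classed as "different"; 7 of 11 impaired Ss classed as "no different," hence 4 "different"). The source does not say whether its p value was one- or two-tailed, so the one-tailed choice below is an assumption, and the function name is illustrative.

```python
from math import comb

def fisher_exact_upper_tail(a: int, b: int, c: int, d: int) -> float:
    """One-tailed Fisher exact probability for the 2 x 2 table
    [[a, b], [c, d]]: the chance of a count of `a` or more in the
    top-left cell, with all row and column totals held fixed."""
    row1, row2 = a + b, c + d
    col1 = a + c
    total = row1 + row2
    denom = comb(total, row1)
    upper = min(row1, col1)
    tail = sum(comb(col1, k) * comb(total - col1, row1 - k)
               for k in range(a, upper + 1))
    return tail / denom

# Rows: unimpaired, impaired. Columns: "different", "no different".
p = fisher_exact_upper_tail(13, 3, 4, 7)
print(round(p, 3))  # 0.024
```

Under these assumptions the one-tailed probability is about .024; a two-tailed version of the test would give a larger value.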


DISCUSSION


The results of this study suggest, first of
all, that abnormal concreteness may not be
characteristic of all schizophrenics, at least in
terms of their Object Sorting performance.
While it is true that a majority of this sam- 
ple gave more concrete responses than would 
be expected in the normal population, one- 
third demonstrated little or no conceptual 
deficit by Rapaport’s norms. This finding is 
consistent with the results of other studies 
using different measures of conceptual abil- 
ity (Hanfmann & Kasanin, 1942; Wegrocki, 
1940), and re-emphasizes the need to con- 
sider individual differences along this dimen- 
sion. 

As regards the relationship of conceptual
deficit to performance on psychological tests,
the results suggest that a basic intellectual
deficit exists which Goldstein (1959) has
termed a “restriction in the use of the high-
est mental capacity” (p. 147). In this sample,
the effects of conceptual deficit became most
apparent in unstructured situations, such as
the Rorschach, and in tasks involving new
learning. In both these cases, as with the Ob-
ject Sorting Test itself, active and creative
thinking is required. Under these conditions
the “concrete” schizophrenics demonstrated
greater intellectual impoverishment and con-
striction. This deficit was not significantly
manifest on tests involving memory and con- 
centration (both groups showed impairment), 
perceptual-motor tasks, or on verbal tasks in- 
volving relatively stereotyped or familiar ma- 
terial. 

The present data do not permit much specu-
lation about the dynamics of conceptual defi-
cit. The impaired group in general seems more
deviant, and in some respects performs like
an organic group. An alternative interpreta-
tion, however, is that conceptual deficit may
be related to “defensiveness,” as Goldstein
(1959) and others have suggested, manifested
by a constriction of interpersonal and affec-
tive responses as well as intellectual constric-
tion, guardedness, denial, and/or lack of in-
sight. The concrete schizophrenics tended to
portray themselves as no different from other
people, were allegedly optimistic about the fu-
ture, and described little stress in their envi-
ronments on the Sentence Completion test. In
contrast with the unimpaired Ss, the state-
ments of the concrete schizophrenics seemed
patently unrealistic. Perhaps this denial of ill-
ness or lack of insight, if such be the case,
may relate to the tendency for these pa-
tients to have been more often committed,
since the committed patient is often brought
to the hospital against his wishes because he
sees no reason to be hospitalized. In order
to make these speculations tenable,
further research needs to be done regarding
the role of personality factors in conceptual
deficit.


SUMMARY 


This study attempted to investigate some of
the clinical and test covariates of conceptual
deficit in schizophrenia, using number of con-
crete responses and failures on the active part
of the Object Sorting Test as the measure of
deficit. Using the test records of patients who
had been discharged with the staff diagnosis 
of schizophrenic reaction, two groups of 18 Ss 
each were selected, one showing marked con- 
ceptual deficit and one which demonstrated 
little or no deficit. Clinically, no significant
differences were found between the two groups 
in education, occupational level, religion, age 
at admission, or diagnostic category. The pa- 
tients with severe deficit, however, tended 
more often to be married, had more often
been committed, and tended to have spent 
more time in the hospital overall. When com- 
pared as to performance on psychological 
tests, the results suggested that conceptual 
deficit may be associated with a basic intel- 
lectual impoverishment in unstructured situa- 
tions and in situations requiring new learning, 
as might have been predicted. In addition, 
however, it was hypothesized that conceptual
deficit may be associated with a defensiveness
that is manifested by restricted interpersonal
contacts, affective inhibition, and a tendency
towards denial of illness and/or lack of in-
sight. These results were thought to indicate
that further research into the role of person-
ality factors in conceptual deficit might prove
fruitful.

At some point early in intensive psycho-
therapy, therapist and patient, collaboratively
or otherwise, decide how frequently treatment 
sessions will be scheduled. The decision pre- 
sumably relates crucially to treatment out- 
come—if one can judge from the clashes 
within and between different theoretical 
schools over the matter of optimum fre- 
quency. Optimum frequency recommendations 
range from the orthodox analysts’ four or five 
sessions weekly down to the once weekly ap- 
pointment schedule commonly employed by 
nonanalysts. A large proportion of psycho-
therapy patients are seen even less than once
weekly (Hollingshead & Redlich, 1958), al-
though such schedules are rarely recommended
or defended in the literature. Decisions about
treatment frequency are also presumably
based on patient characteristics important to
the treatment process—if one accepts the
practical and theoretical articles citing patient
variables to be considered in setting the treat-
ment schedule (e.g., Wolberg, 1954, pp. 258-
259). Certainly, much time is consumed in
staff conferences and in discussions between
patients and therapists reaching decisions
about interview frequency.

The purpose of this study was to examine 
the following questions concerning therapists’ 
judgments about what frequency schedules 
are appropriate for their patients: 

1. Do patients respond more favorably to 
psychotherapy if treated on frequency sched- 
ules their therapists consider suitable than if 
treated on schedules their therapists consider 
unsuitable for their conditions? 

2. What patient variables relate to thera- 
pist judgments that treatment schedules are 
suitable or unsuitable for their patients?

3. What other therapist judgments of the
patient and the treatment procedure relate to
therapist opinions about suitability or non-
suitability of treatment schedules?


METHOD


The present data were collected as part of a larger
study to evaluate effects of individual psychotherapy
(Veterans Administration Cooperative Study to Pre-
dict Response to Individual Psychotherapy). Seven
mental hygiene clinics participated in the study. The
design of the cooperative study called for as-
signment of patients to three interview schedules:
twice weekly, once weekly, and biweekly. The great
majority of VA psychotherapy patients are seen on
one of these three schedules. One cooperating clinic
randomly assigned patients to all three treatment fre-
quencies. The other six clinics employed two of the
three schedules—once weekly plus either twice weekly
or biweekly. After the first therapy hour, therapists
judged whether the assigned schedules were suitable,
too frequent, or too infrequent for their patients’
conditions. These suitability ratings are the focus of
this report.

Subjects. The study included all 133 outpatients
who were assigned to treatment and remained for at
least 4 months. The average patient was 35
years old and had attended college. All patients satis-
fied these criteria for inclusion in the project: male,
under age 50, without evidence of brain injury, less
than 12 therapy hours in the preceding 3 months,
acceptable for intensive psychotherapy. “Intensive
psychotherapy” was defined as interviews lasting at
least 45 minutes, aimed at changing personal adjust-
ment patterns, and excluding supportive or mainte-
nance therapy. Forty-five patients seen twice weekly,
62 patients seen once weekly, and 26 patients seen
biweekly completed the study.

Procedure. Clinics followed usual procedures for
prepsychotherapy processing, adding research pro-
cedures at each stage. Initial assessment included a
research test battery, evaluation by a social worker,
and appraisal by the therapist. Reassessment of pa-
tients took place after 4 months of psychotherapy.
Each clinic staff made the assignments of patients to
therapists, and interview frequencies were determined
randomly after selection of the therapists. In all in-
stances in which the assigned interview schedule had
to be altered, the patient was dropped from the proj-
ect. Psychologists were therapists for 45% of the
patient sample, psychiatrists for 23%, and social
workers for 32%.

1 We are indebted to the staffs of the Veterans Ad-
ministration Mental Hygiene Clinics in the following
cities for cooperating in the collection of data for
this study: Albany, Boston, Bridgeport, Buffalo, Chi-
cago, Denver, and Hartford.

Measures. The patient, therapist, and social worker
measures cited in this report are briefly described
below.


Patient measures:

Anxiety Scale—a slightly modified form of Taylor's
(1953) MAS

Ego Strength—56 items from Barron's (1953) 66-
item scale from the MMPI (10 items omitted due to
overlap with other measures or because they were
potentially offensive questions about religious beliefs)

Symptom Checklist—an inventory of 20 common
somatic complaints

Self-Rating—15 five-point graphic scales designed
to measure self-satisfaction

K scale—15 items from the MMPI K scale

Sociability—a 15-item questionnaire from the Guil-
ford-Zimmerman Temperament Survey

Behavior Disturbance—50 questionnaire items
adapted from an inventory by Applezweig, Dibner,
and Osborne (1958) to measure antisocial behavior
and psychopathic tendencies

Word Fluency—based on a test to measure the
Word Fluency Factor in Thurstone’s Primary Men-
tal Abilities Test

Education—nine-point scale of level of schooling
completed

Year of Birth


Therapist ratings or reports:

Change Report-IR—inventory of 24 specific changes
in the patient-therapist interview relationship, each
item describing a change commonly reported by
therapists as accompanying psychotherapy

Change Report-PG + SR—inventory of 68 specific
positive gains and symptom reductions, each item
describing a change commonly reported by therapists
as accompanying psychotherapy

Severity Rating—rating on seven four-point
scales related to severity of illness (Lorr, Holsopple,
& Turk, 1956)

Global Severity—four-point graphic scale for rat-
ing over-all severity of disorder

Liking for Patient—four-point graphic scale
of liking for the patient as a person

Interest in Problem—four-point graphic scale
of degree of interest in the problem presented by
the patient

Motivation for Treatment—four-point graphic
scale of patient’s degree of motivation for treatment

ICL—each of four measures describing the patient,
based on a set of 32 adjectives from an interpersonal
check list developed by La Forge and Suczek (1955):
Dominance, Hostility, Submissiveness, Affection.


Social worker rating:

Severity Rating—same scales as the therapist
measure


RESULTS

Suitability and Outcome

To determine if patients on schedules
judged suitable respond more favorably to
psychotherapy than patients on schedules 
judged unsuitable, the total sample was di- 
chotomized into Groups S and U. Group S 
consisted of 97 patients on schedules the 
therapists considered suitable. Group U in- 
cluded 36 patients on schedules the therapists 
considered unsuitable—either too frequent or 
not frequent enough. All treatment frequen- 
cies were pooled for the analysis, since, for 
these patients, actual assigned treatment fre- 
quency was unrelated (as shown by statistical 
tests) to outcome.

Results of analyses of covariance for 10
criteria are summarized in Table 1. Pretreat-
ment scores on the criteria were controlled in
all analyses except the two Therapist Change
Reports—where there were no initial scores.
One-way analysis of variance results are pre-
sented for these two measures. One-tailed
tests of significance were made, since greater
improvement was hypothesized for Group S.
Column 2 of Table 1 indicates the hypothe-
sized directions of the mean differences.

The results indicated one significant change 
in the predicted direction on a patient meas- 
ure (K scale), one reversal (Self-Rating), and 
one positive predicted result on a therapist 
measure (Change Report-IR). The K scale 
difference suggests that Group S patients were 
more ready to report problems after 4 months 
of psychotherapy. Even though statistically 
significant, the mean difference was so slight 
as to be of doubtful clinical importance. 
Therapists reported significantly more changes 
in the patient-therapist interview relationship 
for Group S. The Change Report-IR included 
such items as “more able to report dreams and 
embarrassing material,” “relies less on thera- 
pist for reassurance,” and “misses fewer ther- 
apy appointments.” 

Subsequently, Groups S and U were pooled
and t tests were made of correlated pre-post-
treatment differences on the criteria (except
Change Index-PG + SR) which were not sig-
nificant by the above analyses. These tests
were made to determine if the entire group
showed other changes over the course of treat-
ment which were not associated with the suit-
ability of the treatment schedule. These data
are not presented in detail, but none of the
four patient measures thus tested proved
significant. The Therapist Severity Rating
showed a significant decrease (p < .01) as
did the Social Worker Severity Rating (p
< .05).


TABLE 1

FINAL ADJUSTED MEANS AND F TESTS FOR GROUP S AND GROUP U
(Initial scores controlled)

                                        Group S         Group U
Criterion                 Hypothesis    N    Mean       N    Mean      F

Patient measure:
  Anxiety                   S < U       97   29.32      36   27.87     1.462
  Ego Strength              S > U       97   32.08      36   32.25     0.030
  Symptom Checklist         S < U       97    7.78      36    7.33     0.639
  Self-Rating               S > U       96   46.09      36   48.51     5.540a
  K Scale                   S < U       97    5.30      36    6.09     2.812
  Sociability               S > U       97    7.43      36    7.82     0.837
Therapist report:
  Change Index-IR           S > U       97    9.58      34    7.03     7.244**
  Change Index-PG + SR      S > U       97   12.30b     34   10.68b    1.047
  Severity Rating           S < U       97    2.09      34    2.04     0.519
Social worker report:
  Severity Rating           S < U       97    1.96      34    2.02     0.486

a Reversal, not significant by one-tailed test; p < .05 if two-tailed test had been specified.
b Means are actual, not adjusted. There were no initial scores on this variable.
* Significant at .05 level.
** Significant at .01 level.


Following the above analyses, additional
sets of comparisons of response to treatment
were made. The 11 patients rated as seen too
frequently on the twice weekly schedule were
compared with the 34 patients rated suitable
for twice weekly interviews; and the 14 pa-
tients rated as seen too infrequently on bi-
weekly schedules were compared with the 12
patients rated suitable for biweekly inter-
views. (A similar comparison could not be
made for the once weekly group as too few
patients were rated as scheduled either too
frequently or too infrequently.) Thus all pa-
tients in each comparison were rated in rela-
tion to the same standard—actual treatment
frequency. It was thought that differences
might have been obscured in the main analy-
ses by pooling all unsuitable cases into one
group regardless of actual treatment fre-
quency and regardless of whether they were
considered seen too often or too infrequently.
As in the main analysis, greater improvement
on each frequency schedule was hypothesized
for patients on suitable schedules.
Covariance analyses on the same 10 criteria
listed in Table 1 were run for the twice weekly
and biweekly groups. These data are not pre-
sented in detail, but there was one significant
difference beyond the .05 level in each group
—K scale in the twice weekly group and
Change Report-IR in the biweekly group.
There were no consistent trends of results on
the other criteria. The absence of trends and
the finding of only two significant results in
20 comparisons make it appear doubtful that
suitability ratings predict response to psycho-
therapy over this period of treatment.
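
The doubt about two significant results in 20 comparisons can be made concrete with a binomial calculation. The sketch assumes the 20 tests are independent and that each has a .05 chance of a false positive; the criteria in the study were measured on the same patients, so independence is a simplifying assumption and not a claim from the source. The function name is illustrative.

```python
from math import comb

def prob_at_least(k_min: int, n_tests: int, alpha: float) -> float:
    """Probability of k_min or more chance 'significant' results among
    n_tests independent tests, each run at significance level alpha."""
    return sum(comb(n_tests, k) * alpha ** k * (1 - alpha) ** (n_tests - k)
               for k in range(k_min, n_tests + 1))

expected_by_chance = 20 * 0.05          # about one result expected by chance
p_two_or_more = prob_at_least(2, 20, 0.05)
print(expected_by_chance, round(p_two_or_more, 3))  # 1.0 0.264
```

Under these assumptions, two or more nominally significant results would turn up by chance roughly one time in four, which is consistent with the cautious reading given above.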
\ natural question might arise as to whether 
a disproportionate number of patients on un- 
Suitable schedules had terminated treatment 
before the 4-month re-evaluation date. A T 
Square test indicated that the 133 and 17 
patients who, respectively, completed br 
terminated the study did not differ sigt! 
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TABLE 2 


INITIAL MEANS AND F TESTS FOR THERAPISTS’ RATINGS OF Group S anp Groupe U 
oN Twice WEEKLY AND BIWEEKLY TREATMENT SCHEDULES 


Twice Weekly Interviews 


Biweekly Interviews 


Means 


Group S Group U 


Means 


Group S Group U 


Variable (V=34) (V=11) F (V=12) (V=14) F 

Liking for Patient 2.50 1.82 8.250** 2.25 2.21 0.016 
Interest in Problem 2.65 2.27 2.33 2.43 0.154 
Motivation for Treatment 2.68 2.64 2.33 2.57 0.624 
ICL-Dominance 3.26 3.55 3.42 2.93 0.108 
ICL-Hostility 6.64 7.03 6.17 6.14 0.000 
ICL-Submissiveness 6.24 10.09 7.67 5.50 1.134 
ICL-Affection 2.76 3.55 4.00 2.64 0.574 
2.88 3.36 3.00 2.79 0.576 


Global Severity 


* 
b <05. 
"5 < 0 


cantly in the number of patients on unsuit- 
able schedules. 


Relation of Suitability Ratings to Patient Characteristics


Initial characteristics of patients on suitable and unsuitable schedules were compared. F tests were made to determine if Groups S and U on either the twice weekly or biweekly schedules differed initially on the 10 patient measures included in the study. Most of these characteristics have been cited in the literature as related to patient response to psychotherapy; e.g., best prospects for psychotherapy are often described as relatively fluent, young, educated, nonpsychopathic, anxious, etc. There were, however, no significant differences in patient characteristics between Groups S and U on the twice weekly schedule, where Group U was described as needing less frequent interviews. Nor were there differences between Groups S and U on the biweekly schedule, where Group U was described as needing more frequent sessions.


Suitability Ratings and Other Therapist Ratings of the Patient


Suitable and unsuitable patient groups were also compared with respect to their therapists’ judgments about them, and the results are summarized in Table 2. All therapist ratings


and reports included in Table 2 were made 
after the initial therapy hour. In the twice 
weekly group, patients rated unsuitable—as 
needing less frequent interviews—were liked 
significantly less by their therapists. Thera- 
pists described Group U as less likable than 
most patients, and Group S as more likable 
than most patients. The twice weekly Group 
U was also rated as significantly more se- 
verely ill than Group S. The ICL-Submissive- 
ness difference indicates therapists described 
Group U patients significantly more often 
with adjectives such as “self-punishing,” “pas- 
sive,” “shy,” and “dependent.” 

Therapists’ ratings and reports did not differentiate suitable and unsuitable groups on the biweekly schedule. Table 3 may help explain the findings for Groups S and U on this schedule. In Table 3 the relation of suitability ratings to actual assigned treatment frequency is presented. Chi square is significant at p < .01, but the frequency of unsuitable ratings of patients on the biweekly schedule produces the significant chi square. Thus, an unsuitable rating for a patient in the biweekly group appears to reflect therapist dissatisfaction with a treatment schedule providing for fewer interviews than the conventional once-a-week practice. It reflects little else as far as can be determined here.

TABLE 3

NUMBER OF PATIENTS RATED SUITABLE AND UNSUITABLE
FOR THREE TREATMENT SCHEDULES

                        Assigned Treatment Schedule
Therapists’ Rating      2×wk.    1×wk.    Biwk.    Total

Suitable                34       51       12       97
Unsuitable              11       11       14       36

Total                   45       62       26       133

Note. χ² = 12.33, df = 2, p < .01.
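As a check on the reasoning around Table 3, the chi square can be recomputed from the reported cell counts. The following is a minimal sketch in plain Python; the function name `chi_square` and the layout of the `observed` list are illustrative choices and not part of the original report. It also returns the per-cell contributions, which makes it easy to see which cell dominates the statistic.

```python
def chi_square(table):
    """Pearson chi square for a two-way table of observed counts.

    Returns (statistic, degrees of freedom, per-cell contributions).
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    contributions = []
    for i, row in enumerate(table):
        contrib_row = []
        for j, obs in enumerate(row):
            # Expected count under independence of rating and schedule.
            expected = row_totals[i] * col_totals[j] / n
            contrib_row.append((obs - expected) ** 2 / expected)
        contributions.append(contrib_row)
    stat = sum(sum(r) for r in contributions)
    df = (len(row_totals) - 1) * (len(col_totals) - 1)
    return stat, df, contributions


# Counts from Table 3: rows are Suitable / Unsuitable,
# columns are twice weekly, once weekly, biweekly.
observed = [[34, 51, 12],
            [11, 11, 14]]

stat, df, contributions = chi_square(observed)
print(round(stat, 2), df)
```

With these counts, the largest single contribution comes from the Unsuitable, biweekly cell, which is consistent with the statement that unsuitable ratings on the biweekly schedule produce the significant chi square.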


Discussion 


There is little evidence that patient re- 
sponse to psychotherapy differs depending 
upon whether the treatment frequency is con- 
sidered suitable or unsuitable. A report by 
the therapist on the number of changes ob- 
served in the interview relationship is the 
single indicator of greater change by the suit- 
able group. It is questionable that the num- 
ber of interview changes reflects a true pa- 
tient difference in treatment response. Inter- 
view relationship patterns are modified by 
therapists as well as patients, and therapist 
satisfaction or dissatisfaction with a treat- 
ment schedule could be a factor influencing 
this pattern. It thus appears doubtful that the therapist can tailor the interview schedule to fit the individual psychotherapy patient, if the criterion for “fit” is favorable treatment response. Of course, certain obvious qualifications apply: the sample, the length of the treatment period studied, and the fact that therapists judged suitability early in treatment.

A pertinent question is: were therapists un- 
able to judge the appropriateness of treatment 
schedules because therapy had no effect on 
this sample, or were they simply unable to 
judge accurately or validly how many inter- 
views per week were necessary to obtain that 
effect? The results do not permit a decisive 
answer. One reason is that there is lack of 
confirmation between patient measures and 
observer reports concerning whether the sam- 
ple as a whole showed significant changes 
from initial status over the treatment period. 
Independent ratings by therapists and social workers indicated significantly decreased severity of patients’ disorders as compared to pretreatment status. Yet none of the four directly assessed patient criteria showed any such significant changes. There are at least two plausible explanations for the lack of confirmation: first, patients and observers may be reporting correctly but responding to different events; second, there may be a systematic bias on the part of the observer clinicians to report improvement after many months of therapy. Because of the possibility of rater bias, there is some doubt as to whether this sample of patients changed appreciably from initial status. At any rate, it is clear that the two severity ratings did not change as a function of appropriate or inappropriate treatment schedules even though they did change significantly from initial status.

Alternatively, therapy on “suitable” and “unsuitable” schedules represents psychotherapy conducted under two conditions: effective and ineffective. Thus, an equally pertinent question is: do patients treated under conditions therapists regard as effective actually respond, or respond more favorably than patients treated under less effective conditions? It is this question that therapists often raise in interpreting studies where patients are randomly assigned to different treatments. The evidence suggests that over the period studied, both conditions are, perhaps, slightly effective, but the supposedly favorable condition has no advantage.

Similarly, the therapists’ opinions about suitability of an assigned treatment schedule failed to relate to patient characteristics assessed by administering objective psychological tests or obtaining actuarial information. Ten such characteristics were included, and most of them have been cited in the literature as related to readiness, prognosis, or ability to participate in the task of psychotherapy.


In a previous study, Hollingshead and Redlich (1958, pp. 269-270) found social class related to scheduled frequency of psychotherapy interviews. In an unpublished survey, Lorr and McNair found that education, an important component of social class indices, related to scheduled interview frequency. The present findings might appear to
partially disconfirm the above findings, since 
education did not relate to therapists’ pref- 
erences for more or less frequent interviews 
with the patient. The apparent disagreement 
may be due to the fact that the former of the 
above studies was based on private practi- 
tioners’ patients and the latter was based on 
a broad sample of the total VA Mental Hy- 
giene Clinic population, whereas the present 
study included only patients who met certain 
screening criteria for inclusion in the study, 
including the criterion of acceptability for in- 
tensive psychotherapy. Secondly, early termi- 
nators were eliminated from the present sam- 
ple which included only patients remaining in 
treatment for 4 months or longer. Thus, the 
finding that therapists’ suitability judgments 
did not relate to education or other patient 
characteristics may hold only for patients 
who remained in therapy at least 4 months. 
It can be argued, however, that the eliminated
patients are probably not amenable to the
usual methods of psychotherapy, and the 
question of effectiveness of different psycho- 
therapy schedules with such patients is hardly 
apropos. 

The concentration of significant differences in therapists’ ratings and reports suggests that therapists’ reasons for preferring one treatment schedule to another with given patients lie more with the therapists than with the patients. The preferences appear to be associated with personal reactions to patients (such as liking), with presumably objective clinical judgments about patients (such as severity of illness), and with practical and theoretical opinions about the task of doing therapy (such as an aversion to biweekly interview schedules). Whether therapists, in practice, actually arrange or schedule interview frequencies on the basis of such factors is not answered by this study. Therapists in the study had no control over the treatment frequency as long as the patient remained in the study. They simply rated whether an assigned interview schedule of therapy was appropriate for the patients’ problems, and indicated preferences for more or less frequent interviews. The importance of therapist factors in actual practice is certainly implied, but no more than that.

An “ideal” or optimum treatment schedule
might be described as the one which most ef- 
fectively shapes patient behavior in the direc- 
tion of selected criteria improvement. Whether 
an optimum schedule can be selected for an 
actual patient is an exceedingly complex ques- 
tion which involves, in something like the fol- 
lowing order: knotty and unanswered prob- 
lems concerning criteria of improvement, ques- 
tions of whether different interview schedules 
actually produce different results, questions of 
whether kinds or types of patients respond 
differentially to different schedules, and prob- 
lems of determining predictors of those pa- 
tients who respond “best” on different sched- 
ules. The present results imply, as might natu- 
rally be expected, that in the absence of more 
valid answers, therapists rely on their own 
learnings and judgments in expressing pref- 
erences for interview schedules. The evidence 
is unimpressive that they can accurately pre- 
dict which schedules will most effectively alter 
the patients’ behavior, but only future re- 
search can provide any sounder bases for 


making such decisions. 


SUMMARY 


One hundred and thirty-three outpatients 
accepted for psychotherapy at seven clinics 
were randomly assigned to three treatment 
frequencies. Patients were assessed before 
treatment and following 4 months of psycho- 
therapy. After the initial interview, therapists 
rated suitability of the assigned treatment 
schedule for the patients’ conditions. The 
hypothesis that patients treated under con- 
ditions considered favorable by therapists 
(Group S) would improve more than patients 
treated under unfavorable conditions (Group 
U) was tested on 10 criteria. The evidence 
did not indicate that these groups responded 
differentially to 4 months of psychotherapy. 
As evidence of any treatment effect with this 
sample of patients is indefinite, it remains in- 
determinate whether therapists can or cannot 
judge accurately how many interviews per 
week are needed to obtain optimal treatment 
effects. Treatment effects must be demon- 
strated in order to assess the validity of 
therapists’ judgments. 

Suitability ratings appear to be associated 
with therapists’ personal reactions to patients,
clinical judgments about patients, and opin- 
ions about the task of doing therapy. Suit- 
ability ratings did not relate to 10 objectively 
assessed patient characteristics. 
THE RELATIONSHIP OF FACTORS IN PARENTAL
RATINGS OF SELF AND EACH OTHER TO THE
BEHAVIOR OF KINDERGARTEN CHILDREN AS
RATED BY MOTHERS, FATHERS, AND
TEACHERS


WESLEY C. BECKER¹


University of Illinois 


As part of a long term investigation of parent-child relationships, a group of psychologists from the University of Illinois Psychological Clinic have been studying a number of approaches to the problems of measuring aspects of parent and child behavior critical in understanding child development. These approaches have included adaptations of interview rating procedures developed at the Fels Institute (Baldwin, Kalhorn, & Breese, 1949) as well as those developed by Sears (Sears, Maccoby, & Levin, 1957), projective tests, the Guilford personality inventories, parent attitude questionnaires, direct observations of parent-child interaction under controlled conditions, objective measures of child behavior, and ratings of child behavior.

The present report discusses the development of two rating instruments, one for child behavior and one for parent behavior, and the relationships between the measures derived from these two instruments.


SUBJECTS


The subjects (Ss) were drawn primarily from five kindergartens in Champaign, Illinois. Out of 123 families represented in these kindergartens, 64 agreed to participate. Teachers’ ratings were obtained on all 123 children, while complete data were obtained on 60 families. Teachers were paid $2.50 per hour and each family was paid $10 for participating in the larger study. Eleven families with kindergarten children who came to the Psychological Clinic for help with behavior problems were added to the sample for the analyses of relationships between parent and child measures. None of these additional Ss was a severe problem and they did not differ from the nonclinic Ss on such variables as age, education, and occupational level of father. Table 1 presents some of the sample characteristics for the 71 participating families and for 59 of the nonparticipating families. As indicated in Table 1, the sample is primarily middle class, and education of parents was a selective factor influencing participation. The sex division of the children was almost even in both groups.


1 The study is a part of a cooperative project with Donald R. Peterson, Donald J. Shoemaker, Leo A. Hellmer, and Zella Luria. The writer wishes to express his appreciation to the State of Illinois Department of Public Welfare for its support, and to Rogers Elliott, Jacquet..., Sally ..., ... Moelis, Geraldine Oldman, Ronald Krug, Irvin ..., ... Viorkowski, and Kenneth Stark for their assistance in gathering and analyzing the data.


TABLE 1

SAMPLE CHARACTERISTICS

                                        Participants       Nonparticipants
                                          (N = 71)            (N = 59)
Variables                                M       σ           M       σ

Age of mother                           32.4    4.93        32.8    4.92
Highest grade completed, mother         14.2    1.93        12.7    1.72
Age of father                           34.7    5.71        35.7    5.82
Highest grade completed, father         15.8    2.46        14.5    3.13
Warner Occupational Level, father        2.25    .97         2.86   1.38
  (1 = professional, 6 = laborer)
Age of child in months at time          69.3    3.85        69.8    3.71
  of parent ratings
A COMPARISON OF CHILD PERSONALITY FACTORS
OBTAINED FROM MOTHER, FATHER,
AND TEACHER RATINGS


This section discusses the development of a 
rating instrument designed to broadly sample 
the dimensions of child behavior. In addi- 
tion, the comparability of mother, father, and 
teacher ratings in terms of factor loadings 
and in terms of factor scores is evaluated. It 
is quite possible that though mothers, fathers, 
and teachers (using a common language sys- 
tem) might agree in terms of factor loadings 
on which rating variables covary, all three 
groups could markedly disagree on how to 
apply the ratings to a given child. This latter 
situation could arise because of different 
biases, different frames of reference, or be- 
cause each of the three groups observes dif- 
ferent modal sets of behavior. While several 
previous studies have used both parent and 
teacher ratings (Cattell & Coan, 1957a, 1957b; 
Peterson & Cattell, 1958), the writer believes 
that this paper represents the first systematic effort to compare ratings in terms of both factor loadings and factor scores, and to separately evaluate ratings by mothers and fathers.


The Child Rating Schedule 


The child rating schedule consisted of 72 bipolar, seven-point rating scales with antonym pairs of adjectives defining the extremes. Scales were selected to sample the personality domain as outlined by Cattell (1957), as well as the preliminary factors found by Osgood and Becker (unpublished) in the application of the semantic differential method to the study of personality. A number of variables particularly relevant to the behavior of five-year-olds, such as “ease of discipline,” were also included. To save space, a formal listing of the variables will not be given. However, their general nature may be ascertained from the tables to follow. Average time required for making the ratings was 5-10 minutes per child.


The Factor Analyses 


The general procedure followed was to find (with the aid of a high speed computer) the intercorrelations among the 72 variables, to extract centroid factors, using fixed-unit communalities, and then to rotate to orthogonal simple structure using the varimax criterion (Kaiser, 1958). Separate analyses were determined for the mother ratings (N = 64), the father ratings (N = 60), and the ratings by each of two sets of teachers (N = 123). Thus, four separate factor analyses were involved. Six teachers participated in the teacher ratings: two rating 60 children, two rating 47 children, and two rating 16 children. The pairs of teachers with the larger groups usually had two classes, one in the morning and one in the afternoon. Before combining teacher rater groups, mean differences in ratings for the 72 variables were examined both within groups and between groups to decide whether it was reasonable to convert to standard scores by teachers before combining. Combining standard scores has the advantage over combining raw scores in that it is then assured that the covariance is not due to mean differences in rater frames of reference. However, if there are real mean individual differences between groups for some variables, this variance will be lost. The average absolute differences in mean rating between groups were .36, .42, and .50; while the average absolute differences in mean rating between teachers within groups were .59, .42, and .68. Thus, group differences were found to be less than rater differences within groups. As a result, scores were standardized by teachers before combining into Teacher Rater 1 (T₁) and Teacher Rater 2 (T₂) groups.
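The effect of standardizing by teacher before pooling can be illustrated with a small sketch. The data, the rater labels, and the function name `standardize_by_rater` below are hypothetical and serve only as an illustration; the study's actual computations were done on a high speed computer and are not reproduced here.

```python
from statistics import mean, pstdev


def standardize_by_rater(ratings_by_rater):
    """Convert each rater's ratings to standard scores (mean 0, SD 1).

    ratings_by_rater maps a rater label to that rater's list of ratings
    on one variable. Standardizing within rater removes differences in
    mean level and spread that reflect the rater's frame of reference.
    """
    standardized = {}
    for rater, scores in ratings_by_rater.items():
        m = mean(scores)
        s = pstdev(scores)
        if s == 0:
            raise ValueError(f"rater {rater!r} gave constant ratings")
        standardized[rater] = [(x - m) / s for x in scores]
    return standardized


# Hypothetical seven-point ratings: teacher_b rates every child higher.
raw = {
    "teacher_a": [2, 3, 4, 5, 3],
    "teacher_b": [4, 5, 6, 7, 5],
}
z = standardize_by_rater(raw)

# After standardization the two sets can be pooled without the mean
# difference between raters entering the covariance.
pooled = z["teacher_a"] + z["teacher_b"]
print([round(v, 2) for v in pooled])
```

As the text notes, the cost of this choice is that any real mean difference between the groups of children rated by different teachers is removed along with the rater bias.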
In the first stage of the analysis, 10 centroid factors were extracted from each correlation matrix. Generally, the decision of how many factors to rotate was based on an examination of the variance extracted by each factor. The cut was made about where the variance percentage leveled off and seemed to fluctuate randomly,


made to keep one 
than was like] 
tor to b 
cedure j 
attempt 


1958). Seven factors ja 
e except for fathers, whe 

rotated. ted 
he variance contributions of the rota 
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pos led to the discarding of one factor TABLE 2 
kom each parent analysis, and one from the Common Factor 1—HosTILE-WITHDRAWAL 
h analysis. Two factors were discarded from =——= = 
ate analysis. Examinations of factor plots i Factor Loadings of Salient 
i icated that two father factors contained Variables 
at One substantial hyperplane, so a hand ro- 

lon was made to eliminate an additional Variables MI FL Tyr TAV ots 


f a S 
ather factor. In a similar way, a small factor 


y imi > : 

on eliminated from the mother analysis and 4. Sociable-unsociable 79 76 78 79 78 

Sth Tı. The only other modification made 19, \Warm-cold 71 82 8381 (O86 

e A i b . 

two f analytic solution was the rotation of 8. Happy-depressed 69 77 76 59 70 

4 actors in the Ts analysis (Hostile-With- A Bemmi G 2 RR o 
sl. es sive-a X 

ne and Submission) by 24 degrees so that SP i A A 

ey would better line up with the factors 10. Loving-not loving 64 8i 71 55 68 

26. Colorful-colorless 49 83 72 56 65 


Sie in the other three solutions. Needless to 
me oo was an arbitrary procedure which 3. 
also have been used to improve other 
Matchings. On the other hand, failure to have 33. 
made this rotation would not appreciably have 18, Optimistic-pessi-, 60 65 66 4 58 


Extraverted-intro- 37 60 76 83 64 
verted 
Interesting—boring 5 S 66 53 63 


affected the results reported below. mistic 
Matchin pan 29, Real-unreal 64 73 58 20 54 
g of Factors 12, Trusting-distrust- 58 12 47 37 38 
There are many reasons why one might fail ing 


to k s 
e find matchings for particular rating scales, 23. Soft hearted-hard 
B different samples of behavior to observe, hearted 


ti ctent rater concept systems; different ral 17, Curious-uninguir- 58 86 56 47 (62) 
tis, biases of raters, different ro ing o o 3 
Scales of the ambiguous terms les ra e 2. Active-inactive — od s5 70 63 (59) 
Writer ue of these ares pene 22. Adventurous-timid 27 ue 69 68 = (57) 
or th id not spend ges ime ST coe oe Interested-bored 44 74 44 47 (52) 
e “best” simple structure not 72, Adjusted-mal- s9 69 37 35 (50) 


e goodness of 


57, for a dis: adjusted 


buy 
een “refined” statistics on th 
tor matching (see Cattell, 19 


CUssio 7 actical 16. Quick-slow 
n of thi m). The more practice 4 : 
Question E b ates or not the 34. Confidente 49 45 48 42 (46) 
numb eee son- tively inferior 
a er of sali arjables showing & rea D - ; 
© match See V aoii to give a reliable 69. Effective-ineffec- 309 3B a z (44) 
actor aha tive 
mir 3. Lik ldi- 32 535 4 4543) 
i i r fe ow- 43. Likes school-dis: 32 5 
% he salient variables for fout oon es poe oe 
Teasoni ~ in all four ana 
te enll <p ap sem ba Since unreli- 35. Formed-formless 33 62 53 20 (%83) 
ae ed 1 es . A ` i : ; 3 
ability of eee en lead to some insta- 37. Noisy-quiet 00 46 49 57 (38) 
ni i if parents and 27, Outgoing-self- 27 30 45 36 (34) 


cacy it factor loadings, even =a ser 
chers were using common systems, & CO 


of salient variable was defined as @ ite 
oy or higher appearing in three i Aaa fles M, F, Tu Tə, have been used to refer to 
Ors ghout the 2 
Which om compared. ae were used: mother, father, I. ae oe 1 z. Tenet 
tee ow these wa made positive Rater 2, respec + yl a l EAE = e has 
aA or loadings have | een arsed where been added to a a o cys signify hat a 
Se e variable descriptions " med ac- rotation was ma e | eyon: e varimax solu- 
Ssary; (b) factors have been na tion; and (e) as discussed later, parentheses 


Coeds j ; 
tding to the direction of & high score; (c) 


TABLE 3 


COMMON FACTOR 2: RELAXED DISPOSITION


Factor Loadings of Salient 


Variables 
Aver- 
Variables MIL FVII TVIL TAII age 
54. Tense-relaxed i 56 69 7 70 
64. Nervous-placid 68 35 69 80 63 
46. Excitable-calm 57 63 50 67 59 


56. Emotional-objective 68 19 46 66 50 
49, Anxious-nonchalant 42 43 58 47 48 
42. Fluctuating-stable 61 51 22 50 46 
40. Fearful-not fearful 34 35 56 42 42 


have been placed around factor loadings of 


variables not actually used in the computation 
of factor scores. 


Common Factor 1, presented in Table 2, 


TABLE 4 
COMMON FACTOR 3: LACK OF AGGRESSION


Factor Loadings of Salient 


Variables 
= Se aes 
Aver- 
Variables MV’ FIL TiV’ TV age 
11. Demanding-not 72 60 72 60 66 
demanding 
30. Prone to anger-not 70 70 53 47 60 
Prone to anger 
15. Jealous-not jealous 64 3 64 64 56 
71. Prone to tantrums- 54 60 67 28° 52 
not prone to tan- 
trums 
20. Impatient-patient 46 64 6 28 51 
28. Irritable-casy going 63 22 59 57 50 
47. Conceited-self- 2 3 my 45 
critical 
27. Self-centered-out- 43 11 56 57 42 
going ‘ 
60. Exhibitionistic— 27 38 51 34 38 
modest 
5. Cruel-kind 37 22 49 39 37 
61. Difficult todiscipline-57 64 45 10 (44) 
easily disciplined 
52, Disobedient-obedient 48 44 38 2 (33) 
35. Formed-formless oo 35 20 35 (32) 
38. Masculine-feminine 47 31 4 38 (30) 
loads most highly on such variables as unsociable, cold, aloof, not loving, introverted, pessimistic, and hard hearted. This factor is very similar to Cattell’s (1957) behavior rating Factor A, Cyclothymia-Schizothymia, although it is more general than A and includes most of the elements of Cattell’s Factor F, Surgency-Desurgency. This factor will be referred to as Hostile-Withdrawal versus Warm-Extraversion.

Common Factor 2 (Table 3) loads highly on relaxed, placid, calm, objective, stable, nonchalant, and not fearful. This factor contrasts a Relaxed Disposition versus a Nervous Disposition and closely parallels Cattell’s C factor, Ego Strength versus Proneness to Neuroticism.

Common Factor 3 (Table 4) is described by the following variables: not demanding, not prone to anger, not jealous, not prone to tantrums, patient, and easy going. The corresponding factor in Cattell’s system is Factor D, Excitability, Insecurity versus Emotional Maturity. Common Factor 3 will be referred to as Lack of Aggression.

Common Factor 4, as indicated in Table 5, is clearly a Submission-Dominance factor. Again there is a parallel to Cattell’s Factor E, Dominance.


TABLE 5 


COMMON FACTOR 4: SUBMISSION


= 


Factor Loadings of Salient 


Variables 
Aver 
Variables MUI FVE Ti Tel’ 48° 
58. Strong willed-weak 57 61 47 60 56 
willed 
59. Independent-de- 54 83 21 S4 53 
pendent 
Ji Dominant-submissive 57 50 55 4 5% 
22. Adventurous-timid 60 47 38 53 50 
13. Tough-sensitive as —10 os 72 #4 
37. Noisy-quiet 39 05 54 58 2 
2. Active-inactive 56 —01 32 38 3 
61l. Difficult todiscipline- 29 33 73 77 6? 
easy to discipline 
40. Not fearful-fearful 49 35 30 07 ee 
TABLE 6


FACTOR COMMON ONLY TO TEACHER RATINGS:
SCHOOLROOM INTELLIGENCE


Factor Loadings of 
Salient Variables 


Variables T,UI TAI Average 
intelli 7 5 82 
9. Dull minded-intelligent 79 a 5 
55. Subject to distraction-able 80 82 8 


to concentrate 


44. Poor memory-good memory 76 83 m 
69. Ineffective-efiective 75 76 i 
31. Meaningless-meaningful 75 pa k 
16. Slow-quick 56 68 62 
34. Subjectively inferior-self- ó4 ól 62 
confident 

50. Bored-interested 62 62 62 
35. Formless-formed 69 39 54 
39. Shallow-deep 61 44 52 
17. Uninquiring-curious 60 38 49 
70. Disorganized-organized 83 81 (80) 
63. Irresponsible-responsible 73 S2 ge 
66. Infantile-adult-like 60 (70) 

6. Conscienceless-conscientious 51 An (61) 
59. Dependent-independent 72 48 (58) 
65. Not helping-helping BS S 64) 
29. Unreal-real 48 58 (53) 
48. Disorderly-neat 51 53 (52) 
72. Maladjusted-adjusted 48 52 (50) 
33. Boring-interesting 51 45 (48) 
43. Dislikes school-likes school 34 53 (44) 
67, Obstructive-cooperative 39 50 (44) 


58. Weak willed-strong willed 38 34 




The final area of some common pattern is that of intelligence. An intelligence factor is clearly revealed in both sets of teacher ratings. Table 6 presents the salient variables and indicates not only that the agreement between teachers is exceptionally high, but also that the factor is much broader than what is usually measured by an intelligence test. Other variables load on this factor which might be expected to influence excellence of schoolroom adjustment, namely, responsibility, independence, morality, and adjustment. This factor will be referred to as Schoolroom Intelligence to distinguish it from test intelligence.

Since the separate analyses had provided considerable evidence for four common factors across all raters, an attempt was made to reproduce these factors from the averaged correlation matrix formed by treating each rater’s data as additional cases in computing the correlations. Thus, each element of the correlation matrix was based on 370 (123 + 123 + 64 + 60) elements. Because of the differences in N, the teacher ratings were doubly weighted in this analysis. Standard scores were used in computing these averaged correlations so that mean rater differences would not affect the correlations. Six varimax factors were found, only five of which were interpreted. The factors referred to as Hostile-Withdrawal, Relaxed Disposition, and Lack of Aggression were clearly extracted. However, the Submission factor was fused with the Schoolroom Intelligence factor. On the other hand, a factor emerged which had previously appeared as a more general factor cutting across the diagonals of the factor plots for Lack of Aggression, Submission, and Relaxed Disposition. The salient loadings on this factor are presented in Table 7. The factor is defined by such variables as disobedient, irresponsible, obstructive, difficult to discipline, disorganized, conscienceless, and lying. This factor is very similar to a Conduct Problem factor found on these same Ss by Peterson using another rating instrument. Although it would be possible to assess this variable by using a weighted combination of the four common factors above, it was decided that this factor was important enough clinically to merit direct scoring.


TABLE 7

COMMON FACTOR 5: CONDUCT PROBLEMS

                                                      Factor
Variables                                             Loadings

52. Obedient-disobedient                               65
63. Responsible-irresponsible                          65
67. Cooperative-obstructive                            65
61. Easily disciplined-difficult to discipline         60
70. Organized-disorganized                             60
65. Helping-not helping                                55
66. Adult-like-infantile
48. Neat-disorderly
43. Likes school-dislikes school
62. Attention avoiding-attention seeking               45
 6. Conscientious-conscienceless                       40
33. Truthful-lying                                     30
72. Adjusted-maladjusted                              (55)
55. Able to concentrate-subject to distraction        (55)
60. Modest-exhibitionistic                            (45)
50. Interested-bored
69. Effective-ineffective


Computation of Factor Scores 


At this point a decision was made to score 
for: (a) the four common factors, Hostile- 
Withdrawal, Relaxed Disposition, Lack of 
Aggression, and Submission; (b) the non- 
orthogonal Conduct Problem factor; and (c) 
Schoolroom Intelligence based only on teacher 
ratings. The variables used in scoring each 
factor are indicated in Tables 2 to 7 by the 
lack of parentheses around the factor load- 
ings. The goal in scoring was to avoid count- 
ing any variable for more than one factor. 
The most obvious reason for doing this was 


to avoid building in spurious correlations be- 
tween factors. The 


vious reason whi 
Even though or 
used (except for 
tor plots repeate 
ness in the hype 
liqueness, it is t 


structuring. 


In computing factor scores, the method of
standard score addition of salient variables,
weighted according to the size of factor loadings,
was used (see Cattell, 1957, for a discussion
of various ways of computing factor
scores).
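
The scoring rule just described can be illustrated with a short sketch. The Python below is not the program used in the study; it assumes population standard deviations for the z scores, and the variable names, ratings, and loadings are hypothetical values chosen only for illustration.

```python
from statistics import mean, pstdev


def standardize(values):
    """Convert raw ratings to z scores (mean 0, SD 1) across subjects."""
    m, sd = mean(values), pstdev(values)
    return [(v - m) / sd for v in values]


def factor_scores(ratings, loadings):
    """Sum the loading-weighted z scores of the salient variables.

    ratings  -- dict: variable name -> list of raw ratings, one per subject
    loadings -- dict: variable name -> loading (salient variables only)
    """
    z = {name: standardize(ratings[name]) for name in loadings}
    n_subjects = len(next(iter(z.values())))
    return [
        sum(loadings[name] * z[name][i] for name in loadings)
        for i in range(n_subjects)
    ]


# Hypothetical ratings of three children on two salient variables.
ratings = {"obedient": [1, 4, 7], "responsible": [2, 4, 6]}
loadings = {"obedient": 0.65, "responsible": 0.65}
scores = factor_scores(ratings, loadings)
print(scores)
```

Because each variable enters only one factor in the procedure described above, a separate `loadings` dictionary with non-overlapping keys would be supplied for each factor.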


Comparison of Factor Scores 

The comparison of factor loadings indicated a fair degree of
convergence of factor patterns. This convergence does not tell one,
however, whether parent and teacher ratings on these factors can be
considered equivalent. This question can only be answered by
examining the correlations among the factor scores.

Table 8 presents the intercorrelations for the five common factors,
plus the Schoolroom Intelligence factor based on an N of 71.
Correlations above .23 are significant at the .05 level and those
above .31 are significant at the .01 level. In preparing Table 8,
the factor scores for the two teachers were averaged. Uncorrected
reliabilities of the average teacher factor scores are presented in
the teacher-teacher diagonal.²

Table 8 permits the reader to examine the obliqueness of factors and
the comparability of factors across raters. The correlations critical
for examination of the comparability of parent and teacher factor
scores are italicized in the diagonals of Table 8. The average
correlation (using the z transformation) for Factors 1 to 5 was .76
between teachers, .51 between parents, .31 between mothers and
teachers, and .28 between fathers and teachers. The average
correlation for pooled parent ratings with the pooled teacher ratings
was .34. These findings clearly indicate that parent and teacher
ratings have very little in common (8-12% of the variance). However,
the reader should note that all correlations are significantly
different from zero at the .05 level or better. The fact that parents
can agree on what their child is like to a greater extent than either
can agree with the teachers indicates that the trouble is not just
unreliability of parent ratings. Indeed, it is probably reasonable to
assume that the interparent correlations would be as high as those
for teachers if each parent had the same sample of child behavior to
observe and rate. If one accepts this assumption, then it is
necessary to also accept the conclusion that different information is
contained in mother and father evaluations of the same factors on the
same children, since the average parent correlations are
significantly lower (at least the .01 level) than the average teacher
correlations.
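
The averaging of correlations through the z transformation, and the conversion of a correlation to a percentage of common variance, can be sketched as follows. This is an illustration rather than the original computation; the five input values are the mother-teacher diagonal entries as they can be read from Table 8, and the function names are hypothetical.

```python
import math


def average_correlation(rs):
    """Average correlations via Fisher's z (atanh), then back-transform."""
    z_mean = sum(math.atanh(r) for r in rs) / len(rs)
    return math.tanh(z_mean)


def common_variance_percent(r):
    """Percentage of variance shared by two measures correlating r."""
    return 100 * r ** 2


# Mother-teacher correlations for Factors 1 to 5 as read from Table 8.
mother_teacher = [0.33, 0.26, 0.30, 0.26, 0.40]
avg = average_correlation(mother_teacher)
print(round(avg, 2))  # 0.31

# Correlations of .28 and .34 bound the range of shared variance.
print(round(common_variance_percent(0.28)), round(common_variance_percent(0.34)))  # 8 12
```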


Additional Measures of Child Behavior 


An additional four rating factors developed by Peterson as a part of
this same project were available for relating to parent behavior.
Peterson had parents and teachers rate the children on 20
Cattell-type bipolar scales (see Cattell, 1957, Appendix 1) which
have fairly

² The parallel (uncorrected) reliabilities for the ... were .70, .44,
.70, .78, .83, and .8...

TABLE 8
INTERCORRELATIONS OF FACTOR SCORES

                 Teacher Factors                Mother Factors             Father Factors
              1    2    3    4    5    6      1    2    3    4    5      1    2    3    4    5
Teacher
    1        80  -31  -47   38   52  -66     33  -17  -33  -26   24     26  -22  -34  -07   40
    2             59   51   11  -29   11    -20   26   19  ...  -09    -16   23   28  -15  -16
    3                  73   41  -70   28    -04   08   30   52  -27     04   12   24   32  -25
    4                       70  -20  -49     15  -07   04   26  -11     04   04  -12   25   07
    5                           ...  -67     03  -06  -26  -38   40     14  -14  -26  -17   30
    6                                 80     00   03   12   16  -14    -03   09   20   04  -26
Mother
    1                                            -16  -48   21   44     59  -28  -34   33   33
    2                                                  44   16  -34    -29   34   26  -14  -26
    3                                                       36  -67    -30   30   49   15  -39
    4                                                           -26     18   04   13   61  -03
    5                                                                   39  -19  -45  -04   56
Father
    1                                                                       -45  -50   51   65
    2                                                                             52  -13  -37
    3                                                                                 -14  -67
    4                                                                                      ...



Note.—The factors named in terms of direction of high score are:
1. Hostile-Withdrawal, 2. Relaxed Disposition, 3. Lack of
Aggression, 4. Submission, 5. Conduct Problem, and 6. Schoolroom
Intelligence. The diagonal values for teachers are the correlations
between factor scores derived separately for Teacher 1 and Teacher 2
(i.e., before pooling).


specific behavioral descriptions defining end points. On the basis of
a previous analysis, Peterson (unpublished) scored the 20 scales for
the two largest factors, Adjustment and Extraversion. The Adjustment
factor is briefly described by the following adjectives: patient,
..., mannerly, good natured, calm, ..., not jealous, cooperative,
scrupulous, trusting. The Extraversion factor is described by: frank,
happy-go-lucky, energetic, friendly, bold, cheerful, assertive,
gregarious, composed, prefers companions of the opposite sex.

Peterson has also developed a 58-item problem checklist (as yet,
unpublished) which both parents and teachers filled out. Two of the
factors from this checklist were included in the present analysis.
They are Conduct Problem (disruptiveness, attention seeking,
fighting, irritability, destructiveness, etc.) and Personality
Problem (inferiority feelings, anxiety, ..., depression, etc.). These
four factors from Peterson's rating procedures were available for
pooled parents and for pooled teachers.

As would be expected from the descriptions, there is considerable
overlap between Peterson's child evaluations and those developed by
the present writer. This overlap can be a virtue in that it allows
one to check the degree to which a given finding is specific to one
form of rating scale and therefore perhaps fortuitous. For example,
the interparent, interteacher, and parent-teacher correlations
(Table 9) on these four variables show practically an identical
pattern to those just discussed from Table 8. The findings presented
in Table 9 indicate that the conclusion drawn from Table 8 is not
restricted to the particular rating schedule used.


TABLE 9

SUMMARY OF PARENT-TEACHER CORRELATIONS
OBTAINED BY PETERSON

                            Factors (a)                  Common
Raters                    7    8    9   10   Average   Variance
Teacher 1-Teacher 2      82   68   75   64      73        53
Mother-Father            48   52   62   64      57        32
Parents-Teachers         44   24   29   26      31        10

a Factor definitions: 7. Conduct Problem, 8. Personality Problem,
9. Adjustment, and 10. Extraversion. The factors are numbered the
same way as in Table 10 to facilitate cross-reference.


Factors among the Child Factor Scores 


To simplify the interpretation of the dependencies among the child
scores and to facilitate the later analysis of relationships to
parent behavior, the 24 child factor scores were factored using the
centroid method with varimax rotation. Table 10 presents the loadings
of each child score on the four second-order factors. These four
factors are readily interpreted as teachers' evaluations of
Personality Problem behavior and Conduct Problem behavior, and
parents' evaluations of the same two kinds of behavior. Factor scores
for the second-order factors were computed by a simple addition of
scores for the variables in Table 10 showing factor loadings which
are not in parentheses.


THE ANALYSIS OF FACTORS IN PARENTAL
RATINGS OF SELF AND EACH OTHER


One disappointment with our earlier research
was with the Guilford inventories, particularly
for fathers (Becker, Peterson, Hellmer,

TABLE 10
INTERDEPENDENCIES OF CHILD FACTOR SCORES
Varimax Factor Loadings
Variables TPP TCP PPP PCP
Mother Ratings of Child 
1. Hostile-Withdrawal (27) - (28) 05) 
2. Relaxed Disposition = = 5 = 4 
3. Lack of Aggression =- = = A 
4. Submission - (—43) 50 z 
5. Conduct Problem - - = os 
Father Ratings of Child 
1. Hostile-Withdrawal - - (30) (39) 
2. Relaxed Disposition š 5 (—35) = 38 
3. Lack of Aggression - = = —68 
4. Submission = 5 60 T 
5. Conduct Problem 5 i = 65 
Pooled Parent Ratings 
7. Peterson Conduct Problem = (39 x 68 
8. Peterson Personality Problem - ) 64 ze 
9. Peterson Adjustment = 2 —12 
10. P Ex i 7 p - 
0. Peterson Extraversion = (32) —55 
Pooled Teacher Ratings 
1. Hostile-Withdrawal = 
2. Relaxed Disposition a (42) 32) = 
3. Lack of Aggression < an iS az 
4. Submission 79 ae = = 
5. Conduct Problem 2 a E = 
6. Schoolroom Intelligence -71 E 3 = 
7. Peterson Conduct Problem - a 0) = ge 
8. Peterson Personality Problem 80 = = 
9. Peterson Adjustment = “99 = 
10. Peterson Extraversion —90 a 2 


a The abbreviations stand for Teacher Personality Problem (TPP),
Teacher Conduct Problem (TCP), Parent Personality Problem (PPP), and
Parent Conduct Problem (PCP).


Shoemaker, & Quay, 1959). When the two main child problem factors
(Personality Problem and Conduct Problem) were examined for
relationships to mother's and father's behavior, only three Guilford
variables were found to load .30 or higher (.30, .31, and .32).
Examination of the correlations between the 13 Guilford factors and
6 child variables indicated that only 2.6% of the father correlations
were significant at the .05 level, while 18% of the mother
correlations were significant. The mother correlations mainly related
mother anxiety and hostility to child conduct problems. Considering
the fact that groups were extremes on adjustment variables (clinic
versus well adjusted families), the magnitude of the correlations
(under .40) was not impressive. These findings, coupled with a
paucity of associations between the Guilford scales and factors in
the Fels Parent Behavior Scales, led to the belief that the
generalized self-appraisal factors measured by the Guilford
inventories have little to do with what parents do (or say they do)
as parents.

To further explore the value of self-ratings, the hypothesis was
generated that self-rating factors more directly associated with the
parental role would show more relationships to child behavior. A
rating schedule was devised to test this hypothesis (informally) and
at the same time to systematically examine interparental
perceptions³ in relationship to child behavior.


The Parent Rating Schedule

The rating schedule consisted of 73 bipolar seven-point rating scales
with antonym pairs of adjectives defining the extremes. Scales were
selected to sample (a) the factors represented in the Fels Parent
Behavior Rating Scales (Roff, 1949), (b) Cattell's personality
factors (Cattell, 1957), and (c) the factors found by Osgood and
Becker (unpublished) using the semantic differential to assess the
self-concept. The more unusual adjective pairs, such as real-unreal,
are from this latter source. For each of the 73 variables, parents
were asked to rate the following concepts (father variations are in
parentheses):

1. My-self-in-relationship-to-my-child
2. My-self-in-relationship-to-my-husband (wife)
3. My-husband (wife)-in-relationship-to-my-child
4. My-husband (wife)-in-relationship-to-me
5. The-person-I-would-like-to-be-in-relationship-to-my-child
6. The-person-I-would-like-to-be-in-relationship-to-my-husband (wife)

³ The usage of the terms perception, attitude, and behavior needs
clarification. Ratings may reflect the percepts or attitudes of the
rater as well as the behavior of the person being rated. With only
one rater, it is not always possible to tell which one is dealing
with. We have tended to call variables ratings or perceptions if no
other evidence is available, self-attitudes if the ratings are
discriminately related to other variables, and behavior if there is
consensual agreement with another source. On occasion our choice of
terms has been arbitrary so as not to encumber communication.

The instructions, besides explaining the gen- 
eral nature of the scales, encouraged the par- 
ents to think carefully about the particular 
relationship before making the ratings. Parents
were also advised to beware of halo-type
biases (not stated in these terms, of course)
and were told that the results would be confidential.
Finally, they were instructed not to go back 
once a rating had been made. 

The average time for the ratings was 40-60 
minutes. Typically, fathers made their ratings 
at home after a 3-hour interview and testing 
session which the complete project involved. 
Mothers made their ratings during a second 
visit to the clinic. Parents were asked not to 
discuss ratings with one another until both 
had finished. There is no reason to doubt
that this request was followed.


Factor Analysis of the Rating Schedule 


In a previous pilot study, Silverstein (1958) had demonstrated the
feasibility of using a single factor-scoring system to score each
relationship concept for each parent. A single scoring system is
necessary if cross-concept comparisons are to be made. With the aid
of an electronic computer, correlations were computed by summing over
Concepts 1-4 (listed above) and over persons (both mothers and
fathers). Each correlation entering the factor analysis was based on
ratings from 62 mothers and 60 fathers over four concepts or 488
elements in all. The ideal relationship concepts were left out
because it was not believed they would have enough variance to make
computation of factor scores worth-


516 Wesley C. Becker 
TABLE 11
DESCRIPTION OF THE VARIMAX FACTORS IN PARENTAL RATINGS OF SELF AND EACH OTHER
Loading Description Loading Description 
z Š 45 Not-prone-to-anger........ prone to anger 
Factor 1—Hostile-Withdrawal i Dats] 5 "arbitrary 
i7 unloving 38 Conscientious............. conscienceless 
68 . -aloof 
68 cold Factor 5—Commonness (low IQ?) 
67 Emotionally close.......... detached 66 Formed... Aovialess 
OS) Devoted seein s rejecting 62 Individualistic. vordinary. 
63 esta a seme 54  Curious..... oo 
61 ‘Sociable... ia reer 51 Deep..... shallow 
53 Approving - disapproving 48 Flexible... „inflexible 
2 oe es) K pond 45 Adventurous. „timid 
5i ooperative. . . -hostile 45 ia. clay 
49 Meaningful... . -meaningless 39 cac SOR unreal 
48 Vigorous.. . inert 


46 Outgoing... .. -self-centered 
44 Soft hearted.............. hard hearted 


Factor 2—Dominance-Strictness
71 Permissive ........ strict
64 Mild ........ severe
... ........ dominant
... ........ authoritarian
60 Nondemanding ........ demanding
57 Pliable ........ rigid
56 Nonrestrictive ........ restrictive
53 Uncritical ........ critical
50 Sensitive ........ tough
50 Easy going ........ irritable
49 Soft ........ hard
... ........ strong willed

Factor 3—Nervousness
71 Confident ........ unsure
67 Clear ........ confused
64 Relaxed ........ tense
64 Happy ........ depressed
60 Fearless ........ fearful
53 Energetic ........ fatigued
48 Free from guilt ........ guilt ridden
43 Calm ........ excitable
... ........ pessimistic

Factor 4—Immature, Aggressive Emotionality
54 Level ........ fluctuating
54 Patient ........ ...
51 Consistent ........ inconsistent
46 Understanding ........ ...

Factor 6—Solicitousness
59 Nonchalant ........ anxious
43 Not jealous ........ jealous
40 Underhelping ........ overhelping
.. Inactive ........ active
39 Lax ........ vigilant
37 Objective ........ emotional
37 Nonsuggesting ........ suggesting
... ........ suspicious

Factor 7—Nonprotectiveness
63 Sheltering ........ exposing
53 Protective ........ nonprotective

Factor 8—Harmony
46 Contentious ........ concordant
42 Punitive ........ nonpunitive
39 Conflicted ........ nonconflicted
38 Taking ........ giving
36 Threatening ........ reassuring

Factor 9—Social Effectiveness
48 Disorganized ........ organized
48 Colorless ........ colorful
47 Unsuccessful ........ successful
37 Boring ........ interesting

Factor 10—Playfulness
55 Serious ........ playful
... ........ humorous
... ........ satisfies curiosity


while. This belief later proved to be wrong. After computing
correlations, 15 centroid factors were extracted using fixed-unit
communalities. Ten factors, accounting for 50% of the total
variance,⁴ were then rotated to orthogonal simple structure using
the varimax criterion (Kaiser, 1958).

⁴ When fixed-unit communalities are used, the total common variance
is markedly overestimated and the percentage of common variance is
underestimated.
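
For readers unfamiliar with the varimax criterion, the following minimal sketch implements the raw (unnormalized) criterion by pairwise planar rotations of the kind Kaiser described. It is an illustration only: the loading matrix is hypothetical, Kaiser's row normalization is omitted, and nothing here reproduces the computer program actually used in the study.

```python
import math


def varimax(loadings, tol=1e-10, max_sweeps=100):
    """Raw varimax rotation by successive rotations of factor pairs.

    loadings -- list of rows (variables), each a list of factor loadings.
    Returns a rotated copy; each row's sum of squares is unchanged.
    """
    L = [row[:] for row in loadings]
    p, k = len(L), len(L[0])
    for _ in range(max_sweeps):
        largest = 0.0
        for i in range(k - 1):
            for j in range(i + 1, k):
                u = [row[i] ** 2 - row[j] ** 2 for row in L]
                v = [2 * row[i] * row[j] for row in L]
                A, B = sum(u), sum(v)
                C = sum(a * a - b * b for a, b in zip(u, v))
                D = 2 * sum(a * b for a, b in zip(u, v))
                # Angle that maximizes the criterion for this pair.
                phi = math.atan2(D - 2 * A * B / p,
                                 C - (A * A - B * B) / p) / 4
                largest = max(largest, abs(phi))
                c, s = math.cos(phi), math.sin(phi)
                for row in L:
                    x, y = row[i], row[j]
                    row[i], row[j] = x * c + y * s, -x * s + y * c
        if largest < tol:  # no pair needed further rotation
            break
    return L


# Hypothetical simple structure, deliberately blurred by a 0.3 rad rotation.
theta = 0.3
simple = [[0.8, 0.0], [0.7, 0.0], [0.0, 0.6], [0.0, 0.9]]
mixed = [[a * math.cos(theta) - b * math.sin(theta),
          a * math.sin(theta) + b * math.cos(theta)] for a, b in simple]
rotated = varimax(mixed)
print([[round(x, 3) for x in row] for row in rotated])
```

In this constructed example the rotation undoes the blurring, so each variable again loads on a single factor.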


The varimax rotated factors have been deposited with ADI.⁵ Because
each variable loaded at least one factor higher than .29, a complete
listing of the variables is represented in the description of factors
given in Table 11. In preparing Table 11, variable descriptions have
been reversed where necessary to make all loadings positive. The
original ordering was random with regard to direction. It should be
noted that Table 11 contains only the variables used in scoring each
factor. The label given a factor indicates the direction of a high
factor score. Factor titles have been given to facilitate
communication. The reader familiar with the literature will see many
similarities between the present factors and those reported
previously by Cattell (1957) and Roff (1949) in behavior ratings.

⁵ The rotated varimax factors have been deposited with the American
Documentation Institute. Order Document No. 6467 from ADI Auxiliary
Publications Project, Photoduplication Service, Library of Congress;
Washington 25, D. C., remitting in advance $1.25 for microfilm or
$1.25 for photocopies. Make checks payable to: Chief,
Photoduplication Service, Library of Congress.


TABLE 12
SPLIT-HALF FACTOR SCORE RELIABILITIES

                               Factors (a)
Concept               1   2   3   4   5   6   7   8   9  10
Mother
  1. Self-child      82  82  49  54  78  69  66  26  72  45
  2. Self-spouse     94  79  91  66  74  71  54  17  76  69
  3. Spouse-child    92  88  88  76  83  81  79  48  70  66
  4. Spouse-self     93  80  82  80  81  63  68  13  72  74
  5. Ideal-child     72  64  73  62  70  29  74  28  68  63
  6. Ideal-spouse    83  72  81  75  77  25  77  43  67  58
Father
  1. Self-child      87 ... ...  82  30  52  66  52  59  69
  2. Self-spouse     92  84  74  78 ...  69  65 ...  70  75
  3. Spouse-child    93  82  86  84 ...  56  78  25  52  83
  4. Spouse-self     94  87 ...  85 ... ... ... ... ... ...
  5. Ideal-child    ... ... ... ... ... ... ... ... ... ...
  6. Ideal-spouse    94  86  73  82  90  15  73  15  62  74

a Factor titles are: 1. Hostile-Withdrawal; 2. Dominance-Strictness;
3. Nervousness; 4. Immature, Aggressive Emotionality; 5. Commonness
(Low IQ?); 6. Solicitousness; 7. Nonprotectiveness; 8. Harmony;
9. Social Effectiveness; 10. Playfulness.
b Italicized variables were eliminated from further analyses.


Computation of Factor Scores

Factor scores were computed by the method of standard score addition
of salient variables, weighted according to factor loadings. In
computing factor scores no variable was counted for more than one
factor. This procedure avoided the building in of spurious
correlations among the factors due to correlated errors. As noted
earlier, even though an orthogonal solution was used and spurious
correlations were avoided, it is to be expected that the factor
scores derived by the method outlined above will tend to be
correlated to the extent that the factors actually required an
oblique solution.


Factor Score Reliability 


To find out whether the factors derived by summing over parents and
over concepts are applicable for both parents and for the various
concepts, two split-half estimates were made of each factor score,
and factor consistency coefficients were computed. Table 12 presents
a summary of these correlations as corrected by the Spearman-Brown
prophecy formula to give estimates of the reliability of the full
factor scores. As in the pilot study by Silverstein (1958), the
reliabilities were generally found to be quite high. On the basis of
the results given in Table 12, Factor 8 (Harmony) was eliminated
completely, and Factor 6 (Solicitousness) was eliminated for the
ideal-relationship concepts. Factors 3 (Nervousness) and 10
(Playfulness) had reliabilities below .50 for the
self-in-relationship-to-child ratings by mothers. To maintain
symmetry between the mother and father matrices (and thus facilitate
computations), these latter two variables were not eliminated.
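
The Spearman-Brown correction applied to the split-half correlations can be stated compactly. The sketch below uses the standard formula for a score lengthened by a given factor (2 for split halves); the input correlations are hypothetical and are not values from Table 12.

```python
def spearman_brown(r_half, length_factor=2.0):
    """Estimate full-score reliability from a part-score correlation."""
    return length_factor * r_half / (1 + (length_factor - 1) * r_half)


# A hypothetical correlation of .70 between half scores.
print(round(spearman_brown(0.70), 2))  # 0.82

# A half-score correlation of one third corresponds to a corrected value of .50.
print(round(spearman_brown(1 / 3), 2))  # 0.5
```

Read the other way, a corrected reliability below .50, the level mentioned above for two of the mother factors, implies that the two half scores correlated less than about .33.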


Relationships among the Parent Factor Scores 


The correlations among the parent scores
were computed in two 52-variable matrices
covering factors for the following relationship
concepts.


MATRIX A                      MATRIX B
Father 1. Self-child          Mother 1. Self-child
Mother 3. Husband-child       Father 3. Wife-child
Father 2. Self-wife           Mother 2. Self-husband
Mother 4. Husband-me          Father 4. Wife-me
Father 5. Ideal-child         Mother 5. Ideal-child
Father 6. Ideal-wife          Mother 6. Ideal-husband


Since the patterns of interdependencies tended
to repeat, the structure of the 2,652 correlations
in Matrices A and B can be summarized
by two small tables.
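
The counts given above follow from simple arithmetic, sketched here as a check. The breakdown of each matrix into four concepts scored on nine factors and two ideal concepts scored on eight is inferred from the eliminations described under Factor Score Reliability; the function name is hypothetical.

```python
def n_correlations(n_variables):
    """Number of distinct pairwise correlations among n variables."""
    return n_variables * (n_variables - 1) // 2


# Four concepts with nine factors each, two ideal concepts with eight each.
variables_per_matrix = 4 * 9 + 2 * 8
total = 2 * n_correlations(variables_per_matrix)
print(variables_per_matrix, total)  # 52 2652
```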

Table 13 presents the median correlations between relationship
concepts taken over the eight or nine factors. Two sets of
comparisons are of interest in Table 13. First, ... between self- ...
.44. Secondly, ... ratings of self ... role situations with spouse)
... mostly in the low .80s, considerable room remains for
differential correlations of child variables with
parent-in-relationship-to-child versus
parent-in-relationship-to-spouse variables.


TABLE 13

MEDIAN CORRELATIONS BETWEEN RELATIONSHIP
CONCEPTS OVER FACTORS


Matrix A 
OoOO 
r5 Fó 
Relationship Concepts M3 F2 M4 FS Fi 
e 5 21 
Father 1. Self-child 35 60 25 be 08 
Mother 3. Husband-child oe 10 18 
Father 2. Self-wife 42 p Ge 
Mother 4. Husband-me = 40 
Father 5. Ideal-child 
Father 6. Ideal-wife 
Matrix B 
aa eae 
Mó 
Relationship Concepts F3 M2 F4 M5 } 
ie 06 
Mother 1. Seli-child 36. 5433 3 08 
Father 3. Wife-child sao en ae 
Mother 2. Seli-husband 44 00 03 
Father 4. Wife-me 27 
Mother 5. Ideal-child 


Mother 6. Ideal-husband 


Table 14 presents the median correlations among the parent factor
scores across the relationship concepts (i.e., both Matrix A and B).
As was apparent in the original factor analysis, there is a
second-order factor among the parent factors defined best by the
correlations between the factors labeled Hostile-Withdrawal;
Nervousness; Immature, Aggressive Emotionality; Commonness; and
Social Ineffectiveness. The second-order factor might be interpreted
as a halo effect, an evaluative factor, or a social desirability
effect. Whether the factor is looked on as an undesirable bias or an
important part of personality depends on one's purposes.
TABLE 14 
MEDIAN CORRELATIONS BETWEEN FACTORS OVER RELATIONSHIP CONCEPTS 
Factor Title 2 3 4 š 6 7 9 10 
5 
1. Hostile-Withdrawal —59 
2. Dominance-Strictness = s s a = = >32 
3. Nervousness” a 16 —04 R L67 —40 
4. Immature Emotionality os >g =o E 60 = 
5. Commonness 48 meet 13 = 5 —52 
6. Solicitousness ae 16 E 26 
7. Nonprotectiveness =e a4 -2 
9. Social Effectiveness a 
10. Playfulness 


From the point of view of the writer, the factor will
be considered desirable if it correlates with
independent evaluations of child behavior and
a nuisance if it does not.


Additional Derived Scores 


Two additional kinds of scores were derived from the parent factor
scores. First, D² scores were computed between parallel sets of
parental rating factors: e.g., the difference (squared) between
father's ratings of self-in-relationship-to-child and mother's
ratings of husband-in-relationship-to-child. Interpersonal theory
would lead to the prediction that the greater the discrepancies in
interparental perceptions, the more likely the presence of problem
behavior in children.
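
A D² score of the kind described above can be sketched as follows. The text does not state whether the squared differences were kept factor by factor or summed over factors, so both forms are shown and neither is asserted to be the scoring actually used; the factor scores are hypothetical standard scores.

```python
def squared_differences(profile_a, profile_b):
    """Squared difference on each parallel factor."""
    return [(a - b) ** 2 for a, b in zip(profile_a, profile_b)]


def d_squared(profile_a, profile_b):
    """Sum of squared differences over all parallel factors."""
    return sum(squared_differences(profile_a, profile_b))


# Hypothetical standard scores on three parallel factors.
father_self_child = [0.5, -1.0, 0.0]      # father's ratings of himself
mother_husband_child = [1.5, -1.0, 2.0]   # mother's ratings of him

per_factor = squared_differences(father_self_child, mother_husband_child)
overall = d_squared(father_self_child, mother_husband_child)
print(per_factor, overall)  # [1.0, 0.0, 4.0] 5.0
```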

Scores were also obtained for the second-order evaluative factors
among the parental ratings. Evaluative factors were computed for the
parent-in-relationship-to-child concepts and for the
ideal-relationship concepts on the basis of a factor analysis of the
parent factor scores. To the extent these second-order factors
represent more than a specific rating bias (and there is evidence to
this effect), they will permit an examination of the relationship of
generalized positive or negative parental attitudes to child
behavior.


THE RELATIONSHIP BETWEEN THE PARENT
AND CHILD FACTORS

As a guide in examining the results, several hypotheses were
formulated from our previous research (Becker, et al., 1959;
Peterson, Becker, Hellmer, Shoemaker, & Quay, 1959) in addition to
those which have been discussed in previous sections of this report.
Hypotheses will be presented as data relevant to their tenability is
examined.

The data to be evaluated consist of 2,496 correlations between the
24 child measures and the 104 parent measures, the correlations of
the second-order child measures with the parent measures, and the
correlations of child measures with the D² scores and second-order
evaluative factors. Fortunately, there are several ways of dealing
with this quantity of data which will minimize capitalization on
chance relationships. The correlations between the 24 child measures
and the 104 parent measures will be discussed first.


Significance of Correlations by Blocks of
Variables


In Table 15, the percentage of correlations 
significant at the .05 level has been compiled 
for the various submatrices interrelating the 
104 parent and 24 child variables. Table 15 
permits one to appraise the probable signifi- 
cance of relationships between parent and 
child variables. However, interdependencies 
among the variables and correlated sampling 
errors prohibit precise probability statements. 
Table 15 serves as a guide for further elimi- 
nation of blocks of variables, permits a test 
of some hypotheses, and reveals other facts 
of interest. 

A large and consistent rating bias is ap- 
parent in the father correlations such that if 
he rates himself favorably, he also rates his 
wife and child favorably on most factors. 
This leads to a high percentage of significant 
correlations among all father ratings. The per- 
centages of significant correlations between 
father ratings of child and father ratings of 
self and spouse are 84%, 47%, 73%, and 
56%. When mother ratings of child behavior 
are compared with the same father ratings of 
self and spouse, the percentages are 53%, 
16%, 27%, and 33%. Thus, there is an av- 
erage drop of 33% in significant correlations 
when rater contamination is avoided. A simi- 
lar, but less marked, effect can also be demon- 
strated in the mother ratings. Obviously, it is 
necessary to control for this effect in analyz- 
ing the relationships between parent and child 
behavior. 
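
The size of the rater-contamination effect quoted above can be verified directly from the eight percentages given in the text. The variable names in this short check are hypothetical.

```python
# Percentages of significant correlations quoted in the paragraph above.
same_rater = [84, 47, 73, 56]       # father rates child, self, and spouse
different_rater = [53, 16, 27, 33]  # mother rates child, father rates self and spouse


def mean(values):
    return sum(values) / len(values)


drop = mean(same_rater) - mean(different_rater)
print(mean(same_rater), mean(different_rater), drop)  # 65.0 32.25 32.75
```

The difference of 32.75 percentage points rounds to the 33% reported.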

Support for two of the hypotheses guiding this research is gained
from Table 15. First, it was hypothesized that self-ratings more
directly associated with the parental role would show more
correlations with child behavior. While a direct comparison with the
previous findings using the Guilford inventories is not possible,
there are some comparisons within the present study which support the
hypothesis. Using only correlations uncontaminated by rater bias, the
percentage of significant correlations between child behavior and
parent ratings of self-in-relationship-to-child (F1, M1) was found to
be 24% for fathers and 9% for mothers. The corresponding percentages
for self-in-relationship-to-spouse were

TABLE 15

THE PERCENTAGE OF SIGNIFICANT CORRELATIONS (.05 level) BETWEEN PARENT AND CHILD
VARIABLES ANALYZED BY CONCEPT AND BY CHILD RATER


Child Variables by Rater 


Parent Relationship 


Teacher Teacher Mother Father Parents 


b 
Concept BO)" PY) BG) BG) Pd) Romea 
Father 1. Self-child (9) 07 17 53 84 53 = 35 
Mother 3. Husband-child (9) 15 14 51 3i 30 2 

Father 2. Self-wife (9) 02 00 16 47 19 17 14 
Mother 4. Husband-me (9) 04 06 33 di 08 12 

Mother 1. Seli-child (9) 06 06 42 16 28 19 24 
Father 3. Wife-child (9) 00 00 27 73 55 30 

Mother 2, Self-husband (9) 06 00 36 00 14 11 18 
Father 4. Wife-me (9) 00 03 33 56 30 24 

Father 5. Ideal-child (8) 06 06 20 13 25 14 11 
Mother 5. Ideal-child (8) 04 03 25 08 06 09 

Father 6. Ideal-wife (8) 00 00 10 10 00 04 6 
Mother 6. Ideal-husband (8) 00 00 25 03 13 08 


Column Means 04 


» Le., mother’s ratings of 
-in-rela- 


tionship-to-child relation- 


ships. 

The second hypothesis to gain support from Table 15 concerns the
relative significance of father's attitudes and behavior ... between
father and ... equal or exceed that for mothers. Table 15 reveals
that 35% of the correlations between father-in-relationship-to-child
variables and child behavior were significant, while 24% of the
corresponding mother correlations were significant. When only
uncontaminated correlations are considered, the figures are 22%
(60/270) for fathers and 9% (24/270) for mothers. Again, these latter
figures overweigh the low teacher correlations.
A further reduction in the number of correlations to be examined in
detail can be accomplished by a few summary statements. The reader
will find the structural layout of Table 15 helpful in understanding
the following points:

1. When the F2, F4, F6 and the M2, M4, M6 variables did correlate
significantly with child behavior, typically the same variables were
involved as were significant for the F1, F3, F5 and the M1, M3, and
M5 variables. Since the former correlations were fewer and lower, and
since they added no new information to the prediction of child
behavior, they will not be discussed further.

2. By eliminating from the remaining submatrices those blocks of
variables where the percentage of significant correlations is less
than 10%, as well as those subject to rater bias contamination, the
number of correlations to be examined is reduced to 346.

3. If the correlations of the four second-order child factors are
included for those parent variables surviving the above eliminations,
the total number of correlations to be given close attention is 459.
Of these 459 correlations, 122 or 26.6% reached the .05 level of
significance. These correlations are presented in Table 16.


Evaluation of Relationships between Parent 
and Child Factors 


Space does not permit a complete descrip- 
tive elaboration of the implications of the 
correlations in Table 16. As noted in Table 10, 
Child Factors 2, 3, and 5 load primarily on 
a second-order Conduct Problem factor and 


TABLE 16

SIGNIFICANT CORRELATIONS BETWEEN PARENT AND CHILD FACTOR SCORES


Child Scores* 


Rater Mother Parents Teachers 
Father 1. Self-Child 1 2 ae ee q 8 7 g 
p re 3 ae = 5. 2 - - 
1. Hostile-Withdrawal 46 —31 34 47 53 8 
2 i i 29 —30 -26 - 24 34 - - 
+ Dominance-Strictness 2 
3. N 36 - —30 - 31 H 24 30 - 
» Nervousness D 35 — 36 43 = i x 
4. Aggressive Emotionality 23 > =% ž 
5 29 =] = - - 30 31 - - 
+ Commonness (low IQ?) 2 - a -a 23 g - 
Fj Solicitousness =30 = 7 ne i = 2 - 
- Nonprotectiveness = m = = ” 
. Š a g - —35 —43 -31 - - 
9. Social Effectiveness —38 2 3s ne 31 —42 —37 2 = 
10, Playfulness —48 2 
Father Parents Teachers? 
Rater 
P 5 7 8 7 8 
Mother 3. Husband-Child 1 2 3 2 
5 = - 27 35 39 23 - - 
1. Hostile-Withdrawal a - _ ai 33 41 ma 32 25 
2. Dominance-Strictness F —39 a 40 44 = 38 = 
3. Nervousness , 36 j jy a 26 w = = 5 
4, Aggressive Emotionality = a = 35 wi 24 25 - - 
5. Commonness (low IQ?) > i = P pe = = a 
$ Solicitousness 7 i = = = = - =- - 
-~ Nonprotectiveness 36 B 26 =33 —27 =38 —27 = S 
i Social Effectiveness = _ h sa = = = i = 
9. Playfulness = 
Father Parents Teachers 
Rater 
— 1 2 3 4 5 7 8 7 8 
Mother 1, Self-Child 
m —24 - 25 42 = = - 
1. Hostile-Withdrawal W O Se CO a ë- m = 
2. ominance-Strictness E i —39 - 23 54 - = = 
3. Nervousness ; - E: ki = = 44 = = m 
$ Aggressive Emotionality = z = i a 28 ea = _ 
P ‘ommonness (loiv IQ?) E = be - - # ae = = 
A Solicitousness - zi = E po = i - 
9 Vonprotectiveness n an 30 a —28 —53 = = = 
10 Social Effectiveness = S = = - = = a - 
- Playfulness 
5 level. Correlations of .31 are significant at the .01 level. ada 
7 j ignificant at the .05 level val; 2. Relaxed Disposition; 3. Lack of Aggression; 4. Submission; 
5 w aao ene oF Red as Follows: 1, Host itet Broblem, second-order factor; 8. Personality Problem, second-order 
fotor ott Problems:6. Sehnolroam + evaluations of the children and mothers’ evaluation of father have been omitted to 
faci’ The 54 correlations between teachers’ ck8 hc rrelations in this submatrix were: (a) father Hostile Withdrawal correlated 
Zsilitate tabular presentation. The eight signife m fress correlated -31 with Child Factor 1, m27 with Child Factor 2 726 with 
Cathie Child Factor 1; (b) father Doors; (9 father Nervousness correlated —.29 with Child Factor 2, —.34 with Child Factor 3, 
1 Factor 3, and .23 with Child Factor >; 
and .34 with Child Factor 5.
TABLE 16—Continued


= Mother Parents Teachers 
ater 4 
8 
2 5 8 7 
Father 3. Wife-Child 1 2 3 4 5 f 
5 5 36 ae ” 
1. Hostile-Withdrawal 50 - —23 - 25 a6 4 3 É. 
2. Dominance-Strictness - = - - = ef z z 2 
3. Nervousness 34 = = 5 i A 25 a = 
4. Aggressive Emotionality 37 = - - = ae 34 3 ss 
5. Commonness (low IQ?) 36 - - - - ka E 3 a 
6. Solicitousness 35 - = ~ p a È: B vi 
7. Nonprotectiveness - - - - ae FA 7 7 a 
9. Social Effectiveness —48 - - = =82 = i i i 
10. Playfulness —48 - - —25 - —3 : 
Rater Mother Parents Teachers 
a 2 7 8 
Father 5. Ideal-Child 1 2 3 4 5 7 8 
1, Hostile-Withdrawal - - - - - - = a0 sai 
2. Dominance-Strictness - —23 - —31 - - —26 Ki 5d 
3. Nervousness 29 - —28 - - 27 = a K 
4. Aggressive Emotionality - - —28 - - - - = z 
5. Commonness (low IQ?) - - —24 - - - - - 7 
7. Nonprotectiveness - - - - = = = È Re 
9. Social Effectiveness —28 - 25 - - = = = 
10. Playfulness - = - - - - - = i 


Primarily on a Person- 
while Child Factor 1 


On the basis of [...] al., 1959; Peter-[...], it was
hypothesized that child Conduct Problems are related to
general maladjustment, open venting of negative emotions,
and arbitrary and inconsistent discipline by both parents.
The father-child correlations in Table 16 indicate that the
aggressive, uncontrolled, hard to discipline child (as seen
by parents) has a father who shows more Hostile-Withdrawal
(cold, aloof,


6 Although these second-order factors lead to some
confounding of rater bias effects when correlated
with parent behavior factors, it is possible to estimate
the degree of such influence by examining the
uncontaminated correlations of the first-order child
factors with the parent behavior factors.


rejecting); Dominance-Strictness (severe, authoritarian);
Nervousness (unsure, [...], tense); Immature, Aggressive
Emotionality (fluctuating, impatient, inconsistent, [...]
anger); Social Ineffectiveness (disorganized, [...],
unsuccessful); and Seriousness. The picture is much the same
whether mother evaluates father or father evaluates himself.
Only the Nervousness and Dominance-Strictness factors
correlate significantly with teachers' evaluations of
Conduct Problem. The mother correlations with child Conduct
Problem are very similar to those for father, suggesting a
picture of rejection, withdrawal, nervousness,
inconsistency, and irritable aggressiveness. The primary
difference between the mother and father correlations with
child Conduct Problem is that the Dominance-Strictness
factor appears to play less of a role for mothers. Over-all
the pattern of correlations strongly supports the
hypothesis.


Following the general behavior theory developed by the
Iowa-Yale group (Dollard & Miller, 1950; Sears et al., 1957;
Whiting & Child, 1953), Bandura and Walters (1959) have
hypothesized that frustration arising from a lack of
affectional nurturance and a punitive attitude on the part
of at least one parent are essential conditions for the
development of aggressive behavior. In addition, they
consider inconsistency in disciplinary practices and the
kind of model the parents present as critical factors. If
the father and mother markedly differ in demands and
expectations, or if they are inconsistent in these areas,
the child is left without clearly defined standards of
behavior. Bandura and Walters (1959) find strong support for
their position when extreme groups of aggressive and
nonaggressive adolescents are studied using primarily the
structured interview methods developed by Sears et al.
(1957). It is apparent that the present findings based on
self-ratings and cross-parental ratings lead to a similar
conclusion. Probably most impressive is the fact that the
behavior theory hypotheses appear to stand up even when the
behavior of most of the children falls within normal limits.
Child Personality Problem. Our empirical hypothesis
concerning child Personality Problem predicted associations
to measures of paternal maladjustment and autocracy and a
relative independence of maternal behavior. The results for
fathers tend to support only the maladjustment component of
the hypothesis. Both mother and father agree that fathers of
Personality Problem children show more Hostile-Withdrawal
(aloof, cold, rejecting), more Commonness (formless,
uninquiring, inflexible, timid), and more Social
Ineffectiveness (disorganized, colorless, unsuccessful).
Fathers add to the picture associations with Nervousness,
Nonsolicitousness, and Seriousness. Actually the father
picture is one of general social withdrawal not unlike the
Personality Problem pattern found in the child. No
associations are found between mothers' ratings of self and
child Personality Problem. However, when father rates
mother, several significant relationships to child
Personality Problem are found. In fact, when father
considers mother bad in most respects, the child is rated
more in the Personality Problem direction.
There are at least two possible explanations of the
differences between mother and father ratings of mother in
relationship to child Personality Problem. First, the
results might reflect father attitude rather than mother
behavior. However, correlations between the second-order
evaluative factors (discussed later) indicate that mother
tends to agree (r = .46) with father's evaluation of her.
The second possibility is that a covertly rejecting mother
defends from awareness her negative behavior which the
father clearly sees. This interpretation fits with findings
by Hewitt and Jenkins (1946) that overinhibited children
came from homes where father's discipline was inconsistent;
where father was hypercritical and unsociable; and where
mother was unsociable, dominating, and covertly rejecting.

From animal studies of conflict and punish- 
ment (see Diamond, 1957) and from the 
work of the Iowa-Yale group, personality 
problems (dependence, anxiety, and with- 
drawal) are to be expected theoretically where 
sufficient warmth and acceptance is provided 
to promote a strong dependency need, and 
where punishment for misdeed is strong and 
inconsistent, and where withholding of love is 
used as a means of discipline. Unfortunately, 
most of this theory has not been developed 
in a context of research where both parents 
were considered, so the differential relevance 
of mother and father is unclear from theory. 
The work of Hewitt and Jenkins (1946), the previous study
by Becker et al. (1959), and the present results all suggest
that maladjustment of the father as manifested by hostility
and social withdrawal is important in the development of shy
and anxious children.7

Father Ideal Relationship to Child. The 
ideal correlations in Table 16 were rather 
surprising, since one would expect ratings of 
an ideal concept to cluster around a social 
norm with little variance. That the ideal con- 
cept factors did have appreciable variance 
was forewarned by the reliability findings. 
The correlations given in Table 16 indicate 
that the variance in ideal ratings is related to 
child behavior in important ways. There are two interesting
things to note about these correlations. First, because the
F5 factors correlate lowly with the F1 factors (median .21)
and the M3 factors (median .05), new information relating to
the prediction of child behavior is contained in these
relationships. For example, with a simple equal-weight
addition of F1 and F5 scores, the correlations with child
behavior are increased for 10 of the 24 significant
variables in the submatrix based on father's ratings of self
(F1) and mother's ratings of child. Secondly, since the
direction of the correlations is the same as that found for
self-in-relationship-to-child correlations, these
relationships would not have been found if the ideal ratings
had been used in the typical way to compute discrepancy
scores. The correlations between father's
ideal-in-relationship-to-child and child behavior are all in
a direction to indicate that if father's conception of his
ideal relationship is more loving, democratic, and
emotionally mature, the child is rated as being better
adjusted, outgoing, less demanding, etc.

7 Initial results from our interview rating data suggest
further confirmation of this. Ratings of a Hostile
relationship of father to child correlate significantly
with both Conduct Problems (.36) and Personality
Problems (.31) when Peterson's problem checklist is
used to obtain the latter measures. The corresponding
correlations with mother's hostility were .37 and .04.
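
The remark about equal-weight addition of F1 and F5 scores can be illustrated with the standard formula for the correlation between a criterion and the sum of two standardized predictors. The sketch below is not the author's computation: the function name is hypothetical, and the two predictor-criterion correlations are invented for illustration. Only the .21 intercorrelation echoes the median reported in the text.

```python
import math


def composite_correlation(r1y: float, r2y: float, r12: float) -> float:
    """Correlation between y and the equal-weight sum of two standardized scores.

    r1y, r2y: correlations of each score with the criterion y.
    r12: correlation between the two scores.
    """
    return (r1y + r2y) / math.sqrt(2.0 + 2.0 * r12)


# Hypothetical predictor-criterion correlations (not taken from Table 16).
r_f1_child = 0.30
r_f5_child = 0.25

# Compare a low predictor intercorrelation with a high one.
for r_f1_f5 in (0.21, 0.80):
    r_sum = composite_correlation(r_f1_child, r_f5_child, r_f1_f5)
    print(f"r(F1, F5) = {r_f1_f5:.2f} -> r(F1 + F5, child) = {r_sum:.3f}")
```

With these invented inputs, the sum predicts better than the stronger single score only when the two scores correlate lowly, which is why low F1-F5 correlations leave room for new predictive information.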


Relationship of Interparental D² Scores to
Child Second-Order Factors


The factor score discrepancies between mother's and
father's perception of father-in-relationship-to-child
showed significant relationships to the second-order child
Conduct Problem factors as evaluated by parents (r = .31)
and by teachers (r = .27). The similar correlations for
father-in-relationship-to-wife were .31 and .30. The
relationships to Personality Problems for the above
discrepancies were insignificant. When the discrepancies
concerned the perception of mother in relationship to child
or to husband, the only significant correlation was between
the mother-in-relationship-to-child discrepancy and parental
ratings of Personality Problem (r = .23). This latter
correlation should not be taken too seriously without
cross-validation since only one of the four possible
relationships to child Personality Problem was significant.
On the other hand the relationships to child Conduct Problem
were significant in all four comparisons made. If [...]
consistency in discipline and lack of [...] can be assumed
to be related to small discrepancies in perception, then
these findings fit well with the behavior theory account of
the etiology of aggressive behavior.


Relationship of Parent Evaluative Factors to
Child Scores


The correlations between the parental second-order
evaluative factors and some of


TABLE 17


CORRELATIONS BETWEEN PARENTAL SECOND-ORDER EVALUATIVE FACTORS AND CHILD SCORES


Child Scores 


Rater Mother Parents 
Father Evaluative Factors 
F1. Self-Child : - : : : : = 
F1. Self-Child —48 29 =í os =n 
F3. Wife-Child -48 i 3 ® -u 2 a 
F5. Ideal-Child -o = @ SS = -a oO 
Fó. Ideal-Wife = i ig a oe =11 OF 
Rater Father Parents 
Mother Evaluative Fact a 
other Evalua’ 2 actors it = * 3 4 5 7 f am 
M1. Self-Child ~% o 3z = 3. 13 
M3. Husband-Child -33 -0 3 IB —26 = B 
MS. Ideal-Child =08 =i3 o 726 -34 m a 
M6. Ideal-Husband —21 10 14 i Aa y -2i 


* See Table 16 for description of child variables.
† High score indicates a positive evaluation.

the child scores are presented in Table 17. Because rater
bias contamination slightly inflates the correlations
between the evaluative factors and the child second-order
factors, correlations with uncontaminated first-order child
factors have also been included to permit the reader to
judge the effect of this inflation. It is apparent that the
general goodness of father evaluation of self- or
wife-in-relationship-to-child is related to a lack of both
Personality and Conduct Problems in the child. Similar
evaluations by mother are related to a lack of Conduct
Problems. The relationships of the evaluative factors in the
ideal-relationship ratings to child behavior confirm the
findings discussed earlier, except that child Personality
Problem is now found to be significantly associated with
mother having a more negative ideal of what she would like
to be in relationship to her husband. In view of the lack of
significant correlations of this mother ideal factor with
uncontaminated child factors, this finding should be viewed
with caution.

The reader is left to his own devices to pull out the
fuller implications of the correlations in Table 17. One
implication, apparent when the appropriate evaluative factor
is partialed out of the correlations in Table 16, is that
the second-order factors account for a large part of the
covariance with child behavior. This finding tends to
emphasize again the importance of the general parental
attitude of warmth or acceptance. One final set of
correlations should be of interest. The correlation between
mother's evaluation of father-in-relationship-to-child and
father's evaluation of himself was .40. The parallel
correlation for mother-in-relationship-to-child was .46.
These correlations indicate that the general evaluative
factors are more than just attitudes, but also reflect
behavior observable by the other parent.


Methodological Problems

The present findings give no information about causality.
It is just as possible that child behavior is generating
parental reactions as the contrary, or it is also possible
that similar genotypes are the basis for the found
covariations. The present methodology is defensible only as
a necessary first step toward finding truly causal
relationships.

The rather large difference in number of 
significant associations to mother and father 
behavior was somewhat surprising. Such a 
striking difference does not seem reasonable 
and the question of artifact must be consid- 
ered. If one assumes that father’s ratings of 
mother and child are less valid than father’s 
ratings of self or mother’s ratings of self, fa- 
ther, and child, then the obtained results are 
explicable. This hypothesis is plausible since 
most fathers do not spend much time at home 
when mother and child are interacting. Fur- 
ther support for this hypothesis is found in 
the fact that father’s ratings seemed to suffer 
more from a general bias effect than did 
mother’s ratings. Such an effect is most ex- 
pected when ratings are made with insuffi- 
cient information available. Perhaps other 
data now under analysis will shed light on 
this problem. 

SUMMARY 


Three problems were investigated: (a) the
interrelationships of factors in mothers', fathers', and
teachers' ratings of the same children; (b) the nature of
and interrelationships of factors in parental ratings of
self and each other; and (c) the relationship between the
factors obtained in a and b above. The primary sample
consisted of 71 kindergarten children and their parents.
Problem a was investigated by having mothers, fathers, and
two teachers rate the children on a 72-variable bipolar
adjective rating schedule. Two other kinds of rating
instruments were also used in evaluating child behavior.
Problem b was investigated by having parents rate the
following concepts on a 73-variable bipolar rating schedule:
myself-in-relationship-to-my-child,
myself-in-relationship-to-my-spouse,
my-spouse-in-relationship-to-my-child,
my-spouse-in-relationship-to-me,
the person-I-would-like-to-be-in-relationship-to-my-child,
and the person-I-would-like-to-be-in-relationship-to-my-spouse.

Highlights of the main findings are as follows:

1. While it was possible to find reasonable matchings
between mothers', fathers', and teachers' ratings of the
children in terms of factor loading patterns, the parent and
teacher scores for children on the same factors showed
little correlation. For five factors, the average
correlation was .76 between teachers, .52 between parents,
and .34 between parents and teachers. These findings
indicated a need to keep parent and teacher rating measures
separated in developing descriptions or testing explanations
of child behavior.

2. When parents were asked to rate their 
behavior with the child under study, rather 
than with spouse, many more significant as- 
sociations with child behavior were found. 
For example, using only correlations uncon- 
taminated by rater bias, 24% of the correla- 
tions between ratings of father-in-relationship- 
to-child and child measures were significant, 
while the similar figure for ratings of father- 
in-relationship-to-wife was 6% significant cor- 
relations. These results emphasized the im- 
portance of focusing directly on parental- 
role behavior rather than general personality 
in seeking relationships between parent and 
child behavior. 


3. The relationships of parent rating factors to child
behavior were significant, by and [...] of the children.
Also, [...] mind that even though [...] based mostly on
cross-parental ratings to avoid inflation due to rater bias,
one cannot assume that all judgmental bias has been
eliminated in the evaluation of child behavior.

4. On the basis of previous work it was expected that the
number of associations between father and child behavior
would equal or exceed that for mothers. This hypothesis was
confirmed. Again using only uncontaminated correlations 22%
of the father-in-relationship-to-child correlations were
significant while only 9% of the corresponding mother
correlations were significant. A possible artifactual
explanation for the marked difference was considered.

5. Father's ratings of his ideal relationship to his child
produced a number of significant correlations with child
behavior. If father's


Wesley C. Becker 


conception of his ideal relationship is more loving,
democratic, and emotionally mature, the child is rated by
mother as better adjusted, outgoing, less demanding, etc.
Interestingly, the father ideal ratings added new variance
in the prediction of child behavior from parent behavior.

6. The hypothesis that child Conduct Problems are related
to general maladjustment, open venting of negative emotions,
and arbitrary and inconsistent discipline on the part of
both parents was given considerable support.

7. The hypothesis that child Personality Problems are
related to paternal maladjustment and autocracy and are
independent of maternal behavior was only partly supported.
Maladjustment of the father, as manifested by hostility and
general social withdrawal, was found to be associated with
child Personality Problems. Associations with mother's
behavior depended upon who was rating her.

8. Discrepancies between factor scores (D²) for mother and
father ratings of father were found to be significantly
related to child Conduct Problems.

9. Analysis of second-order evaluative factors among the
parent factor scores revealed a number of significant
relationships to child problem behavior. There is a strong
indication that the general positiveness or badness of
parental attitudes is critical in child adjustment.

10. As in the previous study (Becker et al., 1959), the
findings point emphatically to the importance of the need
for more systematic study of the role of the father in child
development.


In contrast to research on psychotherapy which studies
outcome, or the substantive content of interviews and the
patient's reported life experiences outside of the
psychotherapy session, a number of recent approaches have
focused upon the immediate interactional behavior of
therapist and patient within the interview itself. The
descriptive dimensions within which the interactional
behaviors have been classified are multiple and varied,
ranging, for example, from Snyder's (1945) early study of
the effects of assumption of responsibility by the therapist
on the patient's release of feelings, expressions of
problems, insight, etc., to quantitative [...] during the
session. Recent studies, for example, have demonstrated that
[...] "approved" by the therapist [...] those "disapproved"
decrease throughout the course of therapy (Murray, 1956),
and that use of subtle verbal reinforcers by the interviewer
can significantly increase the frequency of affect responses
in a [...] schizophrenics (Salzinger & Pisoni, 19[...]).
Numerous authors (see the summary by Lacey, 1959) have found
changes in patient's autonomic responsiveness to be related
to therapist's behavior. In a series of studies utilizing a
standardized interview [...] first described by Chapple
(1953), Saslow, Goodrich, and Stein (1956), Saslow,
Matarazzo, and Guze (1955), Saslow and Matarazzo (1959), and
Matarazzo, Saslow, Matarazzo, and Phillips (1958) have [...]
demonstrated an interviewer effect on the temporal aspects
of the patient's [...] when the interviewer's own verbal
behavior is varied along the same time-unit parameters.
Goldman-Eisler (1952) also reports this effect in a somewhat
less restricted [...]. Similarly, Kanfer (1959) found
variations in verbal rate when interview topics and
experimenter's (E's) participation in the interaction were
controlled.

[...] School during the [...]
We wish to ack[...]
[...]dene Allen and Thomas Cunning[...]
[...] of the data.

When interviews are conducted with therapeutic intent,
interpretations have long been considered the most crucial
to successful outcome. In several psychoanalytically
oriented theories, it is held that the major changes in a
patient's behavior result from interpretations which make
conscious those unconscious motivational factors which
determine the patient's actions. Rogers (1942), on the other
hand, states that insight and consequent behavioral change
is "often delayed, and sometimes made impossible, by efforts
of the counselor to create it or bring it about" (p. 195).


While dispute thus exists as to the direction or
desirability of the effects of interpretations, there
certainly is none as to the presence of significant effects,
or the need to assess carefully their locus and quality.
Consistent with interest in immediate interactional
consequences of different classes of therapist actions,
interpretations may be viewed as having an impact on the
further course of the psychotherapeutic process not only as
a function of their substantive qualities or veracity, but
also through changing the patient's current interview
interactional pattern (cf. Auld & White, 1959; Speisman,
1959).

Especially in its early stages the therapeutic task is
characteristically defined as maintaining a flow of
communication between patient and therapist. Therefore, the
temporal dimension (duration) of the patient's verbal output
at any moment has repeatedly been considered as an important
measure of his response to various therapist operations.
Since a direct test of the effect of experimentally
controlled interpretations in psychotherapy presents many
methodological and practical problems, one possible approach
is to study the influence of interpretations during
standardized personal interviews, modeled after the clinical
interaction in which they are usually given. While several
alternative consequences of interpretations for the duration
of the interviewee's subsequent utterances can be considered
and further refinement of the definition of interpretations
as a class is also feasible, the basic hypothesis of the
present study is simply that interpretations, as a global
category of interviewer's behaviors, differ from exploratory
or information seeking statements in their immediate effect
on the interviewee's verbal output, in time units,
regardless of their differing content or their role in the
interview strategy. This implies that all interpretations
share to some extent a broad similarity with respect to the
interviewee's perception of their significance in the
immediate interpersonal relationship. Hence responses to
them as a distinct class of interviewer actions would also
be similar.

The direction of the change in the interviewee's duration
of utterances may also be a function of several other
factors which determine the interviewee's perception of the
interpretation, such as the length of familiarity with the
interviewer, the particular phase of psychotherapy, the
degree of elaboration with which it is given, the various
characteristics of the patient, etc. By using a single
initial interview in this study some of these variables were
brought under strict experimental control, albeit at the
expense of sacrificing some of the richness inherent in the
usual therapy sessions.

As has been mentioned, extensive work of 
several groups of investigators on normals 
and patients with measures of verbal behavior 
in time units has already clearly demonstrated 
the existence of functional relationships be- 
tween such variables as duration of speech 
and duration of silence on the part of inter- 
viewer and interviewee. Indeed, it was the 
recognition of such relationships which led 
Chapple (1953) to develop a partially stand- 
ardized interview in order to eliminate con- 
tamination and unreliability in the use of the 
interview as a research tool. This technique 
utilizes temporal variables, recorded by the 
Interaction Chronograph (Chapple, 1949), 
for which strikingly high reliability has been 
established both across different interviews 
and over time, for a number of patient and 
normal samples (summarized in Saslow & 
Matarazzo, 1959). Relationships have also 
been found between these time-unit charac- 
teristics of the interviewee’s interaction pat- 
tern and a variety of organismic variables 
(Matarazzo, Matarazzo, Saslow, & Phillips, 
1958). The present study permits exploratory investigation
of possible relationships between these temporal variables
and functional content within the same interview.
Specifically it asks whether changing what the interviewer
says (interpretation) while controlling how he says it
(time-unit variables) has any effect upon the stable
temporal aspects of the S's utterances. The latter are
already known to be responsive to modification of how (in
time units) the interviewer speaks.


METHOD 
Subjects 


The 60 Ss in this study were female volunteers drawn from
the hospital staff and student nurse populations of the
University of Oregon Medical School.2


Procedure 


Ss were told that they would be seen for a per- 
sonal interview, conducted by a visiting research 
psychologist who was not on the staff of the school, 
and that the interview would be confidential and 
had no relationship whatever to their status as stu- 
dents or nurses, respectively. 

The Ss were asked to talk freely about any topic 
and were told that the interview was designed to find 
out how to best conduct intervi i 5 


control 
À ere conducted individually 
» n an office with a one-way screen, 


Ene average duration 


Matarazzo et al, 
riods; 


sity of Oregon Medical School 
sistance in obtaining Ss, 


F. H. Kanfer, J. S. Phillips, J. D. Matarazz 


, and G. Saslow 


tional activities. The same topics were discussed in
each of the three interview periods for all groups.

The major independent variable was the modification of the
content of E's utterances in Period II. For all control
groups this period was identical in all respects to Periods
I and III. For all experimental groups, E introduced the
period by an information seeking question. He then followed
with 12 interpretive statements, separated by as many
noninterpretive (information seeking) comments as needed to
maintain the natural tone of the interview. For the purpose
of this study, interpretations were defined as any
statements, often phrased as questions, which: analyze;
relate several of S's described experiences, attitudes, or
feelings; generalize S's statements to suggest a determinant
of his behavior; or give a psychological explanation or a
motivational description. All intended interpretive
statements were signaled in advance by E to the observer and
recorded. No interpretive statement referred directly to S's
motor behavior in the interview or to his relationship with
E. The topics in this period were the same as in Periods I
and III and no new topic was introduced by E. The following
examples, randomly selected from the interview
transcriptions, illustrate the two classes of statements:


Exploratory (noninterpretive) statements:


1. I wonder if you can tell me a little bit more
about what kind of things you enjoy doing when
you are off duty.

2. You have told me about your family. Now I
wonder if you can tell me a little about your father,
what kind of person is he?

3. When you decided to go into nursing, what
kind of satisfactions did you think you were going
to get out of your work as a nurse?

4. Could you tell me a little about the [...]
things that you came across in your work which
seem irritating to you?


Interpretive statements:


1. It seems that you're the kind of person who
avoids hurting other people, sometimes even at
the expense of hurting yourself.

2. When you see a patient get better it makes up
for some of the suffering that you yourself had
to bear.

3. Doing a conscientious and thorough job makes
you feel better, and you feel that you are an
important person.

4. I imagine that you resented the fact that
sometimes when you are with him you almost feel
like a little child yourself.


In Period III E again limited his actions to exploratory,
information getting questions. At the end of the interview,
E discontinued his standardized [...] and asked several
specific questions concerning [...] feelings about the
interview in an informal manner.

Since it was expected from previous findings that change in
Action in response to the interpretation variable might be
related to the level of maturity and the extent of Ss' life
experiences, as well as familiarity with psychological
procedures, the additional variables of age and professional
status were introduced by replication of the procedure with
student nurses, and nurses who were working on the
Neuropsychiatric ward. The design resulted in the following
five groups:


1. Student nurses, experimental (interpretations given)
(SN-E), N = 15.

2. Student nurses, control (no interpretations given)
(SN-C), N = 10.

3. Graduate nurses, medical staff, experimental (GN-E),
N = 15.

4. Graduate nurses, medical staff, control (GN-C), N = 10.

5. Graduate nurses, NP staff, experimental (NP-E), N = 10.


With this design the control groups totaled 20 Ss, 
the experimental groups totaled 40 Ss. 
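
The five-group design can be tallied in a few lines to confirm the totals just stated. This is only an illustrative sketch: the tuple layout and variable names are hypothetical, while the labels and group sizes are those listed above.

```python
# Group label, condition, and N as listed in the design above.
groups = [
    ("SN-E", "experimental", 15),
    ("SN-C", "control", 10),
    ("GN-E", "experimental", 15),
    ("GN-C", "control", 10),
    ("NP-E", "experimental", 10),
]

# Sum the group sizes within each condition.
totals = {}
for _label, condition, n in groups:
    totals[condition] = totals.get(condition, 0) + n

total_n = sum(n for _label, _condition, n in groups)

print(totals)
print("total Ss:", total_n)
```

Note that the NP staff group has no matching control group, which is why the design is unbalanced (40 experimental against 20 control Ss).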


RESULTS 


The mean durations of Ss’ utterances for all 
groups over the three periods are shown in 
Table 1. Inspection of the table shows a gen- 
eral trend of all experimental groups to de- 
crease in mean Action during Period II and 
to recover in Period III. These data were sub- 
jected to an analysis of variance, using an 
unweighted means analysis to take into ac- 
count unequal frequencies in the subclasses. 
The summary of the analysis is presented in 
Table 2. As indicated by the nonsignificant 
F between experimental and control groups 


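TABLE 1

PERIOD MEANS AND STANDARD DEVIATIONS
FOR Ss’ ACTIONS a

                               Period
Group                I             II            III            F
NP-E      Mean   [illegible]   [illegible]   [illegible]     4.50
(N = 10)  SD     [illegible]   [illegible]   [illegible]
GN-E      Mean   44            34            59             22.38
(N = 15)  SD     [illegible]   [illegible]   [illegible]
SN-E      Mean   39            [illegible]   [illegible]     6.70
(N = 15)  SD     [illegible]   [illegible]   [illegible]
GN-C      Mean   [illegible]   [illegible]   [illegible]     0.07
(N = 10)  SD     [illegible]   [illegible]   [illegible]
SN-C      Mean   [illegible]   [illegible]   [illegible]     0.95
(N = 10)  SD     [illegible]   [illegible]   [illegible]

a Measured in seconds.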


TABLE 2

ANALYSIS OF VARIANCE OF MEAN ACTIONS

Source                           df          MS         F
Between Ss                       59
  Exper. vs. Control (A) a        1      152.01      0.58
  Student vs. Staff (B)           1       70.25      0.27
  A X B                           1    1,245.24      4.75*
  GN-E vs. NP-E                   1      174.03      0.66
  Ss within groups               55      261.96
Within Ss                       120
  Periods (P)                     2      258.14      9.78**
  A X P b                         2      279.02     10.57**
  B X P                           2       44.65      1.69
  A X B X P                       2       39.42      1.49
  (GN-E vs. NP-E) X P             2       84.63      3.21*
  Ss in groups/periods    [illegible]  [illegible]
Total                           179

a The partial breakdown of the between-groups sum of
squares, with 4 df, compares SN-E and GN-E vs. SN-C and
GN-C (A); SN-E and SN-C vs. GN-E and GN-C (B); the AB
interaction; and an a priori comparison of GN-E vs. NP-E
which is nonorthogonal to the first three.

b The group X period interaction, with 8 df, was broken
down into interactions derived from the student and general
staff groups, with (GN-E vs. NP-E) X P representing a
separate comparison of special interest to our hypotheses.

* p < .05.
** p < .01.


(A), the duration of utterances for the total 
interview did not differ for the groups. How- 
ever, mean Action differed significantly over 
the three periods (F = 9.78). In addition, pe- 
riods interacted significantly with the treat- 
ment variable A. Therefore, tests for the sim- 
ple effects of periods for each group were run 
(see Table 1). They show that the significant 
treatment X periods interaction (F = 10.57) 
is due to the effects of the experimental treat- 
ment. The experimental groups show signifi- 
cant changes over periods (for NP-E, F = 
4.50; for GN-E, F = 22.38; and for SN-E, 
F = 6.70), while the control groups show no 
change in mean Action over the periods (for 
GN-C, F = 0.07; for SN-C, F = 0.95). 
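
The F ratios cited above can be checked against the mean squares in Table 2. The following is a minimal sketch, assuming each within-Ss F is the effect mean square divided by a single error mean square (the "Ss in groups/periods" term, whose value did not survive in the table as printed here). The dictionary name and helper function are illustrative only and are not part of the original analysis.

```python
# Mean squares and F ratios as reported for the within-Ss effects in Table 2.
within_effects = {
    "Periods (P)": (258.14, 9.78),
    "A X P": (279.02, 10.57),
    "B X P": (44.65, 1.69),
    "A X B X P": (39.42, 1.49),
    "(GN-E vs. NP-E) X P": (84.63, 3.21),
}

def implied_error_ms(effects):
    """Back out the error mean square implied by each (MS, F) pair as MS / F."""
    return {name: ms / f for name, (ms, f) in effects.items()}

implied = implied_error_ms(within_effects)
for name, value in implied.items():
    print(f"{name:22s} implied error MS = {value:.2f}")

# Between-Ss effects can be checked directly, since their error term
# ("Ss within groups", MS = 261.96) is reported.
between_error_ms = 261.96
a_by_b_f = round(1245.24 / between_error_ms, 2)
print("A X B: F =", a_by_b_f)
```

All five within-Ss effects imply nearly the same error mean square, which suggests that the reported F values are internally consistent with a common error term.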

The effect of interpretations during Period
II is seen by a decrease in the average length
of utterances immediately following the
interpretive responses. However, when the
interpretations are discontinued in Period III,
the mean duration of utterances increases
considerably. The pattern of decreased Action in
Period II and increased Action in Period III
is shown by 31 of the 40 experimental Ss and
by 4 out of 20 control Ss. A chi square test
for these frequency distributions is significant
beyond the .01 level. In this “aftereffect” of
the interpretation phase, the psychiatrically
less sophisticated Ss in Group GN-E differ
significantly from the NP-E group (F = 3.21)
in increasing the duration of their
verbalizations beyond their mean for Period I (see
Table 1). The student nurses similarly
exceed their mean Period I Actions subsequent
to interpretations. The NP-E group, however,
returns to their Period I mean.
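
The chi square test on the 31-of-40 versus 4-of-20 frequencies can be sketched as follows. The source reports neither the chi square value nor whether a continuity correction was applied, so this sketch computes the statistic both ways; the function name and the `yates` flag are illustrative assumptions. The critical value 6.635 is the standard .01 point of chi square with 1 df.

```python
def chi_square_2x2(a, b, c, d, yates=False):
    """Pearson chi square for the 2 x 2 table [[a, b], [c, d]].

    With yates=True, Yates' continuity correction is applied.
    """
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        diff = max(diff - n / 2, 0)
    return n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

# Rows: experimental Ss, control Ss.
# Columns: showed the decrease-then-increase pattern, did not show it.
uncorrected = chi_square_2x2(31, 9, 4, 16)
corrected = chi_square_2x2(31, 9, 4, 16, yates=True)

CRITICAL_01_DF1 = 6.635  # chi square critical value, p = .01, 1 df
print(f"uncorrected chi square = {uncorrected:.2f}")
print(f"Yates-corrected chi square = {corrected:.2f}")
print("beyond .01 level:", uncorrected > CRITICAL_01_DF1 and corrected > CRITICAL_01_DF1)
```

Under either computation the statistic exceeds the .01 critical value, in line with the significance level reported in the text.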

The comparisons between student and staff
nurses yield a significant F ratio when the
interaction between treatment (A) and age (B)
[...] comparison of SN-E vs. GN-E, but a
comparison of Groups SN-C and GN-C showed no
signifi[...] ore, the differences in [...]
Period III than did the GN-E group.

In order to further evaluate these findings,
two possible sources of bias were examined.
[...] he present study is [...] be separated
into a [...] describe the independent variable.
Fifty-five of E’s statements were [...] present
all Ss and all [...] earlier. Judges’ agr[...]
with E’s [...] signaled [...] ment.

On 48 out of the 55 items there was 100%
agreement among the 12 judges in ordering the
statements as they were used and intended by
E. At least 10 of the 12 judges agreed with
E’s usage on five more of the statements. On
the two remaining statements, 6 judges agreed
with E’s use of the statement as representative
of an interpretation or noninterpretation.
Thus, there was unanimous agreement by the
judges with E’s usage on 87.3% of all sampled
statements while only 3.6% of the sampled
statements presented any serious difficulties
in categorizing them. These results
indicate that interpretive statements (the
independent variable) and the control statements
are clearly discriminable as separate
classes and were applied in accordance with
the prescribed procedure.
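
The agreement percentages follow directly from the counts given above. The short sketch below only reproduces that arithmetic; the variable and function names are illustrative and not taken from the original report.

```python
# Counts reported for the 55 sampled statements rated by 12 judges.
TOTAL_STATEMENTS = 55
unanimous = 48        # all 12 judges agreed with E's usage
near_unanimous = 5    # at least 10 of the 12 judges agreed
split = 2             # only 6 of the 12 judges agreed

def percent(count, total=TOTAL_STATEMENTS):
    """Share of the sampled statements, as a percentage rounded to one decimal."""
    return round(100 * count / total, 1)

# The three categories account for every sampled statement.
assert unanimous + near_unanimous + split == TOTAL_STATEMENTS

print("unanimous agreement:", percent(unanimous), "%")
print("serious difficulty:", percent(split), "%")
```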
A second assumption is that E, himself, did
indeed conform to the rules governing his
utterances in the standardized interview and
that his behavior in the experimental and
control interviews was therefore essentially
identical except for the presence or absence
of the independent variable, the interpretations
in Period II. To ascertain whether this
assumption had been met, several analyses
were performed. First, to test the assumption
that the durations of the three periods and
of the total interview were as prescribed (15’,
10’, 10’, and 35’, respectively), their mean
durations were compared for all groups. The
ranges of mean duration were: Period I, [illegible]
to 16.09 minutes; Period II, 10.10 to 12.05
minutes; Period III, 9.20 to 10.47 minutes;
and the length of the total interview, 35.76
to 37.85 minutes. The very small differences
in period duration were randomly distributed
over the groups. These results therefore
indicate that the prescribed control of period
length by E had been fulfilled.
Another primary aspect of the standardizing
rules has to do with duration of E’s
utterances (E’s Action), i.e., that E’s
utterances average approximately 5 seconds in
length. The duration of individual utterances
by E was obtained and averaged separately
for each period for each S. All of E’s Actions
in Period II were utilized, while 10 or more
utterances in Periods I and III were
randomly sampled for each S, after this sampling
had been demonstrated to be a highly reliable
and valid measure of the durations of all
utterances within these periods. The means
and SDs of E’s Actions are shown in Table 3.
F tests were then computed for mean E’s
Action across the three periods, for each group.


TABLE 3

E’s ACTION: PERIOD MEANS a AND
STANDARD DEVIATIONS

                            Period
Group                I             II            III
NP-E      Mean     5.46          6.98          5.30
(N = 10)  SD        .82           .71           .78
GN-E      Mean     6.53          8.52          6.73
(N = 15)  SD        .93          1.56          1.18
SN-E      Mean     5.21      [illegible]       5.14
(N = 15)  SD    [illegible]      1.35           .75
GN-C      Mean     6.25          7.19          7.89
(N = 10)  SD        .70      [illegible]   [illegible]
SN-C      Mean     6.38          6.90          6.71
(N = 10)  SD    [illegible]       .88           .88

a Measured in seconds.


While the values for F indicate significant
differences (p < .01) in mean E’s Action
between periods for all groups except SN-C, the
absolute difference between successive period
means was never greater than 2.00 seconds,
while 4 of the 10 differences were less than
1 second. The very small magnitude of E’s
variation in length of utterance thus
suggested that the statistically significant
differences between periods could not be
responsible for any differences in S’s Action between
periods. As can be seen in Table 1, the
differences between means obtained for S’s Action
were always at least five times as great as
those for E and usually this factor was much
larger.
However, in view of the significant values
for F for E’s Action across periods, two
correlational analyses were carried out to
investigate whether these significant but small
variations in E’s Action had had any [...]
effects on S’s Action. Rho’s were computed for
all groups between each S’s mean Action and
E’s concurrent mean Action for each period.
Means for E and S in Period II were
separated into two parts, interpretations and
noninterpretations. Out of the 18 rho’s computed,
only 3 were significant. For SN-E, the rho for
Period II (noninterpretations) was [illegible],
p < .05; and for Period III, rho was [illegible].
The other significant correlation, for Period II
(interpretations) in the NP-E group, was
-.84, p < .01. Examination of SDs, sample
size of Ss, their number of utterances per
period, and other data did not help in
interpreting these quite inconsistent findings. The
absence of any correlation for 15 out of 18
comparisons suggested that there was no
systematic relationship between E’s Actions and
S’s Actions because E’s Actions were
relatively constant. This conclusion was supported
when correlations were obtained individually
for each of the 25 Ss in the groups for whom
the rho’s above were significant (SN-E and
NP-E). That is, rho’s were computed using
each single utterance by E and the following
single utterance by S as the scores. None of
these 25 correlations was significant.

Additional evidence supporting the conclu- 
sion that the minor variation in E’s Action, 
while undesirable, was not a contaminating 
factor in the experimental results, is the fact 
that previous studies have shown a positive 
correlation between the Actions of two inter- 
acting individuals when their Actions are left 
free to vary (Saslow & Matarazzo, 1959). 
Therefore, if any bias due to lack of con- 
stancy in E’s Actions had occurred in the 
present study, it would be expected to 
lengthen S’s utterance after E’s longer inter- 
pretations, whereas in fact, the opposite 
changes were found in Ss’ Actions in Period 
II. 

Discussion 


The results of the present study clearly in- 
dicate that interpretations affect the temporal 
duration of S’s utterances, both immediately 
and subsequently in a standardized interview 
which focused on personal material. There- 
fore, the findings suggest that interpretations 
are a class of interviewer responses which
have a distinct consequence on the inter- 
viewee’s behavior and thereby modify the 
subsequent interaction. The data thus sup- 
port the hypothesis that a class of interpretive 
responses leads to differing interactional be- 
havior on the part of S than exploratory, in- 
formation seeking questions. 

In extending the gross relationships sug- 
gested by the present findings, it would be 
appropriate to ask further questions concern- 
ing the specific properties in which members 
of the general class of interpretive responses 
differ. For example, can more highly
differentiated effects on the immediate and
subsequent temporal interaction pattern of S be
attained by systematic variation of the loca- 
tion of interpretive statements on a dimen- 
sion of “depth,” appropriate timing, theoreti- 
cal origin, etc.? That any member of a class 
of interpretations contributes toward modifi- 
cation of the temporal interaction pattern of 
an interviewee by no means reduces the likeli- 
hood of additional effects attributable to fur- 
ther subdimensions within the broad class of 
interpretive statements.

When making interpretations in psychotherapy
there is usually a shift in the therapist’s
role from permissive listener to a person
who evaluates and “knows” the patient and
corrects his self-view by presenting for
consideration alternate explanations. Thus, at
least those interpretations which [...]
unconscious motive [...] punishing stimuli [...]
also imply the aversive [...] to more brief
responses by [...] “yes” or “no” [...]
mediate response [...] ment, then Ss in the
experimental groups during Period II might be
[...] much larger range [...] from their mean,
since [...] noninterpretive state[...]
Period II. None of the groups showed such
increased variability or range. In fact, the
distributions for Period II, including both
interpretive and noninterpretive statements, were
somewhat narrower in the experimental groups
than in their appropriate control groups.
The specific content and dynamic importance
of an interpretation probably also has
some long term effects which are neither
apparent nor measurable during the period
immediately succeeding it. In our postexperimental
questioning of Ss, almost all of those
in the experimental groups mentioned an
interpretation when asked to recall that
statement by E which had stood out most in the
interview. By contrast, the control Ss did not
recall any specific questions which the
interviewer had asked. Ss in the experimental
groups tended to discuss again the interpretation
itself and their attitudes about it,
indicating that E’s interpretive comments had
stimulated self-exploration beyond the limits
of the interview itself. Further research would
be needed to evaluate the effects of these
interpretations on subsequent sessions.
The present study, like most current
research on clinical phenomena, is an attempt
at defining the effect of one group of
variables by means of measuring general
changes which themselves have not yet been
fully correlated with central personality
characteristics. In this sense it represents an
additional one in a series of studies aimed at
exploring a standardized interview and
investigating its utility for the examination of
clinical hypotheses by means of objective
recording of two-person interactions. The
results obtained here add to the evidence that
the temporal interaction measures have
validity not only with respect to a variety of
temporal stimulus variables which modify S’s
temporal behavior pattern (Saslow &
Matarazzo, 1959), to organismic variables (R. G.
Matarazzo et al., 1958), and to some
aspects of verbal content (Phillips, 1957), but
also with respect to the functional [...]
of verbal stimulus variables. That [...]
variables appear sensitive to changes in the
interpersonal role of E’s utterances [...]
temporal patterns are under [...]
control. Further, the finding that the younger
student nurses responded to interpretations
[...] shorter utterances may be thought of as
consistent with the earlier finding (R. G.
Matarazzo et al., 1958) that younger persons
seem to respond to potentially aversive
stimuli of the standardized interview (e.g.,
[...]) with reduced verbal output. It
[...] then, that there is a partially
[...] response in the younger, less ex[...]
or less “poised” person to con[...]
changes in either the functional con[...],
or certain time-unit characteristics of
E’s behavior, even though E’s behavior would
[...] be more assertive when he makes
interpretations and more passive when he fails
to respond and remains silent.
A further implication of the present results
is worthy of mention. This concerns the broad
question of the effect of the content of the
interview on interviewee interaction behavior.
[...] Chapple made the interesting suggestion
that the actual content discussed by a
person would affect the other person’s reaction
to him much less than would the timing
of the first person’s interactions (Chapple &
Arensberg, 1940, p. 33). The results of the
present study, while not dealing specifically
[...] upon the content of the interview,
but, in fact, with changes in the
interviewer’s gross content (i.e., from information
getting content to interpretive content),
[...] at least, that interviewer induced
[...] content do produce a demonstrable
change in the interaction behavior of the
interviewee. We are currently designing a study
[...] predefined and different content
will be discussed with a given S, thus
providing a more direct test of Chapple’s
suggestion that interviewee content differences
[...] correlate only minimally with interviewee
interaction patterns.

This study investigated the hypothesis that
interpretations as a category of interviewer
[...] differ from exploratory or
information seeking statements both in their immediate
and their subsequent effects on the duration of
interviewees’ utterances, regardless of
their specific content or their role in the
interview strategy, at least in standardized
experimental interviews.

Sixty female nurses were interviewed by
means of a standardized interview which
limits E’s verbal and gestural participation in
the interaction to 5-second durations. The
interview consisted of three periods. During
Periods I and III E made only exploratory
or information seeking statements for all Ss.
During Period II E gave 12 interpretations 
to Ss in the experimental groups, while con- 
tinuing exploratory statements for the con- 
trol groups. The procedure was replicated 
with student nurses, medical nurses, and psy- 
chiatric nurses since age and experience were 
expected to affect S’s response to the inter- 
pretations. All interactions were recorded by 
the Interaction Chronograph and the sessions 
were tape recorded. 

The results showed significantly lower mean 
durations of utterances for the experimental 
groups during Period II (the interpretation 
period). The control groups showed no change 
in mean duration of utterances over the three 
periods. Furthermore, a significantly higher 
mean duration of verbalizations was found in 
Period III in these groups which had been 
given interpretations in the preceding period. 
Student nurses did not differ from older staff 
nurses in the control groups, but yielded a 
significantly lower verbal output when inter- 
pretations were given. 

The results showed that interpretations as a 
class of interviewer techniques have a dis- 
tinct effect on interviewee’s interactional pat- 
terns in the standardized interviews used in 
this study, with normal Ss. Interpretations 
tend to shorten the duration of utterances 
immediately following them. In addition, after 
interpretations cease, interviewees tend to in- 
crease the duration of their verbal output in 
comparison with Ss who were not given any 
interpretations at all. The interviewees’ age, 
experience, or both tend to further modify
their reactions to interpretations.


DEVELOPMENTAL ASPECTS OF IMPULSE CONTROL 1


NATHANIEL N. WAGNER 
Astor Home for Children, Rhinebeck, New York 


The concepts impulsivity and impulse control
appear frequently in the vocabulary of
the clinician who works with children. Such
writers as Redl and Wineman (1957) attach
extreme importance to the “control system”
in the understanding and treatment of
emotional disorders in children. Aside from
Goldfarb’s (1949) and Colvin’s (1958) work on
the effects of institutionalization upon impulse
control, this writer has been unable to find
any experimental investigation on the
development of impulse control. This condition
is all the more remarkable in view of
the long history of theoretical interest in
impulsivity (Freud, 1938; Rapaport, 1951;
Schilder, 1951) and the many studies on
the relation of Rorschach movement and other
measures of fantasy to motor inhibition
(Meltzoff, Singer, & Korchin, 1953; Singer
& Herman, 1954; Singer, Wilensky, &
McCraven, 1956).

Colvin (1958) studied the relation between
parental deprivation, in the form of early
and prolonged child care institutionalization,
and various measures of ego and social
development. His results indicated a significant
relationship between the extent of
institutionalization and impulse control with the lack of
opportunity for parental identifications seen
as a major factor in the low impulse control
of the institutionalized child. The present
investigation focuses upon other characteristics
of the child’s interpersonal experience, i.e.,
the effects of specific aspects of the nature
and quality of the parent-child interaction
upon the development of impulse control.

1 Presented at the APA meetings at Cincinnati,
September 7, 1959, in a symposium, Manifestations
and Measures of Impulse Control and Its Behavioral
Concomitants.
The author wishes to thank Ralph W. Colvin for
his assistance in all phases of this study.


PROCEDURE 


The 36 subjects were the entire population of the 
Astor Home for Children, a residential treatment and 
research center for emotionally disturbed boys. Ages 
ranged from 7 years 1 month to 15 years 3 months, 
with a mean of 11 years; WISC IQs from 68 to 119, 
with a mean of 97; diagnoses from schizophrenia to 
primary behavior disorder; and symptoms from se- 
vere withdrawal to explosive acting out. 

To secure a measure of impulse control, behavior 
rankings were made independently by five profes- 
sional staff members, all of whom could rank the en- 
tire population. Each judge was instructed to rank 
the 36 students on impulse control which was de- 
fined as: “ability to delay immediate satisfactions for 
long range goals ..., a capacity to restrain ac- 
tion until the formulation of a plan” (Colvin, 1958). 
Intercorrelations among judges ranged from .77 to 
.92, with a coefficient of concordance of .84, which 
is significant beyond the .001 level. Therefore, the 
ranking of the sum of the ranks was used as the 
“best” ranking (Walker & Lev, 1953). 
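
The agreement statistics in this paragraph can be illustrated with a short sketch. It computes Kendall's coefficient of concordance W for complete rankings without ties, orders children by the sum of their ranks, and applies the standard identity relating W to the average pairwise Spearman rho, mean rho = (mW - 1) / (m - 1). The example rankings are hypothetical (three judges, five children) and are not data from the study; only W = .84 and m = 5 judges come from the text.

```python
def kendalls_w(rankings):
    """Kendall's W for m complete rankings of the same n items (no ties)."""
    m = len(rankings)
    n = len(rankings[0])
    rank_sums = [sum(judge[i] for judge in rankings) for i in range(n)]
    mean_sum = m * (n + 1) / 2
    s = sum((r - mean_sum) ** 2 for r in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))

def ranking_of_rank_sums(rankings):
    """Order item indices by the sum of their ranks (the pooled 'best' ranking)."""
    n = len(rankings[0])
    sums = [sum(judge[i] for judge in rankings) for i in range(n)]
    return sorted(range(n), key=lambda i: sums[i])

def mean_pairwise_rho(w, m):
    """Average Spearman rho over all pairs of m judges implied by W."""
    return (m * w - 1) / (m - 1)

# Hypothetical rankings: rows are judges, columns are children.
example = [
    [1, 2, 3, 4, 5],
    [2, 1, 3, 4, 5],
    [1, 3, 2, 5, 4],
]
print("W for hypothetical judges:", round(kendalls_w(example), 3))
print("pooled ordering of children:", ranking_of_rank_sums(example))

# Values reported in the text: W = .84 with five judges.
print("implied mean rho:", round(mean_pairwise_rho(0.84, 5), 2))
```

The implied average intercorrelation of about .80 falls inside the reported range of .77 to .92, so the concordance coefficient and the pairwise correlations are mutually consistent.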

These exceptionally high intercorrelations certainly, 
to some extent, reflect the high level of communica- 
tion between staff members, which is an integral part 
of residential treatment. However, all rankers were 
instructed not to communicate with each other con- 
cerning these specific impulse control rankings. It is 
this writer’s impression that all five staff members 
fulfilled this condition. None the less, the probability 
of earlier staff meetings and other types of communi- 
cations influencing the level of ranker agreement can- 
not be ruled out. 

The Bloch and Behrens Multiproblem Family In- 
dex (Bloch, Behrens, Guttenberg, King, & Tendler, 
1959) was selected as the measure of parent-child 
interaction to be related to the development of im- 
pulse control. The following six areas comprise the 
index: 2

1. Marriage: Either parent abandoned child as 
infant or after one year; there was a marital his- 
tory of separation and reunion; the marriage was 
broken (other than by death). 

2. Parent-child relationship: Either parent was 
abusive toward child, beat him, over-disciplined 
or was punitive; the atmosphere in the home was 
grossly psycho-pathogenic. 


2 Quoted by permission of the authors. 




3. Social pathology: Diagnosis of specific social 
pathology in any family member, such as alcohol- 
ism, drug addiction; either parent or a sibling was 
in a penal institution at time of child’s application 
to a residential treatment center.

4. Physical illness: Either parent had a chronic 
physical illness with or without some incapacity;
either parent had a serious physical illness which 
interfered with his parental and family functioning. 

5. Mental illness: Either parent or a sibling was 
hospitalized for mental illness; either parent, sib- 
ling, or other family member had a psychiatric 
diagnosis of mental illness (whether psychotic or 
non-psychotic) or mental deficiency. 

6. Financial support: Family was either wholly
or partly dependent on public assistance funds at
time of child’s application to a residential
treatment center or in the past; family housing was
grossly inadequate.


The incidence of these areas of family pathology 
was computed for each child through an interview 
with two professional child care workers thoroughly 
familiar with the children’s case histories. Where 
there was disagreement, [...] areas and was 97%
for the total [...]. The scorings on the [...]
Multiproblem Family Index. Also, the staff members
who acted as informants on the index were not 
aware that it would be related to impulse control. 
The population was divided into high and low im- 
pulse control groups through a division at the me- 
dian of the rankings with 18 boys in each group. 


Family pathology, as measured by the Multiproblem
Family Index, was then related to the impulse
control level of the subjects by means of chi square tests
trol level of the subjects by means of chi square tests 
of the relationships between: (a) high or low impulse 
control and the absence or presence of each area of 
pathology, (b) high or low impulse control and the 
low (0-3) or high (4-6) incidences of the six areas 
of family pathology. 


RESULTS 


Table 1 presents the two-by-two contingency
tables relating impulse control with
each of the areas of the Multiproblem Family
Index. All of the areas, with the exception of
physical illness, are significantly related to
impulse control in that their presence is
associated with low impulse control, their
absence with high impulse control.
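
The six chi square values in Table 1 can be recomputed from the cell counts. This is a sketch under stated assumptions: the statistic is taken to be the uncorrected Pearson chi square for a two-by-two table, and two cells that are unreadable in the table as printed here (Marriage, low group, area absent; Physical Illness, high group, area present) are filled in from the fixed group size of 18. The dictionary and function names are illustrative.

```python
def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

# (high absent, high present, low absent, low present) for each area.
# Each group has 18 boys; cells marked "derived" are 18 minus the legible cell.
areas = {
    "Marriage": (8, 10, 2, 16),             # low absent derived: 18 - 16
    "Parent-Child Relationship": (13, 5, 4, 14),
    "Social Pathology": (16, 2, 6, 12),
    "Physical Illness": (14, 4, 9, 9),      # high present derived: 18 - 14
    "Mental Illness": (16, 2, 9, 9),
    "Financial Support": (12, 6, 4, 14),
}

results = {name: round(chi_square_2x2(*cells), 1) for name, cells in areas.items()}
for name, value in results.items():
    print(f"{name:28s} chi square = {value}")
```

Under these assumptions the recomputed values agree with the chi squares printed in Table 1, which supports reading the statistic as the uncorrected form.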

Table 2 presents the relationship of high 
and low impulse control and the extent to 
which the subjects were exposed to the vari- 
ous areas of family pathology. It indicates 
that children with high impulse control tend 
to come from families having no or few prob- 
lem areas present, whereas the low impulse 
control group tends to come from families 


TABLE 1

COMPARISON OF 18 HIGH AND 18 LOW IMPULSE CONTROL SUBJECTS ON EACH
AREA OF THE MULTIPROBLEM FAMILY INDEX

Multiproblem                 Area     Area      Impulse    Chi
Family Index Areas           Absent   Present   Control    Square    p
1. Marriage                     8       10      High         5.0    .05
                                2       16      Low
2. Parent-Child                13        5      High         9.0    .01
   Relationship                 4       14      Low
3. Social Pathology            16        2      High        11.7    .001
                                6       12      Low
4. Physical Illness            14        4      High         3.0    ns
                                9        9      Low
5. Mental Illness              16        2      High         6.4    .02
                                9        9      Low
6. Financial Support           12        6      High         7.2    .01
                                4       14      Low


TABLE 2

INCIDENCES OF MULTIPROBLEM FAMILY INDEX AREAS
FOR 18 HIGH AND 18 LOW IMPULSE
CONTROL SUBJECTS

                              Incidence of Multiproblem
                              Family Index Areas
Number of Areas Scored         0   1   2   3   4   5   6
High Impulse Control (18)      6   4   3   2   2   1   0
Low Impulse Control (18)       0   1   1   2   6   7   1
Number of Children (36)        6   5   4   4   8   8   1


evidencing four or five dimensions of family 
pathology. 

In Table 2, when boys of high and low impulse
control are compared on the basis of low
family pathology (0-3 areas) and high family
pathology (4-6 areas), a chi square of 13.5
results. This division of the subjects on the
total Multiproblem Family Index is significantly
related to impulse control beyond the
.001 level. Here, few family problem areas are
associated with high impulse control, many
family problem areas with low impulse control.
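
The chi square of 13.5 can be reproduced by collapsing the Table 2 distributions at the 0-3 versus 4-6 split. The sketch below assumes an uncorrected Pearson chi square; the helper names are illustrative and the counts are those given in Table 2.

```python
def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi square for the 2 x 2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))

def split_low_high(counts, cutoff=3):
    """Collapse counts indexed by number of areas scored (0..6) into
    (low pathology: 0..cutoff, high pathology: cutoff+1..6)."""
    return sum(counts[: cutoff + 1]), sum(counts[cutoff + 1:])

# Number of boys scoring 0, 1, ..., 6 problem areas (Table 2).
high_control = [6, 4, 3, 2, 2, 1, 0]
low_control = [0, 1, 1, 2, 6, 7, 1]

high_low_path, high_high_path = split_low_high(high_control)
low_low_path, low_high_path = split_low_high(low_control)

chi_sq = chi_square_2x2(high_low_path, high_high_path, low_low_path, low_high_path)
print("high impulse control:", high_low_path, "low pathology,", high_high_path, "high pathology")
print("low impulse control: ", low_low_path, "low pathology,", low_high_path, "high pathology")
print("chi square =", round(chi_sq, 1))
```

The collapsed table has 15 and 3 boys in the high impulse control group and 4 and 14 in the low impulse control group, which is the source of the "three boys" and "four boys" discussed as exceptions.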

The relationship between the extent of
family pathology, as measured by the
Multiproblem Family Index, and the child’s level
of impulse control, is a rather striking one.
These results become all the more revealing
through a study of the individual cases. As
inspection of Table 2 reveals, the six families
evidencing none of the six areas of family
pathology all had boys who were in the high
impulse control group. On the other hand, the
one family that scored on all the areas of
family pathology had a boy in the low
impulse control group. Significantly, this boy
was ranked thirty-sixth on impulse control in
a population of 36.

Analysis of those cases in Table 2 that did
not adhere to the general direction is also
quite revealing. One of the three boys with
high impulse control, despite coming from a
multiproblem family, was by far the oldest
subject, 15 years 3 months. His high impulse
control may reflect the significant relationship
between impulse control and age (rho =
.39, significant at the .05 level). Of the four
low impulse control boys who came from
families with few problem areas, one is an
autistic schizophrenic, two have findings of
organic brain pathology, and the last is an
enigma not only in the present research but
to the clinical staff of the Astor Home as well.
Certainly in the cases of the boys with low
impulse control concomitant with brain damage,
one might expect low impulse control
despite the lack of family pathology. This also
might be true in the case of the autistic child.


Discussion 


Colvin’s (1958) research on the effects of 
institutionalization upon the development of 
impulse control suggests that lack of oppor- 
tunity for parental identifications may be a 
major factor in the inadequate development 
of impulse control. The present research sug- 
gests that contact, and probably identification, 
with pathological parental figures also hinders 
the development of impulse control. 

The Multiproblem Family Index can be 
viewed as a measure of the parents’ control 
of their own impulses. That is, in the context 
of family living, impulse control may well be 
reflected by the ability to sustain a marriage, 
refrain from impulsively striking a child, stay 
within the boundaries of acceptable social be- 
havior, and manage family financial affairs so 
as not to need public assistance. Although this 
interpretation might be less appropriate for 
mental illness in the family, even here inade- 
quate control of primitive instincts and im- 
pulses is frequently viewed as central to the 
problem of mental illness. However, physical 
illness in the family certainly cannot be in- 
terpreted as evidence of parental impulsivity. 
This is consistent with the finding that family 
illness was the only problem area not related 
to the impulse control level of the subjects. 

This seemingly close association between 
the impulse control level of children and their 
parents is consistent with Redl and Wine- 
man’s (1957) observations of The Aggressive 
Child. They note that the parents of these 
children present a picture of impulsivity no 
less striking than that of their children: 


Seldom do we see children who have been so grossly 
and continuously exposed to traumatization on so 
many different levels. . . . With them benign experi- 
ence is the exception, trauma the rule. From the 
things we know about them, it is this that emerges
as the most powerful theme in their prior experience
(pp. 54-55). 
SUMMARY 


This study investigated the relationship of 
family pathology, as measured by Bloch and 
Behrens Multiproblem Family Index, and the 
level of impulse control in 36 children. The 
presence of family pathology was found to 
be significantly associated with low impulse 
control, the absence of family pathology with 
high impulse control. It was suggested that 
contact, and probably identification, with 
pathological parental figures hinders the
development of impulse control.


EXAMINER REINFORCEMENT AND SITUATIONAL
VARIABLES IN A PROJECTIVE TESTING
SITUATION 1


LAWRENCE SIMKINS 2
University of Houston 


A clinical area which lends itself to the
application of operant conditioning principles
is the projective testing situation.
Psychologists readily agree that there are wide
individual differences in the behavior of examiners
in a testing situation. A number of
investigations (Alden & Benton, 1951; Baughman,
1951; Fahmy, 1953; Gibby, 1952; Lord,
1950; Sanders & Cleveland, 1953) have
already shown that in many instances the
responses given by a subject (S) are dependent
upon his subjective definition of the test,
personality characteristics and attitudes of the
experimenter (E), and the conditions under
which the test is administered. It is quite
possible also that in the testing situation
responses emitted inadvertently by the E are
serving as reinforcers or punishment for some
classes of the Ss’ responses. Recent studies by
Fahmy (1953), Wickes (1956), and Gross
(1959) showed that it was possible to increase
the frequency of human movement responses
given by Ss in an inkblot testing situation
through the application of reinforcement
techniques.

This study was designed to answer the fol-
lowing questions: (a) What dimensions of a
particular projective technique are most sus-
ceptible to examiner reinforcement, that is,
content, location, or determinants? (b) What
is the effect of obvious or strong approval as
opposed to subtle or weak approval? Is weak
approval any more effective than no approval
at all? (c) What is the effect of situational
variables in a projective testing situation?
Thus what effect will reinforcement have in
situations that are perceived as ego involving
as contrasted to more relatively neutral con-
ditions?

* This paper is derived from a study done by the
author in partial fulfillment of the requirements for
a PhD degree at the University of [illegible]. The
author wishes to express his gratitude to the members
of the doctoral committee for their help and encour-
agement in this work.

2 Now at Florida State University.

METHOD 


Study I 


This study was designed to answer the first two 
questions. The third question required some modifica- 
tion in design and will be referred to as Study II. 

The experimental situation involved the adminis- 
tration of an inkblot test devised by Holtzman 
(1958). The Holtzman cards were administered at 
each session. An immediate inquiry was conducted 
after each response. The Ss were college students 
selected from introductory psychology classes at the 
University of Houston and student nurses at the Uni- 
versity of Texas Medical Branch. There was no par- 
ticular selection procedure other than having approx- 
imately the same number of males and females due to 
the possibility of sex differences. 

A group of Ss (N = 25) designated as the control 
group was administered 75 Holtzman cards over five 
testing sessions. Another group of Ss designated as 
the experimental group (N = 36) was administered 
15 of the Holtzman cards in the first session. At the 
end of the first test session the experimental Ss’ re- 
sponses were scored and tabulated. There were three 
main dimensions considered in this research: content, 
location, and determinants. At this point the experi- 
mental Ss’ protocols were analyzed to see if there 
were any other Ss who had given the same number of 
responses in this category. If there was such an S, 
both of these Ss were then matched and considered 
an experimental pair. 

This pair of Ss was in turn matched with Ss from 
the general control group who had also given the 
same frequency of responses in this particular cat- 
egory in their first session. The Ss selected from the 
general control pool for matching purposes were re- 
ferred to as the matched control group. 
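
The matching step described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the author's procedure as actually carried out: the data layout (dictionaries of first-session counts), the subject labels, and the function name `match_subjects` are all hypothetical, and the sketch assumes that a match means an identical first-session frequency in the chosen category.

```python
from collections import defaultdict


def match_subjects(experimental, controls):
    """Pair experimental Ss with equal first-session counts, then attach controls.

    experimental, controls: dicts mapping a subject label to the number of
    first-session responses in the chosen category (hypothetical layout).
    Returns a list of (experimental_pair, matched_controls) tuples.
    """
    by_count = defaultdict(list)
    for label, count in sorted(experimental.items()):
        by_count[count].append(label)

    matches = []
    for count, labels in sorted(by_count.items()):
        # Two experimental Ss with the same count form an experimental pair.
        for i in range(0, len(labels) - 1, 2):
            pair = (labels[i], labels[i + 1])
            # Controls with the same first-session count form the matched group.
            matched = sorted(c for c, n in controls.items() if n == count)
            if matched:
                matches.append((pair, matched))
    return matches


# Hypothetical first-session frequencies, for illustration only.
experimental = {"E1": 6, "E2": 6, "E3": 8}
controls = {"C1": 6, "C2": 5, "C3": 6}
pairs = match_subjects(experimental, controls)
```

In this toy data only E1 and E2 share a frequency, so they form the single experimental pair, and C1 and C3 serve as their matched controls; E3 has no partner and is left unmatched.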

Procedure. Prior to the first testing session all 
experimental and control Ss were given the following 


instructions: 
You may have heard about the Rorschach inkblot 
test. It is a test used by clinical Psychologists as a 
method for evaluating personality and the diagnosis 
of abnormal behavior. Unfortunately, many recent 
investigations conducted on the Rorschach have 
shown that it is not as effective as we would like 
it to be. Dr. Holtzman of the University of Texas 
has developed a new set of inkblots designed in 
such a manner that it is hoped will eliminate some 
of the weaknesses of the Rorschach test. Because 
of the comparative newness of this test we would 
like to get an idea of the types of things seen by 
the normal individual. At this point we are not 
interested in personality factors because we frankly 
do not yet know what the test measures. All we 
want to get is an idea of the common types of 
things that are seen. Also, unlike the Rorschach this 
test has many more inkblots so that it will be 
necessary to test you over a five-day period. Each 
testing should take only thirty to forty-five min- 
utes. I am going to show you these cards one at a 
time. They are really only inkblots and people see 
different things in them. There are no right or 
wrong answers—you just tell me what you see. I 
would like you to see two things in every card— 
that is, give me two responses per card. Do you
have any questions before we begin? O.K. Here's
the first card.


Using this procedure the control Ss went through
[...] experimental Ss, [...] earlier took place. [...]
which was in the category that E [...]. Strong quality
reinforcement [consisted of E's saying “good” or]
“that's fine.” The other [S of this] pair received a
weak quality [reinforcement], which consisted simply
of saying “mm-hm” and nodding of E's head.
Reinforcement [was administered to the] experimental
Ss during the middle three sessions, that is, Sessions
2, 3, and 4. On the fifth and final session
reinforcement was not administered.


A total of 36 experimental Ss were employed in a
three-by-two design with six replications. That is,
there were three inkblot dimensions, two qualities of
reinforcement, and six Ss in each cell.


Study II


The investigation [...] question of this [...] inde-
pendent study. [...] dependent upon the [...]. For
one thing, whatever inkblot [...] location, or
determinants [...] susceptible to reinforcement [...].
Also, instead of using [...] reinforcement, only weak
[reinforcement was used],
which is probably used routinely and inadvertently
in the clinical testing situation. This is particularly
relevant when one considers the factor of the anxiety
of the patient in the testing situation. It is possible
that under anxiety producing conditions the patient
may become more sensitive to relatively subtle stimuli
than he would under more neutral conditions.

The control group for this study received instruc-
tions that were made as neutral as possible with the
emphasis being placed on a “new test we didn't know
too much about” and minimizing personality [...].

The experimental Ss in this part of the study had
not been used previously. There were two experi-
mental conditions in this part of the experiment; Ss
in both groups received weak reinforcement. One
experimental group was run under conditions of a
positive set; the other under a negative set. Although
both sets were designed to induce ego involvement in
the Ss, it was hoped that the positive set would
motivate an S to “do his best” whereas it was ex-
pected that a negative set would provide a threaten-
ing test atmosphere. Again there was a three-way
match on some content category between the two
experimental Ss and a control S (or group of Ss).
All of these Ss were matched on the basis of the
first testing session. Unlike Study I, both experimental
Ss received weak reinforcement. One experimental S
of the pair received reinforcement under a negative
set instruction; the other S under a positive set in-
struction.

A total of 12 experimental Ss (that is, six pairs of
experimental Ss) participated under the two experi-
mental set instructions. As in Study I, the Ss were
tested over five sessions with no reinforcement being
administered in the first and final sessions.

Prior to the first session, the experimental Ss re-
ceived instructions similar to those given to the Study
I Ss. However, the instructions were slightly modified
to the extent of emphasizing that the first test session
was only a practice session. Thus they were told:


Today’s session will constitute a practice session
which will serve to acquaint you with the new [...].
In other words I'm not particularly concerned with
what you see today.


Following the first session (control) the S’s pro-
tocol was evaluated and he was matched with a
control and another experimental S on the particular
class of responses the E wished to reinforce. He was
then either assigned to a positive set or a negative set
experimental group.

Negative Set. On the day of the second testing session,
experimental Ss in the negative set group were given
test instructions worded in a negative tone. Thus they
were told:


These inkblots have been found adequate for indi-
cating neurotic people . . . . The immediate pur-
pose of this research is to determine the neurotic
potential that is inherent within a normal popu-
lation.


Under this experimental set, it was anticipated that
such signs of interest on the part of the E would
constitute punishment. Again, as in previous investi-
gations, reinforcements were not given in the final
session.

Positive Set Instructions. The instructions given to
this group on the first day were identical to those
given to the experimental Ss in the negative set on
their first day.

Positive Set. On the day of the second testing ses-
sion instructions to this group were modified in
such a manner as to create a “positive” test atmos-
phere. Thus, the Ss were told:


The inkblots you will be shown from now on have
been [...] to a superior intellectual group com-
posed of people that have been successful in their
professions . . . . Despite the divergent interests
(of these people) the tests get at what we refer to
as a common creative potential . . . . The imme-
diate purpose of this research is to determine just
how much of this intellectual and creative potential
you possess.


The reinforcement for Ss in this group was iden-
tical to the experimental Ss in the negative set.
It was hoped that under this experimental set
the responses emitted by E would serve as reinforc-
ing stimuli to increase the frequency of a given
class of responses.


TABLE 1

COMPARISON OF MEAN D SCORES OF EXPERIMENTAL
Ss TO MEAN D SCORES OF THEIR RESPECTIVE
MATCHED CONTROLS
(N = 6 in each group)

Group                        Mean          T a
Location Reinforcement       [illegible]   [illegible]
  Matched Control            [illegible]
Determinant Reinforcement    [illegible]   [illegible]
  Matched Control            [illegible]
Content Reinforcement        [illegible]   [illegible]
  Matched Control            [illegible]

a Wilcoxon matched pair sign rank test.
* p = .025, one-tailed test.


TABLE 2

COMPARISON OF D’ SCORES OF WEAK AND STRONG
QUALITY REINFORCEMENT
(N = 18 in each group)

          Mean   SD            T a
Weak      6.3    [illegible]   [illegible]
Strong    5.8    [illegible]

a Wilcoxon pair rank test.


FIG. 1. Mean frequency of responses in each test session of experimental and
matched control subjects in the location dimension. (Curves: strong reinforcement,
weak reinforcement, matched control.)

RESULTS


Study I

The change or the difference in the number
of responses in a reinforced category from the
first to the last session for a given S from now
on will be referred to as a D score. A minus
sign before this score indicates a decrease; the
omission of a sign indicates an increase.
A comparison was made between the D
scores of the experimental Ss and the D scores
of their respective matched controls for each
dimension. Using the Wilcoxon matched pair
sign rank test, the results in Table 1 list the
mean D scores for experimental and matched
[...] reach significance.

In order to compare the effects of reinforce-
ment quality a different score was derived.
The D’ score represents the net change or
difference between D score of the experimental
S and his respective matched control. Using
the D’ scores, each strong quality reinforce-
ment S was compared with the matched ex-
perimental S that received weak quality re-
inforcement. The results presented in Table 2
show that the differences between qualities of
reinforcement were not significant for 17 de-
grees of freedom.


FIG. 2. Mean frequency of responses in each test session of experimental and
matched control subjects in the determinant dimension. (Curves: strong
reinforcement, weak reinforcement, matched control.)

FIG. 3. Mean frequency of responses in each test session of experimental and
matched control subjects in the content dimension. (Curves: strong
reinforcement, weak reinforcement, matched control.)

FIG. 4. Mean number of responses independent of the reinforcement category given
at each test session by experimental subjects and their matched controls in Study II.
(Curves: positive set, negative set, matched control.)
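
The two change scores defined in this section amount to simple subtraction, which the following Python sketch makes explicit. The function names and the sample frequencies are hypothetical and serve only as an illustration; where a matched control is a group of Ss rather than a single S, the sketch assumes that the control D is computed beforehand (for example as a group mean), which the text does not specify.

```python
def d_score(first_session, last_session):
    """D score: change in reinforced-category responses, first to last session.

    A negative value indicates a decrease, a positive value an increase.
    """
    return last_session - first_session


def d_prime_score(experimental_d, control_d):
    """D' score: net change of an experimental S relative to his matched control."""
    return experimental_d - control_d


# Hypothetical frequencies, for illustration only.
experimental_d = d_score(first_session=6, last_session=14)   # increase of 8
control_d = d_score(first_session=6, last_session=5)         # decrease of 1
net_change = d_prime_score(experimental_d, control_d)        # 8 - (-1) = 9
```

Because the experimental S and the control start from the same first-session frequency, D' isolates the part of the change that the unreinforced control did not show.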

The graphs presented in Figures 1, 2, and 3 
show the mean number of responses given in 
the reinforced categories by experimental Ss 
and their matched controls for each testing 
session. In both the location and content 
dimensions weak reinforcement was slightly 
more effective than strong reinforcement. The 
reverse was true of the determinant dimen- 
sion. In the determinant dimension, however, 
one experimental S receiving strong reinforce- 
ment showed an unusually large increase in 
the reinforced category, thus producing an 
artificial elevation of the mean for that group. 
If his score is deleted and the mean of only
five Ss used, then the weak reinforcement
mean would be slightly higher than the strong
reinforcement. However, there were no signifi-
cant differences between weak and strong
quality reinforcement in any of the inkblot
dimensions.


Study II

Again using D’ scores, a comparison of the
two experimental groups (positive set and
negative set) in Study II was made. The mean
D’ scores for each experimental group are
shown in Table 3. The graph presented in
Figure 4 shows the mean number of responses
given in each session by the experimental Ss
and their matched control Ss. Since one of the
problems of this research was to study the
relative effectiveness of reinforcement in an
anxiety producing situation, a comparison was
made between the D’ scores of content Ss
receiving weak reinforcement in Study I and
the D’ scores of the positive set Ss of Study II.
The mean increase of the Study I Ss was 9.5;
the mean increase of the Study II Ss was 7.3.
The differences were not significant.


TABLE 3
COMPARISON OF MEAN D’ CHANGES OF POSITIVE AND
NEGATIVE SET EXPERIMENTAL SUBJECTS
(N = 6 in each group)

                Mean    T a
Positive Set    +7.3    0*
Negative Set    -3.0

a Wilcoxon pair rank test.
* p = .025, one-tailed test.
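
For readers unfamiliar with the Wilcoxon matched-pairs signed-rank test used throughout these analyses, the statistic T can be sketched as below. This is a minimal sketch under stated assumptions, not the author's computation: the function names and the six difference scores are hypothetical, zero differences are dropped, tied absolute differences receive averaged ranks, and the exact probability is obtained by enumerating every sign pattern (practical only for small N).

```python
from itertools import product


def signed_ranks(differences):
    """Rank nonzero differences by absolute size, averaging tied ranks."""
    ordered = sorted((d for d in differences if d != 0), key=abs)
    ranks = [0.0] * len(ordered)
    i = 0
    while i < len(ordered):
        j = i
        while j < len(ordered) and abs(ordered[j]) == abs(ordered[i]):
            j += 1
        tied_rank = (i + 1 + j) / 2  # mean of ranks i+1 .. j
        for k in range(i, j):
            ranks[k] = tied_rank
        i = j
    return ordered, ranks


def wilcoxon_t(differences):
    """T: the smaller of the positive and negative rank sums."""
    ordered, ranks = signed_ranks(differences)
    positive = sum(r for d, r in zip(ordered, ranks) if d > 0)
    negative = sum(r for d, r in zip(ordered, ranks) if d < 0)
    return min(positive, negative)


def exact_one_tailed_p(differences):
    """P(rank sum <= observed T) when every sign pattern is equally likely."""
    _, ranks = signed_ranks(differences)
    observed = wilcoxon_t(differences)
    patterns = list(product((0, 1), repeat=len(ranks)))
    hits = sum(
        1
        for signs in patterns
        if sum(r for r, s in zip(ranks, signs) if s) <= observed
    )
    return hits / len(patterns)


# Hypothetical difference scores for six matched pairs, all in one direction.
hypothetical_differences = [8, 5, 9, 7, 6, 10]
T = wilcoxon_t(hypothetical_differences)
p = exact_one_tailed_p(hypothetical_differences)
```

With six pairs there are 64 equally likely sign patterns, so T = 0 has an exact one-tailed probability of 1/64, and it is the only value of T whose probability is at or below .025 for N = 6.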


DISCUSSION 
Study I 
The results show that the content and de- 
terminant dimensions are about equally vul- 
nerable to reinforcement effects. It is also 
fairly clear that of the three inkblot dimen-
sions, the location dimension is the most diffi-
cult to manipulate, that is, shows the least 
change as a function of reinforcement. One 
possible reason for the ineffectiveness of re- 
inforcement on the location category may lie 
in the nature of the response class. The loca- 
tion dimension is vaguely defined, in that 
there is a great deal of dissimilarity between 
the various members that compose this re- 
sponse class. Because of the vagueness of this 
category and the relatively short training pe- 
riod with the experimental Ss, an adequate 
stimulus discrimination is difficult to achieve. 
Previous investigators (Buss & Durkee, 1958; 
Greenspoon, 1955; Salzinger, in press) have 
considered the nature of the response class an 
important variable in determining the effects 
of reinforcement. In general, the larger the 
response class, the less effective is the condi- 
tioning. As the size of the response class in- 
creases, there is frequently a corresponding 
increment in heterogeneity among members of 
the response class. Consequently, generaliza- 
tion of reinforcement to all members of the 


response class is not as effective as with a 
more homogeneous class. 
Although there is [...] cases more effective than [...].
It is interesting to [...]
were found by Wickes (1956) and Gross
(1959). Both of these [...]
with investigating the effects of verbal versus
nonverbal reinforcement [...] re-
sponses on an inkblot test. In both studies
verbal and nonverbal reinforcements were ef-
fective with nonverbal being slightly more
effective than the verbal reinforcement.

The results of this study indicate that ex-
pressions of “neutral” interest that the clin-
ician uses in administering a test can serve as
reinforcing stimuli.


Study II


[...] it was expected that the
same “weak” stimuli that E made contingent
upon a given class of responses would have
different effects under these two experimental
sets. As can be seen from Figure 4 and Table
3, the two experimental groups responded to
the reinforcing stimuli in a different manner.
Although the amount of change (D) was not
significant in every case, the direction of
change was consistent with all experimental Ss.

The small mean amount of change (-3.0)
on the part of the negative set group was
probably a function of a limitation in the ex-
perimental design. Classes of responses con-
taining five to eight responses in the initial 
session were considered as probably the best 
for experimental manipulation. This range of 
responses was found to be satisfactory for 
demonstrating the effects of positive reinforce- 
ment; it is not adequate, however, for effec- 
tively demonstrating a decrease. Even if the S 
drops from five responses in his initial session 
to no responses, the decrease is not very strik- 
ing. In light of these considerations, any con- 
clusions regarding the effectiveness of rein- 
forcement under conditions of a negative set 
are, at best, quite tenuous. Future research 
employing a better design and a larger number 
of Ss is necessary to substantiate these find- 
ings. 

The interpretation of Study II results be-
comes even more difficult when a comparison
is made between the neutral set Ss of Study I
and the positive set Ss of Study II. The dif-
ference between these two groups was not
significant. Does the induction of anxiety have
very little to do with reinforcement effect? A
partial answer is suggested by some of the
verbalizations of the experimental Ss in Study
II who stated that although they felt very
uncomfortable and nervous during the earlier
stages of the testing they felt less anxious as
the testing progressed. Apparently the E’s at-
tempt to induce anxiety was somewhat un-
successful. One explanation of this failure is
that there may be adaptation effects in a
prolonged testing series. It will be recalled
that the anxiety producing instructions were
given in the second session and not mentioned
again after that. As the sessions progressed the
Ss might have experienced less subjective dis-
comfort. The E also noticed that in the earlier
stages of the testing the Ss seemed to make
more affective statements concerning the ink-
blots, and expressed more concern about their
performance than in later sessions.

In view of the questionable effectiveness of 
the anxiety producing set, definitive conclu- 
sions cannot be drawn. Future research along 
these lines should make an attempt to main- 
tain a high anxiety level if a prolonged testing 
series is required. One trend from Study II
does appear to be evident. There is an inter- 
action between what the S conceived the test 
to be and the manner in which stimuli serving 
as reinforcement affected some of his re- 


sponses. 
SUMMARY 


The purpose of this research was to study 
the behavioral effects of certain contingent 
stimuli emitted by the examiner in a projec- 
tive testing situation. This research was di- 
vided into two separate investigations. The 
first investigation was concerned with deter-
mining what dimensions of an inkblot test
(content, location, determinants) were the
most sensitive to reinforcement and also com-
paring the effects of strong and weak quality
reinforcement. Strong quality reinforcement
consisted of the examiner’s saying “good” or
“that’s fine” to a particular response in the
class of responses the examiner had decided
to reinforce. Weak quality reinforcement con-
sisted of the examiner’s nodding his head and
saying “mm-hm.” This part of the research
was referred to as Study I.

The second part of the investigation was
concerned with situational variables and their
relationship to examiner reinforcement. Thus
it was expected that under anxiety producing
situations the effects of examiner reinforce-
ment would be accentuated. This part of the
research was referred to as Study II.

The results of Study I were as follows:

1. The content and determinant dimensions
showed the largest reinforcement effects.


2. The location dimension was the most 
resistant to change as a function of reinforce- 
ment. 

3. There was no difference between strong 
and weak reinforcement; if anything, weak 
reinforcement was slightly more effective than 
strong reinforcement. 




The results of Study II were as follows: 
There were significant differences between the 
negative and positive set experimental sub- 
jects. Under positive set instructions weak 
quality stimuli had the effects of positive 
reinforcers. Under negative set instructions 
weak quality stimuli had a punishing effect 
producing a decrease in the reinforced cat- 
egory. Only tentative conclusions
with respect to the effectiveness of reinforce-
ment under a negative set, however, could be
drawn because of design limitations and the small
number of subjects employed. This was attributed to
the questionable effectiveness of the anxiety 
producing instructions given only at the be- 
ginning of a long testing series. 
FURTHER EVIDENCE CONCERNING THE EFFECT
OF HANDWRITING HABITS UPON THE
LOCATION OF DRAWINGS 1


WAYNE DENNIS AND EVELYN RASKIN
Brooklyn College 


The placement on the page of a human fig-
ure drawing often is interpreted as resulting
from personality factors (Levy, 1950; Mach-
over, 1949). Although the location of draw-
ings [...] to personality, other
influences are also operative. Among these
is the influence of handwriting habits upon
drawings. In a previous study (Dennis, 1958)
it was found that among American, Armenian,
[...] Armenians and the top-right [...]
Arabic-writing peoples.

This finding is not entirely conclusive since
the differences between the groups in regard
to the location of drawings might conceiv-
ably be due to factors other than handwriting.
However, if in many other linguistic groups,
the location of drawings is found to be re-
lated to the starting position in handwriting,
the hypothesis that handwriting [...]
drawing responses will [...].


METHOD 


The top-left handwriting [groups consisted of Turks,]
Cambodians, and Japanese. [...] The top-right handwrit-
ing groups consisted of Iranians and Israelis. The
number of Ss in each group is indicated in Table 1.

1 [...] Cambodia, Washburn, Tong Ho; in Israel, May Bere Merom, [...]

In each group the Ss were school children aged
6-12. They were given either the Goodenough Draw-
a-Man Test or the Machover Draw-a-Person Test.
Wherever possible, drawings were obtained in pencil
on 8½” by 11” white unruled paper.

As in the previous study, each drawing was scored
as to whether the topmost or bottommost part of
the drawing was nearer to its respective edge of the
drawing paper. Similarly, we determined which side
of the body was nearer to its corresponding
edge of the paper. This produced a four-fold cate-
gorization of locations: top-left, top-right, bottom-
left, and bottom-right. While this method of scoring
does not attempt to measure the body surface
in each quadrant, in the majority of instances, the
scored quadrant contains a larger part of the area
of the drawing than does any other quadrant.
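
The scoring rule just described can be sketched as a small Python function. This is an illustrative sketch only: the function name, the margin parameters, and the sample measurements are hypothetical, and the text does not say how ties were handled, so the sketch arbitrarily assigns ties to the bottom and right.

```python
def score_quadrant(top_margin, bottom_margin, left_margin, right_margin):
    """Classify a drawing by the page edges its extremes lie nearest to.

    Each margin is the distance from the corresponding extreme of the figure
    (topmost, bottommost, leftmost, rightmost point) to that edge of the page.
    Ties go to "bottom" and "right"; this tie rule is an assumption.
    """
    vertical = "top" if top_margin < bottom_margin else "bottom"
    horizontal = "left" if left_margin < right_margin else "right"
    return f"{vertical}-{horizontal}"


# Hypothetical measurements in inches, for illustration only.
example = score_quadrant(top_margin=1.0, bottom_margin=4.0,
                         left_margin=0.5, right_margin=3.0)
```

Here the figure sits closer to the top and left edges than to the bottom and right edges, so it is scored top-left.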


RESULTS 


Table 1 shows that the results of this study
are completely in accord with the motor
transfer hypothesis. In each of the five lin-
guistic groups, the quadrant which contains
the largest number of location scores is that
in which the initial handwriting movement is
ordinarily made. Computation shows that in
each group the inequality of the quadrants
exceeds the .001 level of confidence.


TABLE 1

LOCATION OF DRAWINGS IN DIFFERENT
HANDWRITING GROUPS

                       Location of Drawings (%)
Group          N      TL            TR            BL            BR
Top-Left
  Turkish      289    42            [illegible]   [illegible]   14
  Cambodian    138    [illegible]   9             [illegible]   2
  Japanese     349    38            16            34            12
Top-Right
  Iranian      307    16            4[?]          17            2[?]
  Israeli      366    27            45            12            16


The above finding should not be construed
as necessarily indicating that placement of the
drawing on the page is devoid of significance
for personality interpretation when the writ-
ing habits of the individual child are known.
Within each of our groups, as Table 1 shows,
there were always some children who placed
their drawings in positions which deviated
markedly from their writing conventions. Such
deviations may indicate differences in a child’s
personality. Our data do not show whether
this is, or is not, the case.


SUMMARY AND INTERPRETATION 


Corroborating the results of an earlier
study which involved three linguistic groups,
the present study of five additional linguistic
groups indicates that human figure drawings
tend to be located in that quadrant of the
page within which the subject begins to write,
to a greater degree than in any other quad-
rant.

Thus this study, like the preceding one, 
supports the hypothesis that a factor other 
than personality plays a marked role in cre- 
ating intergroup differences in locations of 
drawings. 

REFERENCES 

Dennis, W. Handwriting conventions as determinants 
of human figure drawings. J. consult. Psychol., 
1958, 22, 293-295. 

Levy, S. Figure drawings as a projective test. In
L. E. Abt & L. Bellak (Eds.), Projective psy-
chology. New York: Knopf, 1950. Pp. 257-297.

Machover, K. Personality projection in the drawing
of the human figure. Springfield, Ill.: Charles C
Thomas, 1949.

(Received October 2, 1959)


Journal of Consulting Psychology
1960, Vol. 24, No. 6, 550-551


ACHIEVEMENT MOTIVATION AND FIELD 
INDEPENDENCE 


GILBERT HONIGFELD AND IRWIN M. SPIGEL 1


Temple University 


[...] perception
(Witkin, Lewis, Hertzman, Machover, Meiss-
ner, [...]) [...] positions
attribute low need for achievement (n Ach)
to restrictive [...] [demands] for inde-
pendence [...]. For this rea-
son [...] female and 11 male [...]
obtained a .40 product-moment correlation.

Owing to the possible differences between 
males and females in terms of their perform- 
ance on these variables a more balanced sam- 
ple was deemed desirable, and so this repli- 
cation was performed. The basic hypothesis 
of a significant positive relationship between 
n Ach and field independence remained unal- 
tered, but additional efforts were undertaken 
to explore the possibility of systematic sex 
differences. 

Fifteen male and 13 female subjects (Ss) 
were used. They were undergraduates at a 
large, urban university and were naive re- 
garding the instruments used and the goals of 
the study. Ss in small numbers were adminis- 
tered under neutral set a group adaptation 
of the McClelland et al. (1953) four card 
TAT for measuring n Ach. They were indi- 


1 The help of Horace A. Page in supplying mate-
rials and assistance is appreciated. 


vidually administered a modification of the 
Gottschaldt Embedded Figures Test (EFT), 
in which simple figures had to be isolated 
from complex figures (Thurstone, 1944). 

Scores on the n Ach test ranged from —2 
to +19 with a median of 8.5 (the higher the 
score, the greater the achievement motiva-
tion). All scoring was done by one of the au- 
thors who obtained a rescoring reliability co- 
efficient of .92. Total times to isolate simple 
from complex figures ranged from 216” to 
1660” with a median of 703.5”. Both males 
and females were shown to be homogeneous 
with regard to median scores on both vari- 
ables (chi square tests indicated no differ- 
ences). 

A Yates-corrected chi square of 6.04 (p <
.05, 2 df) was obtained between EFT scores
dichotomized at the median and the n Ach
test trichotomized at the upper and lower
quartiles, after the manner of McClelland
et al. (1953). Peculiarities in the distributions
were suggested when a 2 x 2 chi square
analysis (median test) on the same data
proved statistically insignificant. It appeared
that such peculiarities might be attributable
to sex differences. When separate rank-order
correlations were computed for each sex it
was found that only the females behaved in
the manner suggested by the Wertheim and
Mednick (1958) hypothesis. The degree of
correlation for the women (.42, 11 df) agrees
with their earlier finding. In the case of the
males a slight correlation (-.12, 13 df) in
the opposite direction was revealed. The dif-
ference between correlations was significant
beyond the .01 level. In light of these appar-
ently divergent trends, the low-order over-
all correlation obtained (.24, 26 df) suggests
an interaction effect operating across sexes. It
will be recalled that the Wertheim and Med-
nick (1958) sample favored females in the
ratio of 3 to 1. It is possible then that their
results reflect the female contribution which
alone is in the hypothesized direction.

In conclusion, the replication confirmed the 
finding by Wertheim and Mednick (1958) 
of a significant positive relationship between 
n Ach and field independence. However, there 
is evidence in the present study to suggest 
that this hypothesis is pertinent only to a fe- 
male population. 
RATIONALES FOR THE “TWISTED PEAR”

In a recent article, Jerome Fisher (1959) 
described a phenomenon which he termed the 
“twisted pear.” Essentially, this refers to the 
fact that in a number of situations in which a 
predictor variable with a cutting point is re- 
lated to a dichotomous criterion such as brain 
damage present or not present—or poor versus 
good adjustment—the predictions made yield 
a substantial proportion of true positives at 
the expense of very few false positives, but 


with a large number of false negatives. This
phenomenon [...] fact of the [...] predictor, [...]
(a) [that the] twisted pear does [...] dis-
tributions are not characteristic of the popula-
tions involved, (b) that the twisted pear is [...]
such a way that it [...] [place]ment of the cutting
score on the predictor, (c) that various rela-
tionships between predictors and criteria can
result in twisted pears.
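
That a twisted pear is compatible with normally distributed predictor scores can be illustrated numerically. The sketch below is not taken from Fisher's data or from this article: the means, standard deviations, cutting score, and group sizes are hypothetical values chosen only so that the abnormal group has the higher mean and the larger variance, and equal group sizes stand in for an abnormal population that is overrepresented relative to its base rate.

```python
from statistics import NormalDist


def fourfold(normal, abnormal, cut, n_normal, n_abnormal):
    """Expected fourfold table when scores above `cut` are called abnormal."""
    return {
        "true_pos": n_abnormal * (1 - abnormal.cdf(cut)),
        "false_neg": n_abnormal * abnormal.cdf(cut),
        "false_pos": n_normal * (1 - normal.cdf(cut)),
        "true_neg": n_normal * normal.cdf(cut),
    }


# Hypothetical populations: both normal, abnormals higher and more variable.
normals = NormalDist(mu=50, sigma=5)
abnormals = NormalDist(mu=60, sigma=15)

table = fourfold(normals, abnormals, cut=65, n_normal=100, n_abnormal=100)
```

With these assumed values the cutting score lies three standard deviations above the normals' mean, so fewer than one normal in a hundred is a false positive, while roughly a third of the abnormals are identified and the remainder are false negatives: the pattern described as a twisted pear.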
By quoting a statement made by Boring, 
Fisher at least implies that his twisted pear 
is inconsistent with a normal curve. However, 
the figures in his fourfold tables are derived 
from considering two distributions, each di- 
vided in two parts by a cutting score on the 
predictor. Especially when the abnormal pop- 
ulation is heavily overrepresented with respect 
to the base rate of the abnormality in the 
general population, two normal distributions 
could easily combine to produce a twisted 
pear. This is exactly what would happen if the 
mean of the abnormals on the predictor were 
higher than that of the normals and the 
variance were also greater in such a way that 
the range of the abnormals’ distribution began 
at the bottom of the normals’ distribution but 
extended further into the “high” scores. 

The relationship between IQ as a predictor 
variable and educational achievement as a 
criterion is a poor example of the twisted pear. 
While low IQs accurately identify persons who 
will not go to college (with few false positives 
and many false negatives), high IQs in Fisher’s 
table do the same thing for people who will 
achieve at least at the ninth grade level. Ta- 
bles 1 and 2, which are derived from the table 
on page 402 in Fisher’s article (1959), show 
that equally effective predictions can be made 
using both high and low cutting scores. 


TABLE 1 
RELATIONSHIP BETWEEN LOW INTELLIGENCE 
AND EDUCATIONAL ACHIEVEMENT 

Intelligence             No College    Some College 

[illegible] defective    100%          0% 
[illegible]              [?]1%         29% 


TABLE 2 
RELATIONSHIP BETWEEN HIGH INTELLIGENCE 
AND ACHIEVEMENT 

                 Achievement 
Intelligence     Grade 8 and lower    Grade 9 or more 

Gifted           0%                   100% 
Control          30%                  61% 


Since discrimination can be obtained at 
either end of the continuum merely by shift- 
ing the cutting point on the predictor, Fisher’s 
description of the twisted pear does not apply. 
This clearly indicates that the generality of 
the phenomenon he describes is not unlimited. 

One set of circumstances which can yield 
twisted pear predictor-criterion fourfold tables 
has to do with the multiplicity of causes bring- 
ing about the poor performance or inadequate 
adjustment represented by one part of the 
dichotomous criterion. These causes may func- 
tion in an interrelated manner or may be 
relatively independent. If we assume perfect 
reliability of our criterion and perfect reliabil- 
ity and validity of a predictor as a measure- 
ment of just one or a subset of several of the 
causes related to a malfunctioning of the 
organism, it is to be expected that a subgroup 
of those individuals who are nonadaptive with 
respect to the criterion will be accurately (no 
false positives) picked up by the predictor. 
Other predictors related to other causes would 
be needed to identify cases missed by the 
predictor in question. Each predictor can be 
considered as a way of dipping into a pool 
(class) of unfortunates in order to select a 
subclass of problem cases. In this way, each 
simple or complex predictor would produce a 
twisted pear effect with respect to the crite- 
rion because some cases would be accurately 
identified and others left behind. 
Consider for a moment people with hearing 
difficulties and assume we cannot ask them 
about their hearing. Relatively independent 
causes of hearing loss can consist of [...] 
the eardrum, middle ear, cochlea, and central 
nervous system. A good determination of the 
integrity and functioning of the eardrum 
would pick up some hearing problems with [...] 
false positives but with many false negatives. 
In a similar fashion, other hearing problems 
would be discovered by investigating the 
bones of the middle ear or the functioning of 
the cochlea, but in each case many hearing 
problems would be missed. 
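
The multiple-cause argument above can be illustrated with a small calculation. The sketch below is only an illustration under stated assumptions: the four damage probabilities are hypothetical values chosen for the example (they are not data from this article), the causes are treated as fully independent, and the function name `fourfold_table` is invented here. The criterion is "a hearing problem is present if any cause is present," and the predictor is a perfectly reliable and valid test of a single cause.

```python
from itertools import product

# Hypothetical probabilities that each site is damaged.
# These values are assumptions for illustration, not data from the article.
CAUSE_PROBS = {"eardrum": 0.05, "middle_ear": 0.08, "cochlea": 0.06, "cns": 0.03}


def fourfold_table(predictor_cause, cause_probs=CAUSE_PROBS):
    """Expected proportions in the predictor-by-criterion fourfold table.

    Criterion: hearing problem present if ANY cause is present.
    Predictor: a perfect test of ONE cause (`predictor_cause`).
    Causes are assumed to be independent of one another.
    """
    names = list(cause_probs)
    table = {"true_pos": 0.0, "false_pos": 0.0, "false_neg": 0.0, "true_neg": 0.0}
    # Enumerate every present/absent pattern of causes and weight it
    # by its probability under independence.
    for pattern in product([False, True], repeat=len(names)):
        state = dict(zip(names, pattern))
        prob = 1.0
        for name in names:
            p = cause_probs[name]
            prob *= p if state[name] else 1.0 - p
        has_problem = any(pattern)
        flagged = state[predictor_cause]
        if flagged and has_problem:
            table["true_pos"] += prob
        elif flagged:
            table["false_pos"] += prob
        elif has_problem:
            table["false_neg"] += prob
        else:
            table["true_neg"] += prob
    return table


eardrum_table = fourfold_table("eardrum")
```

Under these assumptions the eardrum test produces no false positives, since every flagged case has a hearing problem by definition, while the cases caused by the other three sites all fall into the false negative cell.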

Clearly, predictor variables need not be 
determinations of causes but often consist of 
measurements of concomitant effects. For ex- 
ample, psychological consequences of brain 
damage such as memory disturbances, Pio- 
trowski’s signs, or impairment in perceptual- 
motor coordination, or physiological and an- 
atomical anomalies such as EEG disturbances 
or displacement of the ventricles will identify 
different cases of damage to the central nerv- 
ous system. Even though there will be overlap 
in the sense that some of the cases will be 
identified by more than one measure, the fact 
that some cases are missed by each measure 
yet identified by other means will make the 
predictor-criterion relationship look like a 
twisted pear for that measure. 

More complex circumstances and relation- 
ships than those represented by the above 
examples could also produce twisted pears. 
The phenomenon is neither universal nor 
unitary and depends upon a variety of condi- 
tions affecting the distributions of both pre- 
dictor and criterion measurements. 
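
The earlier point that two normal distributions can combine to give a twisted pear can be checked numerically. The following is a minimal sketch, not a reanalysis of Fisher's data: the means, standard deviations, group sizes, and cutting score are all hypothetical values chosen so that the abnormal group has a higher mean and a larger variance than the normal group, and the helper name `fourfold_counts` is invented for the example.

```python
from statistics import NormalDist


def fourfold_counts(normals, abnormals, n_normals, n_abnormals, cutoff):
    """Expected cell counts when scores above `cutoff` are called abnormal."""
    p_abn_below = abnormals.cdf(cutoff)  # abnormals missed by the predictor
    p_nor_below = normals.cdf(cutoff)    # normals correctly passed
    return {
        "true_pos": n_abnormals * (1.0 - p_abn_below),
        "false_neg": n_abnormals * p_abn_below,
        "false_pos": n_normals * (1.0 - p_nor_below),
        "true_neg": n_normals * p_nor_below,
    }


# Hypothetical parameters (assumptions for illustration only).
normal_group = NormalDist(mu=50, sigma=5)
abnormal_group = NormalDist(mu=60, sigma=12)

# Equal group sizes overrepresent the abnormals relative to any low base rate.
table = fourfold_counts(normal_group, abnormal_group,
                        n_normals=100, n_abnormals=100, cutoff=65)
```

With these assumed values the table shows a substantial number of true positives, almost no false positives, and a large number of false negatives, which is the pattern described as a twisted pear, even though both underlying distributions are normal.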
BRIEF REPORTS 


MANIFEST ANXIETY AND SOCIAL PERCEPTION 


JOEL R. DAVITZ 
Teachers College, Columbia University 

AND 

DONALD J. MASON 
University of Colorado 


This study investigated the relationship be- 
tween manifest anxiety and two aspects of social 
perception: accuracy of perception and assumed 
similarity. Previous studies report somewhat con- 
tradictory findings concerning the relationship of 
adjustment and accuracy of perception, although 
the results of both studies support the hypothesis 
that personal adjustment is positively related to 
the tendency to perceive others as similar to one- 
self. 

The subjects (Ss) of this research were 156 
members of three college fraternities. Each S 
answered an interest questionnaire for himself 
and predicted the answ[...] definitely do not like. 
[...] the short form of the [...] developed by Bendig 
(1956). The data were analyzed by comparing 
Ss in the upper (high anxious) and lower (low 
anxious) quartiles of the distribution of manifest 
anxiety scores. 

Following the general procedure used by Bieri, 
Blacharsky, and Reid (1955) and also by Chance 
(1958), accuracy of perception scores were com- 
puted separately for items on which Ss were 
actually similar to the persons whose responses 
they predicted, and for items on which they were 
actually dissimilar on the interest questionnaire. 
Answers to a given item were considered similar 
if they did not differ by more than one scale- 
unit on the five-point scale indicating interest in 
each activity; answers were defined as dissimilar 
if they differed by more than one scale-unit. Ac- 
curacy scores were obtained by summing the 
arithmetic differences between an S’s predicted 
ratings for the six persons whose responses he 
predicted and the actual responses of these six 
persons on the interest questionnaire. Assumed 
similarity scores were computed by summing the 
arithmetic differences in ratings of an S’s own re- 
sponses and his predictions of the responses of 
the six persons whose answers he predicted. 

There was no significant difference between the 
high and low anxious groups for either accuracy 
of perception on similar items, or accuracy of 
perception on dissimilar items. However, the dif- 
ference between the high and low anxious groups 
for assumed similarity was significant at the .01 
level in the direction of high anxious Ss perceiv- 
ing themselves as less similar to others. 
The results for assumed similarity are consist- 
ent with the two Previous studies (Bieri et al., 
1955; Chance, 1958); however, the absence of 
significant differences for accuracy of perception 


appears to contradict the findings of Bieri, Bla- 
charsky, and Reid. 
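
The scoring rules described above can be sketched in a few lines. This is an illustration only, resting on assumptions: "arithmetic differences" is read here as absolute differences, ratings are on the five-point scale, the ratings in the usage example are invented, and the function names are hypothetical. On this reading, lower scores mean greater accuracy and greater assumed similarity.

```python
def split_items(own, actual, max_gap=1):
    """Split item indices into similar and dissimilar items.

    An item is similar when S's own answer and the other person's
    actual answer differ by no more than `max_gap` scale units.
    """
    similar = [i for i, (o, a) in enumerate(zip(own, actual))
               if abs(o - a) <= max_gap]
    dissimilar = [i for i, (o, a) in enumerate(zip(own, actual))
                  if abs(o - a) > max_gap]
    return similar, dissimilar


def summed_difference(xs, ys, items):
    """Sum of absolute rating differences over the given item indices."""
    return sum(abs(xs[i] - ys[i]) for i in items)


def score_subject(own, predictions, actuals):
    """Score one S against every person whose answers he predicted.

    `predictions` and `actuals` are parallel lists, one rating list
    per predicted person.
    """
    scores = {"accuracy_similar": 0, "accuracy_dissimilar": 0,
              "assumed_similarity": 0}
    for predicted, actual in zip(predictions, actuals):
        similar, dissimilar = split_items(own, actual)
        scores["accuracy_similar"] += summed_difference(predicted, actual, similar)
        scores["accuracy_dissimilar"] += summed_difference(predicted, actual, dissimilar)
        scores["assumed_similarity"] += summed_difference(
            own, predicted, range(len(own)))
    return scores


# Invented ratings for one S and one predicted person (four items).
example = score_subject(own=[1, 3, 5, 2],
                        predictions=[[1, 4, 3, 3]],
                        actuals=[[2, 5, 1, 2]])
```

In the study each S predicted six persons, so `predictions` and `actuals` would each hold six rating lists rather than the single list used in this example.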
THE INTERPRETATION OF THREE PERSONALITY 
FACTORS IN JUVENILE DELINQUENCY 1 


HERBERT C. QUAY, 
Vanderbilt University 

DONALD R. PETERSON, 
University of Illinois 


In a previous study (Peterson, Quay, & Cam- 
eron, 1959) a factor analysis of two personality 
scales known to measure tendencies toward juve- 
nile delinquency revealed the presence of three 
personality and two environmental background 
factors. The three personality factors were tenta- 
tively labeled “Psychopathic Delinquency” (PD), 
“Neurotic Delinquency” (ND), and “Inadequate 
Delinquency” (ID). 

This study sought further clarification of the 
meaning and validity of these three personality 
factors by investigating their relationships to 
variables logically related to them. 

Two groups of subjects (Ss) were used. The 
first group consisted of 170 consecutive admis- 
sions to the Illinois State Training School for 
Boys while the second group was composed of 
93 boys incarcerated in the Tennessee State Vo- 
cational Training School. Approximately one-half 
of this latter group were new admissions whereas 
the remainder had been institutionalized for pe- 
riods of one month to one year. Both groups were 
administered the factor scales, while the Ten- 
nessee sample was administered, in addition, the 
children’s form of the Taylor Manifest Anxiety 
Scale (CMAS). 

For Group 1, scores on each of the three fac- 
tor scales were related, by appropriate correla- 
tional methods, to: (a) type of offense leading 
to commitment, (b) problem behavior in the in- 
stitution, (c) length of incarceration, (d) re- 
cidivism, and (e) intelligence as measured by the 
Otis test. For the second group, in which in- 
formation in regard to the above variables was 
unavailable, factor scores were related only to 
CMAS. 


1 An extended report of this study may be ob- 
tained without charge from H. C. Quay (Department 
of Psychology, Vanderbilt University; Nashville, 
Tennessee) or for a fee from the American Docu- 
mentation Institute. Order Document No. 6407 from 
ADI Auxiliary Publications Project, Photoduplica- 
tion Service, Library of Congress; Washington 25, 
D. C., remitting in advance $1.25 for microfilm or 
$1.25 for photocopies. Make checks payable to: 
Chief, Photoduplication Service, Library of Congress. 


Results indicated that the PD factor was 
significantly related to commitment for crime 
against the person (r_pbi = .14, p = .05), recidi- 
vism (r_pbi = .16, p = .05), problem behavior 
while incarcerated (r_pbi = .16, p = .05), and 
longer institutionalization (r = .19, p = .01), but 
was not significantly related to intelligence or 
CMAS. The ND factor related significantly only 
to CMAS (r = .52, p = .01). ID related signifi- 
cantly to intelligence (r = -.23, p = .01) and 
to longer institutionalization (r = .16, p = .05). 

While the correlations were rather uniformly 
of low magnitude, they were, with but one ex- 
ception, in keeping with the generally accepted 
meanings of the factors. The unexpected relation- 
ship of ID to length of institutionalization sug- 
gests a failure to adjust to institutional routine 
based on ineptness rather than a pattern of rule- 
violating psychopathy. 

The results are interpreted as providing em- 
pirical support for the applicability of factor la- 
bels originally arrived at through analysis of item 
content. 
MANIFEST ANXIETY AND SOCIAL BEHAVIOR 
Although various theories of anxiety have em- 
phasized the relationship between anxiety and 
negatively evaluated social behavior, relatively 
little research has been focused on this problem. 
The purpose of this study, therefore, was to test 
the hypotheses that persons who report high anx- 
iety, in comparison to those who report low anx- 
iety, are described by others as more withdrawn, 
inhibited, hostile, unconscientious, conceited, and 
selfish. 

The procedure involved collecting the following 
data: (a) self-reports of anxiety, (b) ratings of 
the social behavior of each subject (S) by per- 
sons with whom each S lived. The hypotheses 
[...] of Ss in the upper and lower quartiles of the 
total sample on the scale of manifest anxiety. 

The Ss were 157 male college students who 
were members of three fraternities [...] 
the fraternity brothers assigned to his particu- 
lar subgroup on six eight-step scales: friendly- 
withdrawn, frank-inhibited, good-natured-hostile, 
conscientious-unconscientious, modest-conceited, 
generous-tight. Brief definitions were given of 
each point on the scales. 

The distribution of social behavior ratings was 
skewed toward the positively evaluated end of 
each scale; however, the high anxious Ss, in com- 
parison to the low anxious Ss, were described by 
their peers as less frank, less good-natured, less 
conscientious, and less generous. The differences 
on these scales were significant beyond the .05 
level. The differences between the high and low 
anxious groups were not significant on two scales: 
friendly-withdrawn and modest-conceited. 

In general, high anxious Ss were described less 
positively than were low anxious Ss. Theoretical 
interpretation of these results, however, is lim- 
ited by the fact that the data essentially represent 
correlations between reported anxiety and rat- 
ings of social behavior. Obviously, an antecedent- 
consequent relationship between anxiety and so- 
cial behavior cannot be inferred; nevertheless, 
this research supports the general thesis that 
manifest anxiety and various kinds of social be- 
havior covary, and the task of future research is 
to clarify the nature of these relationships. 
AN EMPIRICAL EVALUATION OF A TEST OF 
ROLE-PLAYING ABILITY 1 


ZANWIL SPERBER 
Children’s Hospital, Philadelphia 


Despite the importance of the role-playing 
ability (RPA) concept and related concepts (em- 
pathy, social sensitivity, etc.), little empirical re- 
search has been done relating RPA to other per- 
sonality variables; to performance in relevant 
professions (e.g., psychotherapist); or to differ- 
ences in communication, perception, and other 
functional skills. Hitherto, most researchers on 
these variables have used a clumsy cross-predic- 
tion design, where the need for comparison data 
restricts applications to a relatively small num- 
ber of Ss who have had an opportunity to ob- 
serve each other. McClelland (1951) recognized 
the potential value of a more generally applicable 
measure and developed a 32-item questionnaire 
as a test of RPA. It is the purpose of the pres- 
ent study to replicate McClelland’s developmen- 
tal procedure and assess the validity of his 
measure. 

McClelland selected items for his test in terms 
of their capacity to discriminate Ss rated high 
and low on criterion scales considered relevant 
to this ability. Ss were rated for social intelli- 
gence, reaction sensitivity, ability to predict 
behavior, rigidity, and self-objectivity. A sixth 
scale, RPA, was discarded when his Ss claimed 
not to have information on which to base ratings. 


1 This study was conducted at UCLA, supported [...] 

In the present study, the 30 students in E’s 
UCLA course in psychological interviewing filled 
out the RPA test early in the semester. Students 
also met weekly in role-playing laboratory groups 
throughout the semester. In the last session each 
S rated himself and the three to five other mem- 
bers of his laboratory group on all six criterion 
scales. RPA test scores were separately corre- 
lated with both self-ratings and the average of 
the other lab group members for the sum of rat- 
ings on all six criterion scales, the five scales Mc- 
Clelland used, and the RPA scale alone. Con- 
trary to expectation almost half the correlations, 
including the only significant one, were negative. 

In two class sessions towards the end of the 
semester two male sophomore volunteers, J and 
M, were interviewed separately. J and M then 
rank ordered 16 adjectives descriptive of inter- 
personal behavior in terms of how they 
had just behaved in the interview, and their gen- 
eral social behavior. Immediately after observing 
the interview Ss were asked to predict how J and 
M ranked the adjectives. In addition Ss predicted 
J’s and M’s responses to a 100-item sentence 
completion test presented in a five-alternative 
multiple-choice format. These three cross-predic- 
tion tasks served as criteria against which to vali- 
date the RPA test. For both J and M on all three 
criterion tasks the correlations between RPA 
score and cross-predictions were all of zero or- 
der, and many were negative. It was concluded 
that the questionnaire is not a valid measure of 
RPA. 
Studies of the generality of goal setting be- 
havior (level of aspiration) have reported cor- 
relations ranging from almost zero to as high as 
.93. Since many of these studies have regarded 
goal setting as a nonsituational dispositional vari- 
able, the failure to find any consistent generality 
has frequently been attributed to the unreliability 
of the test. Another possibility is that there is 
less generality to behavior than has been as- 
sumed by theorists relying on broad dispositional 
concepts. It is possible to regard situational vari- 
ables, not as sources of error which result in un- 
reliable measurement, but as necessary variables 
to take into account in any prediction of behav- 
ior. The present study was concerned with the 
use of instructions as a means of defining the 
psychological situation to the subject (S). It was 
hypothesized that with the physical similarity of 
[...] correlation between two tasks would be [...] 
definitions of the [...] 

Sixty-six college students, 27 males and 39 fe- 
males, from an introductory psychology class 
served as Ss. Each S was given eight trials on 
each of two level of aspiration tasks and was re- 
quired to predict his score prior to each trial. 
The first task required S to record as many words 
as he could think of beginning with each of eight 
letters orally presented by the experimenter (E). 
Following each trial a weighted score was re- 
ported to S based on a scale from 1 to 100. All 
Ss received the same predetermined scores. The 
second task consisted of sorting a deck of 100 
playing cards into a four compartment box ac- 
cording to suit. The score was the number cor- 
rectly sorted in 30 seconds. All Ss were again 
given the same scores by the E calling time when 
the appropriate number of sortings had been 
achieved. Scores on the second task were a con- 
stant three points higher by trial than those 
given on the first task. The instructions for 
Task I were the same for all Ss and were to the 
effect that they were to take a short-form in- 
telligence test which seemed to correlate well 
with other intelligence tests and with academic 
achievement. The instructions for Task II varied 
depending on the group. Alternate Ss were told 
that the second task was another short-form in- 
telligence test or that it was a motor-speed test 
which seemed to correlate well with things requir- 
ing speed and manual dexterity. 

The product-moment correlation for Group I 
between mean expectancy statements for Trials 2-8 
on the two tasks was .88. The corresponding cor- 
relation for Group II was .62. Both correlations 
are significant at less than the .001 level and the 


difference between them is significant at less than 
the .002 level. Under [...]mize generality, with [...] 
formance contr[...] ancy statements [...] 
chological situati[...] 